Crawl-Unsplash-CL

A simple Node.js command-line tool to scrape images from https://unsplash.com. If you find a bug or want to suggest a feature, feel free to contact me or contribute.

Install

Download or clone the repository: https://github.com/raphaelmayer/crawl-unsplash-cl.git
Navigate to the directory: cd yourpath/crawl-unsplash-cl
In your shell of choice, type npm install to install the dependencies.
Done. Now go and get some images!

Usage

From the directory of the repository run: node crawl-unsplash <parameter> <query> <pages>

Parameter

-j: Download a JSON file containing data and links to images.
-d: Download a JSON file and all images.
-l: Print the data to the console without downloading any files.

Parameter is required.

Query

Query has to be a string and is required.
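
As a rough illustration of the rules above, the argument handling could be sketched as follows. This is not the code from crawl-unsplash.js: the names MODES and parseCliArgs, the shape of the returned object, and the error messages are all assumptions made for this example.

```javascript
// Hypothetical mapping of the three documented flags to what each one does.
const MODES = new Map([
  ['-j', { saveJson: true, saveImages: false, printToConsole: false }],
  ['-d', { saveJson: true, saveImages: true, printToConsole: false }],
  ['-l', { saveJson: false, saveImages: false, printToConsole: true }],
]);

// Hypothetical helper: validates <parameter> and <query>, passes <pages> through.
function parseCliArgs(argv) {
  const [flag, query, ...pageArgs] = argv;
  if (!MODES.has(flag)) {
    throw new Error('parameter is required and must be one of -j, -d, -l');
  }
  if (typeof query !== 'string' || query.length === 0) {
    throw new Error('query is required');
  }
  return { flag, ...MODES.get(flag), query, pageArgs };
}

// In a real script the input would be process.argv.slice(2),
// which drops the node binary and the script path.
console.log(parseCliArgs(['-d', 'wanderlust']));
```

Keeping the flags in a single lookup table means an unknown or missing parameter fails early with one clear message.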

Pages

Define the number of pages to be fetched. One page is 30 images. You may split the pages argument into two separate arguments to define a start and an end page. The end page is included. To fetch one specific page, e.g. page 2, use 2 2.

Pages are optional and if omitted default to 1.
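
The page rules can be written down as a small function. This is a minimal sketch, not the tool's actual implementation: resolvePageRange, its return shape, and the validation errors are hypothetical, and maxImages is only an upper bound because the last page of a search may hold fewer than 30 images.

```javascript
// Hypothetical helper: turns the optional <pages> arguments into an inclusive range.
function resolvePageRange(pageArgs) {
  const IMAGES_PER_PAGE = 30; // stated above: one page is 30 images

  // No pages argument: default to page 1 only.
  if (pageArgs.length === 0) {
    return { start: 1, end: 1, maxImages: IMAGES_PER_PAGE };
  }

  const numbers = pageArgs.map((arg) => Number.parseInt(arg, 10));
  if (numbers.length > 2 || numbers.some((n) => !Number.isInteger(n) || n < 1)) {
    throw new Error('pages must be one or two positive integers');
  }

  // One argument N means pages 1..N; two arguments mean start..end, end included.
  const [start, end] = numbers.length === 1 ? [1, numbers[0]] : numbers;
  if (end < start) {
    throw new Error('end page must not be smaller than start page');
  }

  return { start, end, maxImages: (end - start + 1) * IMAGES_PER_PAGE };
}

console.log(resolvePageRange([]));           // no pages given
console.log(resolvePageRange(['10']));       // pages 1 to 10
console.log(resolvePageRange(['10', '20'])); // pages 10 to 20
console.log(resolvePageRange(['2', '2']));   // only page 2
```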

Examples

node crawl-unsplash -d wanderlust fetches the first page and downloads all images and the JSON file.

node crawl-unsplash -l wanderlust 10 fetches pages 1 to 10 and logs JSON to console.

node crawl-unsplash -j wanderlust 10 20 fetches pages 10 to 20 and downloads the JSON file.

Images and JSON lists are kept in order. However, fetching additional pages at a later point without clearing the downloads directory will most likely produce inconsistent results, because the result pages change frequently: page 1 today might be page 10 tomorrow.

Concurrent downloads are currently not limited, although it might make sense to implement this in some form.
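
One possible way to cap concurrency is a small worker pool that keeps only a fixed number of downloads in flight. The sketch below is an assumption about how this could look, not part of the tool: mapWithLimit and fakeDownload are hypothetical names, and the download is simulated with a timer instead of a real HTTP request.

```javascript
// Runs `worker` over `items` with at most `limit` calls in flight at once.
// Results are returned in the same order as the input items.
async function mapWithLimit(items, limit, worker) {
  const results = new Array(items.length);
  let next = 0; // index of the next item that has not been started yet

  // Each lane keeps taking the next unstarted item until none are left.
  async function runLane() {
    while (next < items.length) {
      const index = next++;
      results[index] = await worker(items[index], index);
    }
  }

  const laneCount = Math.max(1, Math.min(limit, items.length));
  await Promise.all(Array.from({ length: laneCount }, () => runLane()));
  return results;
}

// Stand-in for a real download; a real worker would fetch the image and write it to disk.
let active = 0;
let peak = 0;
async function fakeDownload(url) {
  active++;
  peak = Math.max(peak, active);
  await new Promise((resolve) => setTimeout(resolve, 10));
  active--;
  return `saved ${url}`;
}

const urls = Array.from({ length: 30 }, (_, i) => `image-${i + 1}`);
mapWithLimit(urls, 5, fakeDownload).then((saved) => {
  console.log(saved.length, 'downloads finished, peak concurrency:', peak);
});
```

Because results are stored by index, the output order matches the input order even when individual downloads finish out of order.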

Contributions

Contributions are welcome. Concisely state the problem and how you solved it. Do not fix multiple things in one PR. If you find something else to fix while working on the code, please open a separate PR to address that issue as well.
