JavLibraryCrawler

This project lets you scrape all movies from javLibrary. It crawls the following items:

It also downloads each cover image locally and generates the corresponding thumbnail. You can configure the image settings in the project settings.

The tutorial for the image settings can be found here.
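As a rough illustration of what such image settings can look like, here is a minimal sketch that assumes the project relies on Scrapy's built-in `ImagesPipeline` (this README does not confirm that). The setting names are standard Scrapy settings; the store path, thumbnail sizes, pipeline priority, and expiry value are illustrative assumptions, not the project's actual configuration.

```python
# settings.py (sketch only; assumes Scrapy's built-in ImagesPipeline,
# which requires the Pillow package to be installed)

# Enable the image pipeline. The priority value 1 is an illustrative choice,
# and a real project may list additional pipelines here.
ITEM_PIPELINES = {
    "scrapy.pipelines.images.ImagesPipeline": 1,
}

# Directory where downloaded cover images are stored (assumed path).
IMAGES_STORE = "src/images"

# Thumbnails to generate for each image: name -> (width, height).
# These sizes are hypothetical examples.
IMAGES_THUMBS = {
    "small": (50, 50),
    "big": (270, 270),
}

# Skip re-downloading images fetched within the last 90 days (example value).
IMAGES_EXPIRES = 90
```

With these settings, Scrapy writes full-size images under `full/` and thumbnails under `thumbs/<name>/` inside `IMAGES_STORE`.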

##Install

pip install -r requirements.txt

##Run

This project contains two types of crawlers.

To start a crawler, run one of the commands below.

Crawl only the best-rated movies (500 movies):

scrapy crawl best_rated_spider

Or crawl all movies in the library (more than 150,000 movies; the cover images total around 16 GB):

scrapy crawl actor_spider
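If you prefer to launch the crawlers from a script, the commands above can be wrapped in a small helper. This is only a sketch: `build_crawl_command`, `run_crawl`, and the `movies.json` file name are hypothetical names introduced here, while the spider names and the `scrapy crawl` command come from this README and `-o` is Scrapy's standard option for writing scraped items to a file.

```python
import subprocess

# Spider names taken from the commands in this README.
SPIDERS = ("best_rated_spider", "actor_spider")


def build_crawl_command(spider, output_file=None):
    """Build the `scrapy crawl` argument list for one of the two spiders."""
    if spider not in SPIDERS:
        raise ValueError(f"unknown spider: {spider!r}")
    command = ["scrapy", "crawl", spider]
    if output_file is not None:
        # -o tells Scrapy to export the scraped items to this file.
        command += ["-o", output_file]
    return command


def run_crawl(spider, output_file=None):
    """Run the crawl from the project root; raises if Scrapy exits with an error."""
    subprocess.run(build_crawl_command(spider, output_file), check=True)
```

For example, `run_crawl("best_rated_spider", "movies.json")` would run the smaller crawl and export the scraped items to `movies.json`. It must be called from the directory that contains `scrapy.cfg`.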

##Credit

This project uses Scrapy to build the crawlers.
