most powerful spider system in python!
demo code: gist:9424801
- python2.7
pip install -r requirements.txt
./run.py
, visit http://localhost:5000/
build:
docker build -t pyspider .
run:
# mysql
docker run -it -d --name rabbitmq dockerfile/mysql
# rabbitmq
docker run -it -d --name rabbitmq dockerfile/rabbitmq
# scheduler
docker run -it -d --name scheduler --link mysql:mysql --link rabbitmq:rabbitmq pyspider scheduler
# fetcher, run multiple instance if needed.
docker run -it -d --link mysql:mysql --link rabbitmq:rabbitmq pyspider fetcher
# processor, run multiple instance if needed.
docker run -it -d --link mysql:mysql --link rabbitmq:rabbitmq pyspider processor
# webui
docker run -it -d -P 5000:5000 --link mysql:mysql --link rabbitmq:rabbitmq --link scheduler:scheduler pyspider webui
- 部署使用,提交 bug、特性 Issue
- 参与 特性讨论 或 完善文档
- 我正在进行 Bugfix and Basic Features 的第二个里程碑开发。欢迎发 pull request (代码、注释和提交日志请用英文)
Licensed under the Apache License, Version 2.0