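1. Runtime environment: Python 3 on Ubuntu.
2. This folder contains the source files for a crawler that scrapes articles from Caixin (财新网).
3. The main spider source file is caixin (redis) (copy)/caixin/spiders/caixincrawl.py
4. The artcontent.json and artinformation.json files in caixin (redis) (copy)/caixin/spiders contain the crawled data (incomplete).
5. caixin (redis) (copy)/caixin/cookie.py logs in to Caixin and retrieves the session cookies.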
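
Because the bundled data files are incomplete, reading them with a tolerant loader may be more convenient than a plain json.load. The following is only a minimal sketch, not part of the crawler itself: the helper name load_items is hypothetical, and it assumes the files hold either a single JSON array or one JSON object per line. The actual layout and field names of artcontent.json and artinformation.json are not documented here, so the code makes no assumption about them.

```python
import json
from pathlib import Path


def load_items(path):
    """Load scraped items from a JSON array file or a JSON Lines file.

    Hypothetical helper: the real file layout is an assumption.
    Lines that cannot be parsed (e.g. a truncated last record) are skipped.
    """
    text = Path(path).read_text(encoding="utf-8").strip()
    if not text:
        return []
    try:
        # Case 1: the whole file is valid JSON (an array or a single object).
        data = json.loads(text)
        return data if isinstance(data, list) else [data]
    except json.JSONDecodeError:
        # Case 2: one object per line, possibly inside an unterminated array.
        items = []
        for line in text.splitlines():
            line = line.strip().rstrip(",")
            if not line or line in ("[", "]"):
                continue
            try:
                items.append(json.loads(line))
            except json.JSONDecodeError:
                continue  # skip a truncated or malformed record
        return items


if __name__ == "__main__":
    # Path taken from the file list above; adjust it to your checkout.
    data_file = Path("caixin (redis) (copy)/caixin/spiders/artcontent.json")
    if data_file.exists():
        print(len(load_items(data_file)), "items loaded")
```

Skipping unparseable lines silently is a deliberate simplification; for real analysis you may prefer to count or log the skipped records.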