forked from syuchiu/PythonSpider
lxb321 committed Jul 27, 2019 (1 parent: a560791, commit d35d4b4)
Showing 1 changed file with 36 additions and 0 deletions.
@@ -0,0 +1,36 @@
# PythonSpider

![](https://img.shields.io/pypi/v/nine.svg?color=red&label=spider&logo=bird&logoColor=green&style=flat-square)

| Author | Lxb |
| --- | --- |
| WeChat | lxbryz |
| Blog | https://www.jianshu.com/u/8524376e970d |
| Introduction | Data decryption, anti-scraping countermeasures, CAPTCHAs |

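## 1. Anti-Scraping Cases

| Time | Summary | Weakness | Status | Folder |
| --- | --- | --- | --- | --- |
| 2019-03 | Distributed crawl of all Dangdang products and reviews with scrapy-redis | Products and reviews are crawled at noticeably different speeds | Working | dangdang |
| 2019-04 | JS debugging and simulated login to Zhihu to crawl users and Q&A | No way to tell whether the user crawl is complete | Working | zhihu |
| 2019-05 | Font-based anti-scraping series covering 58, Autohome, and Qidian | Custom font libraries are too large to map by hand conveniently, and OCR accuracy is low | Working | FontDecode |
| 2019-05 | Font and SVG anti-scraping on Dianping detail pages | No cookie pool yet; the redirect workaround needs improvement | Stopped working on 2019/7/10 | FontDianPing |
| 2019-07 | Baidu/Google/Youdao translation | None so far | Working | Translate |
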
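
The manual font mapping mentioned in the table can be sketched roughly as follows. This is only an illustration and not the code in the `FontDecode` folder: the private-use code points and the digits they stand for in `GLYPH_MAP` are made up, and the function name `decode` is hypothetical. Each real site ships its own font file, so the mapping would have to be rebuilt for every font.

```python
# Hypothetical glyph map: private-use code points -> the characters they
# are drawn as. These values are invented for illustration; a real map
# must be derived from the site's own font file.
GLYPH_MAP = {
    "\ue001": "0",
    "\ue002": "1",
    "\ue003": "2",
    "\ue004": "3",
    "\ue005": "9",
}

# str.maketrans accepts a dict whose keys are single characters.
TABLE = str.maketrans(GLYPH_MAP)


def decode(text, table=TABLE):
    """Replace obfuscated glyph code points with readable characters."""
    return text.translate(table)


if __name__ == "__main__":
    obfuscated = "\ue003\ue001\ue002\ue005"
    print(decode(obfuscated))  # prints 2019
```

Characters that are not in the map pass through unchanged, which makes unmapped glyphs easy to spot in the output. This also shows why the weakness column complains about large fonts: every glyph needs its own entry.
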
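## Additional Notes

A friend pointed out that because I keep several projects in one repository, anyone who needs just one of them has to clone everything. For that case I recommend the Chrome extension GitZip for github: with the extension enabled, double-click the space after the folder you want to download, as shown below:
![](http://pulys00v8.bkt.clouddn.com/gitzip.png)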
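
For anyone who would rather not install a browser extension, git's own sparse checkout is another way to fetch a single folder. This is a different technique from GitZip, shown here only as a rough sketch: it assumes git 2.25 or newer and a server that supports partial clone, and `REPO_URL`, `sparse_clone_commands`, and `sparse_clone` are hypothetical names, with `OWNER` standing in for the real account.

```python
import subprocess

# Placeholder URL: replace OWNER with the account that hosts the repository.
REPO_URL = "https://github.com/OWNER/PythonSpider.git"


def sparse_clone_commands(repo_url, folder, dest):
    """Build the git commands that check out only `folder` into `dest`."""
    return [
        # Clone without file contents and with sparse checkout enabled.
        ["git", "clone", "--filter=blob:none", "--sparse", repo_url, dest],
        # Restrict the working tree to the requested folder.
        ["git", "-C", dest, "sparse-checkout", "set", folder],
    ]


def sparse_clone(repo_url, folder, dest):
    """Run the commands; raises CalledProcessError if git fails."""
    for cmd in sparse_clone_commands(repo_url, folder, dest):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Only print the commands here so the example has no side effects.
    for cmd in sparse_clone_commands(REPO_URL, "FontDecode", "PythonSpider"):
        print(" ".join(cmd))
```

Calling `sparse_clone(REPO_URL, "FontDecode", "PythonSpider")` would actually run the two commands and leave only the `FontDecode` folder (plus top-level files) in the working tree.
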