Skip to content

Commit

Permalink
urlnormalizer
Browse files Browse the repository at this point in the history
  • Loading branch information
hujunxianligong committed Jul 25, 2015
1 parent dfa581d commit bf5d99c
Show file tree
Hide file tree
Showing 3 changed files with 1,389 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,3 +9,4 @@ nutcher是中文的nutch文档,包含nutch的配置和源码解析,在github

+ [Nutch教程——导入Nutch工程,执行完整爬取](http://nutcher.org/book/articles/run_nutch_in_ide.html)
+ [Nutch流程控制源码详解(bin/crawl中文注释版)](http://nutcher.org/book/code/bin-crawl.html)
+ [URLNormalizer源码详解(Nutch的URL正规化机制)](http://nutcher.org/book/articles/urlnormalizer.html)
Loading

0 comments on commit bf5d99c

Please sign in to comment.