Skip to content

A spider to get all the articles and images from 22/7 members' blogs and convert them into markdown files that can be used in hexo.

License

Notifications You must be signed in to change notification settings

227WiKi/blog_spider

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

22/7 Spider


About

A spider to get all the articles and images from 22/7 members' blogs and convert them into markdown files that can be used in hexo.

Preview of markdown files in 22/7 WiKi blog

All the markdown files can be found in this repo, check them by searching in the folders, most of the file names are renamed by md5, please check the file name at the link suffix on the 22/7 WiKi blog

Updated from 227-blog-generator

Important

Due to the change of the domain, the script is no longer suitable to grab the latest contents. The new version will be a general spider to grab the contents from the official blog.

Updates

  • 天城サリー
  • 河瀬詩
  • 宮瀬玲奈
  • 西條和
  • 白沢かなえ
  • 涼花萌
  • 雨夜音
  • 清井美那
  • 麻丘真央
  • 望月りの
  • 相川奈央
  • 椎名桜月
  • 四条月
  • 月城咲舞

Require

  • Python >= 3.8
  • requests
  • BeautifulSoup

Usage

create the folder named by members' names and a folder called images inside it.

python -m spider_[replace with member's name].py

You may need to change the number of pages the program crawls at a time, which may cause the program to crash.

License

GPL V3.0

About

A spider to get all the articles and images from 22/7 members' blogs and convert them into markdown files that can be used in hexo.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages