jiayunz/GitHub_Crawler


GitHub Crawler

A crawler used to retrieve the information of GitHub users using GitHub official APIs.

Copyright (c) 2018 by Jiayun Zhang, Fudan University ([email protected])

Installation

We have tested the crawler on macOS High Sierra 10.13.1. Make sure that Python 3.6.7 and the Requests library are installed (pip install requests).

Usage

  • Change to the crawler directory.

  • Fill in your GitHub authorization token in config.py.

  • Run the crawler from the command line: python main.py TOTAL_USER OUTPUT_PATH

    TOTAL_USER is the total number of users you want to retrieve.

    OUTPUT_PATH is the path of the file where the data will be stored.

    Example: python main.py 10 data.txt
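
For readers curious about what a run like the one above involves, here is a minimal sketch of paging through GitHub's public "list users" endpoint until TOTAL_USER entries are collected. It is not the code in main.py. The function names (fetch_page, collect_users, main), the way the token is passed in, and the one-JSON-object-per-line output are all assumptions made for illustration.

```python
import json

# GitHub REST endpoint that lists users in ascending id order.
API_URL = "https://api.github.com/users"


def fetch_page(token, since, per_page=100):
    """Fetch one page of users whose id is greater than `since`."""
    import requests  # imported lazily so collect_users works without it

    response = requests.get(
        API_URL,
        headers={"Authorization": f"token {token}"},
        params={"since": since, "per_page": per_page},
        timeout=30,
    )
    response.raise_for_status()
    return response.json()


def collect_users(total_user, fetch):
    """Call `fetch(since)` repeatedly until `total_user` users are gathered."""
    users, since = [], 0
    while len(users) < total_user:
        page = fetch(since)
        if not page:  # the API returned no further users
            break
        users.extend(page)
        since = page[-1]["id"]  # the next page starts after the last id seen
    return users[:total_user]


def main(argv, token):
    """Hypothetical entry point mirroring: python main.py TOTAL_USER OUTPUT_PATH."""
    total_user, output_path = int(argv[1]), argv[2]
    users = collect_users(total_user, lambda since: fetch_page(token, since))
    with open(output_path, "w", encoding="utf-8") as handle:
        for user in users:
            # Assumed layout: one JSON object per line.
            handle.write(json.dumps(user) + "\n")
```

Passing the page-fetching function into collect_users keeps the paging logic independent of the network, so it can be exercised with a stub instead of real API calls.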

Data

The crawler will collect the following information:

Each user entry is stored in JSON format.
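
As a rough illustration of consuming the output, the sketch below loads user entries from the output file. It assumes the file holds one JSON object per line, which this README does not confirm. The helper name load_users is hypothetical.

```python
import json


def load_users(path):
    """Read user entries from `path`, assuming one JSON object per line."""
    users = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                users.append(json.loads(line))
    return users
```

If the file instead holds a single JSON array, json.load on the open file would be the simpler choice.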

License

See the LICENSE file for license rights and limitations (MIT).
