GitHubCompanyInfluence

This is a project about predicting the influence of companies, using the performance on GitHub: h index.

If the amount of api calls is limited, you can use your own token. (The token creation method is written in common.) If the code runs breaks, you can use a cache.

no2_hist.py plots the company's social network, together with bars for each centrality metric. (Use common contributors as a basis for connecting edges.)

no1_plot_roc.py is a prediction model using 11 features and XGBoost, and draw the roc curve.
no3_svm_f1.py is a prediction model using 11 features and svm (takes a long time to run, please be patient)
no4_mlp_f1.py is a prediction model using 11 features and mlp.
Crawled data for all companies is stored in json format in data_jsons.zip
All the output images are stored in output.

This project crawled 486 companies' information, if you only want to use some of the companies data, you can modify in data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GitHubCompanyInfluence

This is a project about predicting the influence of companies, using the performance on GitHub: h index.

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
__pycache__		__pycache__
data		data
output		output
README.md		README.md
code-for_data_json(1).zip		code-for_data_json(1).zip
common.py		common.py
data_jsons.zip		data_jsons.zip
no1_plot_roc.py		no1_plot_roc.py
no2_hist.py		no2_hist.py
no3_svm_f1.py		no3_svm_f1.py
no4_mlp_f1.py		no4_mlp_f1.py

MingyuAmy/GitHubCompanyInfluence

Folders and files

Latest commit

History

Repository files navigation

GitHubCompanyInfluence

This is a project about predicting the influence of companies, using the performance on GitHub: h index.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages