Science4Cast-SanatiesFinest

Scientific publications in field of AI can be viewed as an exponentially growing dynamic graph where vertices represent different concepts and each edge represents the first time two concepts got discussed and connected in a publication. Here, we demonstrate a model to predict future links in this graph based on basic and temporal properties of its vertices. We train various gradient boosting models on this data. Also, we employ dataset augmentation to make these models order-invariant. Finally, we post-process results of individual models to boost our final prediction. Our model won the 3rd place in the Science4Cast competition.

Requirements

Python: 3.6+

To install required packages:

pip install -r requirements.txt

Reproducibility

First, download the dataset from IARAI website at exiting directory. Then run provide python script named boosting_train.py taking the following arguments to regenerate our top-5 submissions:

python boosting_train.py --depth=5 --estimators=400 --features=cn_pr_and_ja --lr=0.05 --minSampleSplit=14 --sampleLeaf=3 --samples=450000 --subsamples=0.5

python boosting_train.py --depth=5 --estimators=400 --features=pr_cn_shp_and_ccpa --lr=0.05 --minSampleSplit=18 --sampleLeaf=5 --samples=400000 --subsamples=0.7

python boosting_train.py --depth=4 --estimators=500 --features=pr_cn_shp_and_ja --lr=0.05 --minSampleSplit=5 --sampleLeaf=5 --samples=450000 --subsamples=0.5

python boosting_train.py --depth=5 --estimators=300 --features=cn_shp_and_ja_ccpa --lr=0.05 --minSampleSplit=18 --sampleLeaf=3 --samples=400000 --subsamples=0.7

python boosting_train.py boosting_train.py --borj2017=8 --depth=5 --estimators=500 --features=cn_shp_and_ja_ccpa --lr=0.05 --negRatio=2 --samples=400000

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
simple_model		simple_model
unsupervised_encoders		unsupervised_encoders
README.md		README.md
Science4Cast_tutarial.ipynb		Science4Cast_tutarial.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Science4Cast-SanatiesFinest

Requirements

Reproducibility

About

Releases

Packages

Languages

MiladInk/Science4Cast-SanatiesFinest

Folders and files

Latest commit

History

Repository files navigation

Science4Cast-SanatiesFinest

Requirements

Reproducibility

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages