Stars
TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learners in level B, and 250 sentences with a native speaker's si…
Little Ball of Fur - A graph sampling extension library for NetworKit and NetworkX (CIKM 2020)
Exploring the Sercomm made router of Cosmote - OTE Group (Deutsche Telekom in Greece)
Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'.
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
A curated list of gradient boosting research papers with implementations.
Multi-relational Poincaré Graph Embeddings
A repository of pretty cool datasets that I collected for network science and machine learning research.
TuckER: Tensor Factorization for Knowledge Graph Completion
An open source framework for seq2seq models in PyTorch.
Pweave is a scientific report generator and a literate programming tool for Python. It can capture the results and plots from data analysis and works well with numpy, scipy and matplotlib.