matthewchangkit/best-of-ml-python

Best of Machine Learning with Python

🏆  A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 780 awesome open-source projects with a total of 2.8M stars grouped into 32 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
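The totals above and the per-project metrics further down use abbreviated counts such as 2.8M or 150K. If you want to compare or sort those figures yourself, a minimal sketch like the following could turn them back into approximate integers. It assumes K means thousand and M means million; `parse_count` is a hypothetical helper written for this illustration, not part of the list's own tooling.

```python
from decimal import Decimal

# Assumption: K = thousand and M = million, as the rounded figures in this list suggest.
SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_count(text: str) -> int:
    """Convert an abbreviated count such as "2.8M" into an approximate integer."""
    text = text.strip()
    # Look at the last character to decide whether a suffix is present.
    multiplier = SUFFIXES.get(text[-1:].upper(), 1)
    digits = text[:-1] if multiplier != 1 else text
    # Decimal avoids binary floating-point rounding when scaling values like 7.3.
    return int(Decimal(digits) * multiplier)


if __name__ == "__main__":
    for raw in ("2.8M", "150K", "620"):
        print(raw, parse_count(raw))
```

Because the published figures are already rounded, the parsed values are only as precise as the abbreviation allows.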

Explanation

  • 🥇🥈🥉  Combined project-quality score
  • ⭐️  Star count from GitHub
  • 🐣  New project (less than 6 months old)
  • 💤  Inactive project (6 months without activity)
  • 💀  Dead project (12 months without activity)
  • 📈📉  Project is trending up or down
  • ➕  Project was recently added
  • ❗️  Warning (e.g. missing/risky license)
  • 👨‍💻  Contributors count from GitHub
  • 🔀  Fork count from GitHub
  • 📋  Issue count from GitHub
  • ⏱️  Last update timestamp on package manager
  • 📥  Download count from package manager
  • 📦  Number of dependent projects
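
The new, inactive, and dead markers in the list above are simple age thresholds, so they can be sketched as a small function. This is only an illustration under stated assumptions: a month is approximated as 30 days, inactivity takes precedence over the "new" marker, and `status_marker` is a hypothetical name. The tooling that actually generates this list may count differently.

```python
from datetime import date, timedelta
from typing import Optional

# Assumption: one month is approximated as 30 days.
DAYS_PER_MONTH = 30


def status_marker(created: date, last_activity: date, today: date) -> Optional[str]:
    """Return "dead", "inactive", "new", or None, following the legend's thresholds."""
    idle = today - last_activity
    age = today - created
    if idle >= timedelta(days=12 * DAYS_PER_MONTH):
        return "dead"      # 12 months without activity
    if idle >= timedelta(days=6 * DAYS_PER_MONTH):
        return "inactive"  # 6 months without activity
    if age < timedelta(days=6 * DAYS_PER_MONTH):
        return "new"       # less than 6 months old
    return None


if __name__ == "__main__":
    # Dates in this list are written DD.MM.YYYY; the creation dates here are made up.
    today = date(2020, 12, 27)
    print(status_marker(date(2015, 1, 1), date(2020, 3, 31), today))
    print(status_marker(date(2015, 1, 1), date(2019, 12, 13), today))
```

Passing `today` explicitly keeps the function deterministic, which makes it easy to check against the last-update dates shown next to each project.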

Machine Learning Frameworks

General-purpose machine learning and deep learning frameworks.

Tensorflow (🥇44 · ⭐ 150K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub (👨‍💻 3.5K · 🔀 84K · 📦 110K · 📋 30K - 14% open · ⏱️ 26.12.2020):

    git clone https://github.com/tensorflow/tensorflow
  • PyPi (📥 7.3M / month · 📦 23K · ⏱️ 14.12.2020):

    pip install tensorflow
  • Conda (📥 2.2M · ⏱️ 15.07.2020):

    conda install -c conda-forge tensorflow
scikit-learn (🥇41 · ⭐ 44K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub (👨‍💻 2.1K · 🔀 21K · 📥 620 · 📦 180K · 📋 8.9K - 25% open · ⏱️ 22.12.2020):

    git clone https://github.com/scikit-learn/scikit-learn
  • PyPi (📥 10M / month · 📦 38K · ⏱️ 22.12.2020):

    pip install scikit-learn
  • Conda (📥 6.1M · ⏱️ 22.12.2020):

    conda install -c conda-forge scikit-learn
PyTorch (🥇39 · ⭐ 45K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
  • GitHub (👨‍💻 2.4K · 🔀 12K · 📦 56K · 📋 19K - 36% open · ⏱️ 25.12.2020):

    git clone https://github.com/pytorch/pytorch
  • PyPi (📥 1.8M / month · 📦 6.7K · ⏱️ 10.12.2020):

    pip install torch
  • Conda (📥 9.5M · ⏱️ 10.12.2020):

    conda install -c pytorch pytorch
PySpark (🥇38 · ⭐ 28K) - Apache Spark Python API. Apache-2
  • GitHub (👨‍💻 2.4K · 🔀 23K · 📦 530 · ⏱️ 26.12.2020):

    git clone https://github.com/apache/spark
  • PyPi (📥 6.8M / month · 📦 760 · ⏱️ 07.09.2020):

    pip install pyspark
  • Conda (📥 850K · ⏱️ 07.09.2020):

    conda install -c conda-forge pyspark
StatsModels (🥇36 · ⭐ 5.8K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub (👨‍💻 300 · 🔀 2.1K · 📥 25 · 📦 35K · 📋 4.3K - 47% open · ⏱️ 23.12.2020):

    git clone https://github.com/statsmodels/statsmodels
  • PyPi (📥 2.3M / month · 📦 6.7K · ⏱️ 29.10.2020):

    pip install statsmodels
  • Conda (📥 3M · ⏱️ 21.11.2020):

    conda install -c conda-forge statsmodels
Keras (πŸ₯ˆ35 Β· ⭐ 50K) - Deep Learning for humans. MIT
  • GitHub (πŸ‘¨β€πŸ’» 890 Β· πŸ”€ 19K Β· πŸ“‹ 10K - 30% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/keras-team/keras
  • PyPi (πŸ“₯ 2M / month Β· πŸ“¦ 15K Β· ⏱️ 24.06.2020):

    pip install keras
  • Conda (πŸ“₯ 1.4M Β· ⏱️ 25.06.2020):

    conda install -c conda-forge keras
XGBoost (πŸ₯ˆ35 Β· ⭐ 20K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 500 Β· πŸ”€ 7.8K Β· πŸ“₯ 1.8K Β· πŸ“¦ 12K Β· πŸ“‹ 3.9K - 6% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/dmlc/xgboost
  • PyPi (πŸ“₯ 2.8M / month Β· πŸ“¦ 1.6K Β· ⏱️ 09.12.2020):

    pip install xgboost
  • Conda (πŸ“₯ 1.3M Β· ⏱️ 10.12.2020):

    conda install -c conda-forge xgboost
LightGBM (πŸ₯ˆ35 Β· ⭐ 12K) - A fast, distributed, high performance gradient boosting (GBT, GBDT,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 190 Β· πŸ”€ 3.1K Β· πŸ“₯ 82K Β· πŸ“¦ 5.1K Β· πŸ“‹ 2K - 4% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/microsoft/LightGBM
  • PyPi (πŸ“₯ 1.5M / month Β· πŸ“¦ 560 Β· ⏱️ 08.12.2020):

    pip install lightgbm
  • Conda (πŸ“₯ 450K Β· ⏱️ 08.12.2020):

    conda install -c conda-forge lightgbm
Theano (πŸ₯ˆ35 Β· ⭐ 9.3K) - Theano is a Python library that allows you to define, optimize, and.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 380 Β· πŸ”€ 2.5K Β· πŸ“¦ 9.9K Β· πŸ“‹ 2.8K - 24% open Β· ⏱️ 05.09.2020):

    git clone https://github.com/Theano/Theano
  • PyPi (πŸ“₯ 230K / month Β· πŸ“¦ 5.5K Β· ⏱️ 27.07.2020):

    pip install theano
  • Conda (πŸ“₯ 1.3M Β· ⏱️ 01.11.2020):

    conda install -c conda-forge theano
MXNet (πŸ₯ˆ34 Β· ⭐ 19K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 950 Β· πŸ”€ 6.8K Β· πŸ“₯ 23K Β· πŸ“¦ 1.7K Β· πŸ“‹ 9.3K - 19% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/apache/incubator-mxnet
  • PyPi (πŸ“₯ 130K / month Β· πŸ“¦ 440 Β· ⏱️ 28.08.2020):

    pip install mxnet
  • Conda (πŸ“₯ 5.6K Β· ⏱️ 29.02.2020):

    conda install -c anaconda mxnet
pytorch-lightning (πŸ₯ˆ33 Β· ⭐ 11K) - The lightweight PyTorch wrapper for high-performance AI.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 350 Β· πŸ”€ 1.3K Β· πŸ“₯ 14 Β· πŸ“¦ 1.5K Β· πŸ“‹ 2.7K - 13% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/PyTorchLightning/pytorch-lightning
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 14 Β· ⏱️ 23.12.2020):

    pip install pytorch-lightning
  • Conda (πŸ“₯ 12K Β· ⏱️ 23.12.2020):

    conda install -c conda-forge pytorch-lightning
Fastai (πŸ₯ˆ32 Β· ⭐ 20K) - The fastai deep learning library, plus lessons and tutorials. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 470 Β· πŸ”€ 6.9K Β· πŸ“‹ 1.3K - 4% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/fastai/fastai
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 370 Β· ⏱️ 22.12.2020):

    pip install fastai
jax (πŸ₯ˆ32 Β· ⭐ 11K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 240 Β· πŸ”€ 890 Β· πŸ“¦ 910 Β· πŸ“‹ 2K - 34% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/google/jax
  • PyPi (πŸ“₯ 79K / month Β· πŸ“¦ 46 Β· ⏱️ 05.12.2020):

    pip install jax
  • Conda (πŸ“₯ 64K Β· ⏱️ 14.10.2020):

    conda install -c conda-forge jaxlib
Chainer (πŸ₯ˆ32 Β· ⭐ 5.5K) - A flexible framework of neural networks for deep learning. MIT
  • GitHub (πŸ‘¨β€πŸ’» 320 Β· πŸ”€ 1.4K Β· πŸ“¦ 1.8K Β· πŸ“‹ 2K - 0% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/chainer/chainer
  • PyPi (πŸ“₯ 21K / month Β· πŸ“¦ 740 Β· ⏱️ 30.07.2020):

    pip install chainer
Thinc (πŸ₯ˆ32 Β· ⭐ 2.1K) - A refreshing functional take on deep learning, compatible with your.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 33 Β· πŸ”€ 180 Β· πŸ“¦ 10K Β· πŸ“‹ 94 - 14% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/explosion/thinc
  • PyPi (πŸ“₯ 890K / month Β· πŸ“¦ 1.1K Β· ⏱️ 16.12.2020):

    pip install thinc
  • Conda (πŸ“₯ 830K Β· ⏱️ 18.12.2020):

    conda install -c conda-forge thinc
Catboost (πŸ₯ˆ31 Β· ⭐ 5.6K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 710 Β· πŸ”€ 850 Β· πŸ“₯ 46K Β· πŸ“‹ 1.3K - 22% open Β· ⏱️ 27.12.2020):

    git clone https://github.com/catboost/catboost
  • PyPi (πŸ“₯ 660K / month Β· πŸ“¦ 81 Β· ⏱️ 26.08.2020):

    pip install catboost
  • Conda (πŸ“₯ 540K Β· ⏱️ 18.11.2020):

    conda install -c conda-forge catboost
PaddlePaddle (πŸ₯ˆ30 Β· ⭐ 14K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 490 Β· πŸ”€ 3.4K Β· πŸ“₯ 15K Β· πŸ“¦ 22 Β· πŸ“‹ 12K - 18% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/PaddlePaddle/Paddle
  • PyPi (πŸ“₯ 7.9K / month Β· πŸ“¦ 16 Β· ⏱️ 14.12.2020):

    pip install paddlepaddle
TFlearn (πŸ₯ˆ30 Β· ⭐ 9.5K) - Deep learning library featuring a higher-level API for TensorFlow. MIT
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 2.4K Β· πŸ“¦ 2.9K Β· πŸ“‹ 930 - 61% open Β· ⏱️ 30.11.2020):

    git clone https://github.com/tflearn/tflearn
  • PyPi (πŸ“₯ 15K / month Β· πŸ“¦ 480 Β· ⏱️ 11.11.2020):

    pip install tflearn
Vowpal Wabbit (πŸ₯ˆ30 Β· ⭐ 7.4K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 300 Β· πŸ”€ 1.7K Β· πŸ“¦ 87 Β· πŸ“‹ 1K - 15% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/VowpalWabbit/vowpal_wabbit
  • PyPi (πŸ“₯ 16K / month Β· πŸ“¦ 8 Β· ⏱️ 12.11.2020):

    pip install vowpalwabbit
Turi Create (πŸ₯‰29 Β· ⭐ 10K) - Turi Create simplifies the development of custom machine learning.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 81 Β· πŸ”€ 1.1K Β· πŸ“₯ 3.4K Β· πŸ“¦ 220 Β· πŸ“‹ 1.7K - 27% open Β· ⏱️ 13.10.2020):

    git clone https://github.com/apple/turicreate
  • PyPi (πŸ“₯ 13K / month Β· πŸ“¦ 34 Β· ⏱️ 30.09.2020):

    pip install turicreate
Sonnet (πŸ₯‰28 Β· ⭐ 8.7K) - TensorFlow-based neural network library. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 48 Β· πŸ”€ 1.2K Β· πŸ“¦ 460 Β· πŸ“‹ 150 - 10% open Β· ⏱️ 08.10.2020):

    git clone https://github.com/deepmind/sonnet
  • PyPi (πŸ“₯ 36K / month Β· πŸ“¦ 82 Β· ⏱️ 27.03.2020):

    pip install dm-sonnet
  • Conda (πŸ“₯ 6.7K Β· ⏱️ 14.11.2020):

    conda install -c conda-forge sonnet
tensorpack (πŸ₯‰28 Β· ⭐ 5.9K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 55 Β· πŸ”€ 1.7K Β· πŸ“₯ 95 Β· πŸ“¦ 640 Β· πŸ“‹ 1.3K - 0% open Β· ⏱️ 01.11.2020):

    git clone https://github.com/tensorpack/tensorpack
  • PyPi (πŸ“₯ 50K / month Β· πŸ“¦ 76 Β· ⏱️ 24.04.2020):

    pip install tensorpack
dyNET (πŸ₯‰28 Β· ⭐ 3.2K) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 690 Β· πŸ“₯ 1.2K Β· πŸ“¦ 160 Β· πŸ“‹ 910 - 27% open Β· ⏱️ 22.10.2020):

    git clone https://github.com/clab/dynet
  • PyPi (πŸ“₯ 5.7K / month Β· πŸ“¦ 38 Β· ⏱️ 21.10.2020):

    pip install dyNET
Flax (πŸ₯‰28 Β· ⭐ 1.3K) - Flax is a neural network ecosystem for JAX that is designed for.. Apache-2 jax
  • GitHub (πŸ‘¨β€πŸ’» 68 Β· πŸ”€ 150 Β· πŸ“₯ 18 Β· πŸ“¦ 73 Β· πŸ“‹ 220 - 52% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/google/flax
  • PyPi (πŸ“₯ 61K / month Β· πŸ“¦ 3 Β· ⏱️ 08.12.2020):

    pip install flax
CNTK (πŸ₯‰27 Β· ⭐ 17K Β· πŸ’€) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
  • GitHub (πŸ‘¨β€πŸ’» 260 Β· πŸ”€ 4.4K Β· πŸ“₯ 14K Β· πŸ“‹ 3.3K - 24% open Β· ⏱️ 31.03.2020):

    git clone https://github.com/microsoft/CNTK
  • PyPi (πŸ“₯ 1.5K / month Β· πŸ“¦ 17 Β· ⏱️ 09.12.2020):

    pip install cntk
skorch (πŸ₯‰27 Β· ⭐ 3.7K) - A scikit-learn compatible neural network library that wraps.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 270 Β· πŸ“¦ 240 Β· πŸ“‹ 360 - 13% open Β· ⏱️ 25.12.2020):

    git clone https://github.com/skorch-dev/skorch
  • PyPi (πŸ“₯ 7.9K / month Β· πŸ“¦ 13 Β· ⏱️ 30.08.2020):

    pip install skorch
  • Conda (πŸ“₯ 160K Β· ⏱️ 19.09.2020):

    conda install -c conda-forge skorch
Ignite (πŸ₯‰27 Β· ⭐ 3.2K) - High-level library to help with training and evaluating neural.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 400 Β· πŸ“¦ 670 Β· πŸ“‹ 680 - 14% open Β· ⏱️ 25.12.2020):

    git clone https://github.com/pytorch/ignite
  • PyPi (πŸ“₯ 28K / month Β· πŸ“¦ 62 Β· ⏱️ 26.12.2020):

    pip install pytorch-ignite
  • Conda (πŸ“₯ 50K Β· ⏱️ 20.09.2020):

    conda install -c pytorch ignite
Ludwig (πŸ₯‰26 Β· ⭐ 7.4K) - Ludwig is a toolbox that allows to train and evaluate deep.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 84 Β· πŸ”€ 870 Β· πŸ“¦ 72 Β· πŸ“‹ 500 - 19% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/ludwig-ai/ludwig
  • PyPi (πŸ“₯ 1.2K / month Β· πŸ“¦ 6 Β· ⏱️ 05.10.2020):

    pip install ludwig
mlpack (πŸ₯‰26 Β· ⭐ 3.5K) - mlpack: a scalable C++ machine learning library --. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 260 Β· πŸ”€ 1.3K Β· πŸ“‹ 1.3K - 11% open Β· ⏱️ 27.12.2020):

    git clone https://github.com/mlpack/mlpack
  • PyPi (πŸ“₯ 360 / month Β· ⏱️ 28.10.2020):

    pip install mlpack
  • Conda (πŸ“₯ 61K Β· ⏱️ 29.10.2020):

    conda install -c conda-forge mlpack
Neural Network Libraries (πŸ₯‰25 Β· ⭐ 2.4K) - Neural Network Libraries. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 55 Β· πŸ”€ 300 Β· πŸ“₯ 510 Β· πŸ“‹ 53 - 66% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/sony/nnabla
  • PyPi (πŸ“₯ 3.9K / month Β· πŸ“¦ 34 Β· ⏱️ 24.12.2020):

    pip install nnabla
xLearn (πŸ₯‰24 Β· ⭐ 2.8K Β· πŸ’€) - High performance, easy-to-use, and scalable machine learning.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 480 Β· πŸ“₯ 2.2K Β· πŸ“¦ 38 Β· πŸ“‹ 280 - 62% open Β· ⏱️ 03.03.2020):

    git clone https://github.com/aksnzhy/xlearn
  • PyPi (πŸ“₯ 9.1K / month Β· πŸ“¦ 1 Β· ⏱️ 04.12.2018):

    pip install xlearn
ktrain (πŸ₯‰24 Β· ⭐ 700) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 12 Β· πŸ”€ 170 Β· πŸ“¦ 110 Β· πŸ“‹ 280 - 2% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/amaiya/ktrain
  • PyPi (πŸ“₯ 7.3K / month Β· πŸ“¦ 1 Β· ⏱️ 05.12.2020):

    pip install ktrain
SHOGUN (πŸ₯‰23 Β· ⭐ 2.8K) - Unified and efficient Machine Learning. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 250 Β· πŸ”€ 1K Β· πŸ“‹ 1.6K - 33% open Β· ⏱️ 08.12.2020):

    git clone https://github.com/shogun-toolbox/shogun
  • Conda (πŸ“₯ 89K Β· ⏱️ 25.06.2018):

    conda install -c conda-forge shogun
  • Dockerhub (πŸ“₯ 1.4K Β· ⭐ 1 Β· ⏱️ 31.01.2019):

    docker pull shogun/shogun
einops (πŸ₯‰23 Β· ⭐ 2K) - Deep learning operations reinvented (for pytorch, tensorflow, chainer,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 64 Β· πŸ“¦ 150 Β· πŸ“‹ 59 - 37% open Β· ⏱️ 14.11.2020):

    git clone https://github.com/arogozhnikov/einops
  • PyPi (πŸ“₯ 9.9K / month Β· πŸ“¦ 10 Β· ⏱️ 08.09.2020):

    pip install einops
  • Conda (πŸ“₯ 3.4K Β· ⏱️ 15.10.2020):

    conda install -c conda-forge einops
mace (πŸ₯‰21 Β· ⭐ 4.2K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 54 Β· πŸ”€ 740 Β· πŸ“₯ 1.3K Β· πŸ“‹ 620 - 5% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/XiaoMi/mace
Neural Tangents (πŸ₯‰21 Β· ⭐ 1.2K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 16 Β· πŸ”€ 140 Β· πŸ“₯ 65 Β· πŸ“¦ 12 Β· πŸ“‹ 75 - 37% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/google/neural-tangents
  • PyPi (πŸ“₯ 680 / month Β· ⏱️ 18.11.2020):

    pip install neural-tangents
ThunderSVM (πŸ₯‰20 Β· ⭐ 1.2K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 33 Β· πŸ”€ 160 Β· πŸ“₯ 2.1K Β· πŸ“‹ 190 - 22% open Β· ⏱️ 14.12.2020):

    git clone https://github.com/Xtra-Computing/thundersvm
  • PyPi (πŸ“₯ 350 / month Β· ⏱️ 13.03.2020):

    pip install thundersvm
Haiku (πŸ₯‰20 Β· ⭐ 870) - JAX-based neural network library. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 32 Β· πŸ”€ 58 Β· πŸ“¦ 53 Β· πŸ“‹ 65 - 27% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/deepmind/dm-haiku
Torchbearer (πŸ₯‰18 Β· ⭐ 580 Β· πŸ’€) - torchbearer: A model fitting library for PyTorch. MIT
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 62 Β· πŸ“¦ 37 Β· πŸ“‹ 240 - 3% open Β· ⏱️ 17.02.2020):

    git clone https://github.com/pytorchbearer/torchbearer
  • PyPi (πŸ“₯ 400 / month Β· πŸ“¦ 3 Β· ⏱️ 31.01.2020):

    pip install torchbearer
ThunderGBM (πŸ₯‰17 Β· ⭐ 570) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 71 Β· πŸ“‹ 43 - 39% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/Xtra-Computing/thundergbm
  • PyPi (πŸ“₯ 150 / month Β· ⏱️ 01.05.2020):

    pip install thundergbm
StarSpace (πŸ₯‰13 Β· ⭐ 3.5K Β· πŸ’€) - Learning embeddings for classification, retrieval and ranking. MIT
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 480 Β· πŸ“‹ 190 - 23% open Β· ⏱️ 13.12.2019):

    git clone https://github.com/facebookresearch/StarSpace
Show 6 hidden projects...
  • dlib (πŸ₯ˆ33 Β· ⭐ 9.7K Β· βž•) - A toolkit for making real world machine learning and data.. ❗️BSL-1.0
  • NuPIC (πŸ₯‰25 Β· ⭐ 6.2K Β· πŸ’€) - Numenta Platform for Intelligent Computing is an.. ❗️AGPL-3.0
  • MindsDB (πŸ₯‰20 Β· ⭐ 3.1K) - Predictive AI layer for existing databases. ❗️GPL-3.0
  • NeuPy (πŸ₯‰20 Β· ⭐ 660 Β· πŸ’€) - NeuPy is a Tensorflow based python library for prototyping and.. MIT
  • elegy (πŸ₯‰16 Β· ⭐ 140 Β· 🐣) - Elegy is a Neural Networks framework based on Jax and.. Apache-2 jax
  • NeoML (πŸ₯‰11 Β· ⭐ 550 Β· 🐣) - Machine learning framework for both deep learning and.. ❗️Unlicensed

Data Visualization

General-purpose and task-specific data visualization libraries.

Matplotlib (🥇40 · ⭐ 13K) - matplotlib: plotting with Python. ❗️Python-2.0
  • GitHub (👨‍💻 1.2K · 🔀 5.5K · 📦 300K · 📋 7.5K - 21% open · ⏱️ 25.12.2020):

    git clone https://github.com/matplotlib/matplotlib
  • PyPi (📥 9.2M / month · 📦 79K · ⏱️ 12.11.2020):

    pip install matplotlib
  • Conda (📥 7.6M · ⏱️ 18.11.2020):

    conda install -c conda-forge matplotlib
Plotly (🥇35 · ⭐ 8.5K) - The interactive graphing library for Python (includes Plotly Express). MIT
  • GitHub (👨‍💻 160 · 🔀 1.7K · 📦 5 · 📋 1.8K - 42% open · ⏱️ 23.12.2020):

    git clone https://github.com/plotly/plotly.py
  • PyPi (📥 2.8M / month · 📦 5K · ⏱️ 09.12.2020):

    pip install plotly
  • NPM (📥 36K / month · 📦 4 · ⏱️ 09.12.2020):

    npm install plotlywidget
  • Conda (📥 1.1M · ⏱️ 09.12.2020):

    conda install -c conda-forge plotly
Seaborn (🥇35 · ⭐ 7.9K) - Statistical data visualization using matplotlib. BSD-3
  • GitHub (👨‍💻 140 · 🔀 1.4K · 📥 110 · 📦 75K · 📋 1.7K - 4% open · ⏱️ 23.12.2020):

    git clone https://github.com/mwaskom/seaborn
  • PyPi (📥 2.1M / month · 📦 13K · ⏱️ 20.12.2020):

    pip install seaborn
  • Conda (📥 1.8M · ⏱️ 21.12.2020):

    conda install -c conda-forge seaborn
dash (🥇34 · ⭐ 14K · ➕) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript.. MIT
  • GitHub (👨‍💻 68 · 🔀 1.4K · 📦 16K · 📋 940 - 41% open · ⏱️ 18.12.2020):

    git clone https://github.com/plotly/dash
  • PyPi (📥 240K / month · 📦 1.6K · ⏱️ 09.12.2020):

    pip install dash
  • Conda (📥 200K · ⏱️ 11.12.2020):

    conda install -c conda-forge dash
Bokeh (πŸ₯ˆ33 Β· ⭐ 14K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 540 Β· πŸ”€ 3.6K Β· πŸ“¦ 29K Β· πŸ“‹ 6.3K - 9% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/bokeh/bokeh
  • PyPi (πŸ“₯ 1M / month Β· πŸ“¦ 5.9K Β· ⏱️ 08.12.2020):

    pip install bokeh
  • Conda (πŸ“₯ 3.5M Β· ⏱️ 23.11.2020):

    conda install -c conda-forge bokeh
pyecharts (πŸ₯ˆ31 Β· ⭐ 10K) - Python Echarts Plotting Library. MIT
  • GitHub (πŸ‘¨β€πŸ’» 29 Β· πŸ”€ 2.3K Β· πŸ“¦ 1.4K Β· πŸ“‹ 1.3K - 2% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/pyecharts/pyecharts
  • PyPi (πŸ“₯ 32K / month Β· πŸ“¦ 280 Β· ⏱️ 29.10.2020):

    pip install pyecharts
wordcloud (πŸ₯ˆ31 Β· ⭐ 7.7K) - A little word cloud generator in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 58 Β· πŸ”€ 2K Β· πŸ“¦ 8.1K Β· πŸ“‹ 430 - 20% open Β· ⏱️ 11.11.2020):

    git clone https://github.com/amueller/word_cloud
  • PyPi (πŸ“₯ 250K / month Β· πŸ“¦ 1.1K Β· ⏱️ 11.11.2020):

    pip install wordcloud
  • Conda (πŸ“₯ 180K Β· ⏱️ 16.11.2020):

    conda install -c conda-forge wordcloud
bqplot (πŸ₯ˆ30 Β· ⭐ 3K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 51 Β· πŸ”€ 400 Β· πŸ“¦ 1.1K Β· πŸ“‹ 500 - 37% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/bqplot/bqplot
  • PyPi (πŸ“₯ 15K / month Β· πŸ“¦ 110 Β· ⏱️ 05.11.2020):

    pip install bqplot
  • NPM (πŸ“₯ 140K / month Β· πŸ“¦ 10 Β· ⏱️ 12.10.2020):

    npm install bqplot
  • Conda (πŸ“₯ 440K Β· ⏱️ 12.10.2020):

    conda install -c conda-forge bqplot
pandas-profiling (πŸ₯ˆ29 Β· ⭐ 6.5K) - Create HTML profiling reports from pandas DataFrame objects. MIT
  • GitHub (πŸ‘¨β€πŸ’» 61 Β· πŸ”€ 970 Β· πŸ“¦ 2.7K Β· πŸ“‹ 400 - 12% open Β· ⏱️ 30.11.2020):

    git clone https://github.com/pandas-profiling/pandas-profiling
  • PyPi (πŸ“₯ 240K / month Β· πŸ“¦ 160 Β· ⏱️ 03.09.2020):

    pip install pandas-profiling
  • Conda (πŸ“₯ 94K Β· ⏱️ 03.09.2020):

    conda install -c conda-forge pandas-profiling
UMAP (πŸ₯ˆ29 Β· ⭐ 4.4K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 67 Β· πŸ”€ 470 Β· πŸ“‹ 410 - 49% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/lmcinnes/umap
  • PyPi (πŸ“₯ 120K / month Β· πŸ“¦ 230 Β· ⏱️ 26.11.2020):

    pip install umap-learn
PyQtGraph (πŸ₯ˆ29 Β· ⭐ 2.2K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 790 Β· πŸ“‹ 710 - 39% open Β· ⏱️ 25.12.2020):

    git clone https://github.com/pyqtgraph/pyqtgraph
  • PyPi (πŸ“₯ 34K / month Β· πŸ“¦ 890 Β· ⏱️ 20.12.2020):

    pip install pyqtgraph
  • Conda (πŸ“₯ 160K Β· ⏱️ 20.12.2020):

    conda install -c conda-forge pyqtgraph
HoloViews (πŸ₯ˆ29 Β· ⭐ 1.8K) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 99 Β· πŸ”€ 290 Β· πŸ“‹ 2.5K - 26% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/holoviz/holoviews
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 170 Β· ⏱️ 03.12.2020):

    pip install holoviews
  • NPM (πŸ“₯ 6.8K / month Β· ⏱️ 24.05.2020):

    npm install @pyviz/jupyterlab_pyviz
  • Conda (πŸ“₯ 390K Β· ⏱️ 03.12.2020):

    conda install -c conda-forge holoviews
Graphviz (πŸ₯ˆ29 Β· ⭐ 880 Β· βž•) - Simple Python interface for Graphviz. MIT
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 140 Β· πŸ“¦ 16K Β· πŸ“‹ 99 - 5% open Β· ⏱️ 25.12.2020):

    git clone https://github.com/xflr6/graphviz
  • PyPi (πŸ“₯ 2.3M / month Β· πŸ“¦ 4.4K Β· ⏱️ 24.12.2020):

    pip install graphviz
VisPy (πŸ₯ˆ28 Β· ⭐ 2.5K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 540 Β· πŸ“¦ 440 Β· πŸ“‹ 1.1K - 31% open Β· ⏱️ 28.11.2020):

    git clone https://github.com/vispy/vispy
  • PyPi (πŸ“₯ 14K / month Β· πŸ“¦ 120 Β· ⏱️ 28.11.2020):

    pip install vispy
  • NPM (πŸ“₯ 67 / month Β· ⏱️ 15.03.2020):

    npm install vispy
  • Conda (πŸ“₯ 120K Β· ⏱️ 28.11.2020):

    conda install -c conda-forge vispy
datashader (πŸ₯ˆ28 Β· ⭐ 2.4K) - Quickly and accurately render even the largest data. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 43 Β· πŸ”€ 310 Β· πŸ“¦ 540 Β· πŸ“‹ 460 - 31% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/holoviz/datashader
  • PyPi (πŸ“₯ 15K / month Β· πŸ“¦ 70 Β· ⏱️ 23.12.2020):

    pip install datashader
  • Conda (πŸ“₯ 130K Β· ⏱️ 17.08.2020):

    conda install -c conda-forge datashader
missingno (πŸ₯‰27 Β· ⭐ 2.6K) - Missing data visualization module for Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 330 Β· πŸ“¦ 2.5K Β· πŸ“‹ 100 - 14% open Β· ⏱️ 23.11.2020):

    git clone https://github.com/ResidentMario/missingno
  • PyPi (πŸ“₯ 200K / month Β· πŸ“¦ 76 Β· ⏱️ 29.06.2018):

    pip install missingno
  • Conda (πŸ“₯ 65K Β· ⏱️ 15.02.2020):

    conda install -c conda-forge missingno
Cufflinks (πŸ₯‰27 Β· ⭐ 2K) - Productivity Tools for Plotly + Pandas. MIT
  • GitHub (πŸ‘¨β€πŸ’» 37 Β· πŸ”€ 500 Β· πŸ“¦ 2.4K Β· πŸ“‹ 200 - 39% open Β· ⏱️ 07.10.2020):

    git clone https://github.com/santosjorge/cufflinks
  • PyPi (πŸ“₯ 51K / month Β· πŸ“¦ 230 Β· ⏱️ 01.03.2020):

    pip install cufflinks
Perspective (πŸ₯‰26 Β· ⭐ 3.1K) - Streaming pivot visualization via WebAssembly. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 59 Β· πŸ”€ 330 Β· πŸ“¦ 160 Β· πŸ“‹ 350 - 18% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/finos/perspective
  • PyPi (πŸ“₯ 560 / month Β· πŸ“¦ 4 Β· ⏱️ 15.10.2020):

    pip install perspective-python
  • NPM (πŸ“₯ 1K / month Β· ⏱️ 15.10.2020):

    npm install @finos/perspective-jupyterlab
PyVista (πŸ₯‰26 Β· ⭐ 620) - 3D plotting and mesh analysis through a streamlined interface for.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 47 Β· πŸ”€ 130 Β· πŸ“₯ 30 Β· πŸ“¦ 220 Β· πŸ“‹ 380 - 30% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/pyvista/pyvista
  • PyPi (πŸ“₯ 8.9K / month Β· πŸ“¦ 26 Β· ⏱️ 10.12.2020):

    pip install pyvista
  • Conda (πŸ“₯ 51K Β· ⏱️ 10.12.2020):

    conda install -c conda-forge pyvista
hvPlot (πŸ₯‰26 Β· ⭐ 330) - A high-level plotting API for pandas, dask, xarray, and networkx built.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 49 Β· πŸ“¦ 410 Β· πŸ“‹ 310 - 31% open Β· ⏱️ 18.11.2020):

    git clone https://github.com/holoviz/hvplot
  • PyPi (πŸ“₯ 68K / month Β· πŸ“¦ 15 Β· ⏱️ 02.06.2020):

    pip install hvplot
  • Conda (πŸ“₯ 54K Β· ⏱️ 02.06.2020):

    conda install -c conda-forge hvplot
Chartify (πŸ₯‰25 Β· ⭐ 2.8K) - Python library that makes it easy for data scientists to create.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 240 Β· πŸ“¦ 52 Β· πŸ“‹ 70 - 57% open Β· ⏱️ 02.11.2020):

    git clone https://github.com/spotify/chartify
  • PyPi (πŸ“₯ 8.5K / month Β· πŸ“¦ 5 Β· ⏱️ 02.11.2020):

    pip install chartify
  • Conda (πŸ“₯ 12K Β· ⏱️ 07.11.2020):

    conda install -c conda-forge chartify
Facets Overview (πŸ₯‰24 Β· ⭐ 6.4K) - Visualizations for machine learning datasets. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 790 Β· πŸ“¦ 35 Β· πŸ“‹ 140 - 48% open Β· ⏱️ 27.10.2020):

    git clone https://github.com/pair-code/facets
  • PyPi (πŸ“₯ 22K / month Β· ⏱️ 24.07.2019):

    pip install facets-overview
HyperTools (πŸ₯‰24 Β· ⭐ 1.6K) - A Python toolbox for gaining geometric insights into high-.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 150 Β· πŸ“¦ 90 Β· πŸ“‹ 190 - 36% open Β· ⏱️ 02.10.2020):

    git clone https://github.com/ContextLab/hypertools
  • PyPi (πŸ“₯ 520 / month Β· πŸ“¦ 9 Β· ⏱️ 02.10.2020):

    pip install hypertools
pythreejs (πŸ₯‰24 Β· ⭐ 690) - A Jupyter - Three.js bridge. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 24 Β· πŸ”€ 160 Β· πŸ“¦ 15 Β· πŸ“‹ 190 - 29% open Β· ⏱️ 09.10.2020):

    git clone https://github.com/jupyter-widgets/pythreejs
  • PyPi (πŸ“₯ 6.7K / month Β· πŸ“¦ 13 Β· ⏱️ 09.10.2020):

    pip install pythreejs
  • NPM (πŸ“₯ 4.5K / month Β· πŸ“¦ 8 Β· ⏱️ 19.03.2020):

    npm install jupyter-threejs
  • Conda (πŸ“₯ 250K Β· ⏱️ 12.10.2020):

    conda install -c conda-forge pythreejs
openTSNE (πŸ₯‰23 Β· ⭐ 730) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 81 Β· πŸ“¦ 160 Β· πŸ“‹ 71 - 4% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/pavlin-policar/openTSNE
  • PyPi (πŸ“₯ 13K / month Β· πŸ“¦ 4 Β· ⏱️ 21.12.2020):

    pip install opentsne
  • Conda (πŸ“₯ 71K Β· ⏱️ 21.12.2020):

    conda install -c conda-forge opentsne
PandasGUI (πŸ₯‰22 Β· ⭐ 1.9K) - A GUI for Pandas DataFrames. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 110 Β· πŸ“¦ 24 Β· πŸ“‹ 60 - 16% open Β· ⏱️ 05.12.2020):

    git clone https://github.com/adamerose/pandasgui
  • PyPi (πŸ“₯ 2.1K / month Β· πŸ“¦ 1 Β· ⏱️ 10.11.2020):

    pip install pandasgui
Pandas-Bokeh (πŸ₯‰22 Β· ⭐ 590) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub (πŸ‘¨β€πŸ’» 11 Β· πŸ”€ 74 Β· πŸ“¦ 120 Β· πŸ“‹ 71 - 28% open Β· ⏱️ 27.11.2020):

    git clone https://github.com/PatrikHlobil/Pandas-Bokeh
  • PyPi (πŸ“₯ 7.4K / month Β· πŸ“¦ 10 Β· ⏱️ 27.11.2020):

    pip install pandas-bokeh
vega (πŸ₯‰22 Β· ⭐ 290) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 46 Β· πŸ“‹ 87 - 9% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/vega/ipyvega
  • PyPi (πŸ“₯ 6.9K / month Β· πŸ“¦ 150 Β· ⏱️ 15.05.2020):

    pip install vega
  • Conda (πŸ“₯ 370K Β· ⏱️ 10.12.2020):

    conda install -c conda-forge vega
PyWaffle (πŸ₯‰20 Β· ⭐ 370) - Make Waffle Charts in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 59 Β· πŸ“¦ 38 Β· πŸ“‹ 12 - 16% open Β· ⏱️ 26.07.2020):

    git clone https://github.com/gyli/PyWaffle
  • PyPi (πŸ“₯ 1.1K / month Β· ⏱️ 26.07.2020):

    pip install pywaffle
AutoViz (πŸ₯‰20 Β· ⭐ 270) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 54 Β· πŸ“¦ 24 Β· πŸ“‹ 19 - 15% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/AutoViML/AutoViz
  • PyPi (πŸ“₯ 1.2K / month Β· ⏱️ 10.12.2020):

    pip install autoviz
HiPlot (πŸ₯‰19 Β· ⭐ 1.6K) - HiPlot makes understanding high dimensional data easy. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 86 Β· πŸ“¦ 2 Β· πŸ“‹ 45 - 8% open Β· ⏱️ 19.12.2020):

    git clone https://github.com/facebookresearch/hiplot
  • PyPi (πŸ“₯ 3.1K / month Β· ⏱️ 23.12.2020):

    pip install hiplot
  • Conda (πŸ“₯ 40K Β· ⏱️ 23.12.2020):

    conda install -c conda-forge hiplot
Sweetviz (πŸ₯‰19 Β· ⭐ 1.2K) - Visualize and compare datasets, target values and associations, with.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 130 Β· πŸ“‹ 64 - 32% open Β· ⏱️ 10.12.2020):

    git clone https://github.com/fbdesignpro/sweetviz
  • PyPi (πŸ“₯ 7.9K / month Β· ⏱️ 10.12.2020):

    pip install sweetviz
nx-altair (πŸ₯‰14 Β· ⭐ 150) - Draw interactive NetworkX graphs with Altair. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 19 Β· πŸ“‹ 8 - 50% open Β· ⏱️ 02.06.2020):

    git clone https://github.com/Zsailer/nx_altair
  • PyPi (πŸ“₯ 190 / month Β· πŸ“¦ 2 Β· ⏱️ 02.06.2020):

    pip install nx-altair
Show 5 hidden projects...
  • Altair (πŸ₯‰25 Β· ⭐ 240 Β· πŸ’€) - Declarative statistical visualization library for Python. BSD-3
  • PDPbox (πŸ₯‰22 Β· ⭐ 520 Β· πŸ’€) - python partial dependence plot toolbox. MIT
  • pivottablejs (πŸ₯‰20 Β· ⭐ 410 Β· πŸ’€) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
  • pdvega (πŸ₯‰16 Β· ⭐ 340 Β· πŸ’€) - Interactive plotting for Pandas using Vega-Lite. MIT
  • nptsne (πŸ₯‰15 Β· ⭐ 24) - nptsne is a numpy compatible python binary package that offers a.. Apache-2

Text Data & NLP

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

spaCy (🥇37 · ⭐ 18K) - Industrial-strength Natural Language Processing (NLP) with Python and.. MIT
  • GitHub (👨‍💻 540 · 🔀 3.2K · 📥 2.9K · 📦 20K · 📋 4.2K - 2% open · ⏱️ 22.12.2020):

    git clone https://github.com/explosion/spaCy
  • PyPi (📥 910K / month · 📦 3.1K · ⏱️ 11.12.2020):

    pip install spacy
  • Conda (📥 1.4M · ⏱️ 18.12.2020):

    conda install -c conda-forge spacy
transformers (🥇36 · ⭐ 39K) - Transformers: State-of-the-art Natural Language Processing.. Apache-2
  • GitHub (👨‍💻 740 · 🔀 9.4K · 📥 1.2K · 📦 6.4K · 📋 5.5K - 10% open · ⏱️ 25.12.2020):

    git clone https://github.com/huggingface/transformers
  • PyPi (📥 720K / month · 📦 130 · ⏱️ 17.12.2020):

    pip install transformers
  • Conda (📥 13K · ⏱️ 19.12.2020):

    conda install -c conda-forge transformers
nltk (🥇34 · ⭐ 9.5K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
  • GitHub (👨‍💻 380 · 🔀 2.4K · 📦 84K · 📋 1.5K - 16% open · ⏱️ 23.12.2020):

    git clone https://github.com/nltk/nltk
  • PyPi (📥 4.2M / month · 📦 21K · ⏱️ 12.04.2020):

    pip install nltk
  • Conda (📥 610K · ⏱️ 08.08.2019):

    conda install -c conda-forge nltk
Rasa (🥇32 · ⭐ 10K) - Open source machine learning framework to automate text- and voice-.. Apache-2
  • GitHub (👨‍💻 450 · 🔀 3.2K · 📋 4.8K - 10% open · ⏱️ 22.12.2020):

    git clone https://github.com/RasaHQ/rasa
  • PyPi (📥 69K / month · 📦 64 · ⏱️ 21.12.2020):

    pip install rasa
ChatterBot (🥇31 · ⭐ 11K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 100 Β· πŸ”€ 3.6K Β· πŸ“¦ 3K Β· πŸ“‹ 1.4K - 14% open Β· ⏱️ 20.08.2020):

    git clone https://github.com/gunthercox/ChatterBot
  • PyPi (πŸ“₯ 23K / month Β· πŸ“¦ 680 Β· ⏱️ 22.08.2020):

    pip install chatterbot
sentencepiece (🥇31 · ⭐ 4.7K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 46 Β· πŸ”€ 600 Β· πŸ“₯ 9K Β· πŸ“¦ 5.1K Β· πŸ“‹ 400 - 8% open Β· ⏱️ 12.12.2020):

    git clone https://github.com/google/sentencepiece
  • PyPi (πŸ“₯ 1.1M / month Β· πŸ“¦ 120 Β· ⏱️ 24.10.2020):

    pip install sentencepiece
  • Conda (πŸ“₯ 20K Β· ⏱️ 10.10.2020):

    conda install -c conda-forge sentencepiece
fastText (🥇30 · ⭐ 22K) - Library for fast text representation and classification. MIT
  • GitHub (πŸ‘¨β€πŸ’» 58 Β· πŸ”€ 4.2K Β· πŸ“¦ 1.4K Β· πŸ“‹ 990 - 40% open Β· ⏱️ 18.07.2020):

    git clone https://github.com/facebookresearch/fastText
  • PyPi (πŸ“₯ 120K / month Β· πŸ“¦ 190 Β· ⏱️ 28.04.2020):

    pip install fasttext
  • Conda (πŸ“₯ 17K Β· ⏱️ 12.10.2020):

    conda install -c conda-forge fasttext
fairseq (🥇30 · ⭐ 11K · 📉) - Facebook AI Research Sequence-to-Sequence Toolkit written in.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 280 Β· πŸ”€ 2.7K Β· πŸ“₯ 49 Β· πŸ“¦ 250 Β· πŸ“‹ 2.3K - 24% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/pytorch/fairseq
  • PyPi (πŸ“₯ 8K / month Β· πŸ“¦ 20 Β· ⏱️ 21.11.2020):

    pip install fairseq
flair (🥇30 · ⭐ 9.7K) - A very simple framework for state-of-the-art Natural Language.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 1.4K Β· πŸ“¦ 570 Β· πŸ“‹ 1.4K - 7% open Β· ⏱️ 17.12.2020):

    git clone https://github.com/flairNLP/flair
  • PyPi (πŸ“₯ 36K / month Β· πŸ“¦ 48 Β· ⏱️ 01.12.2020):

    pip install flair
torchtext (🥇30 · ⭐ 2.6K) - Data loaders and abstractions for text and NLP. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 100 Β· πŸ”€ 580 Β· πŸ“¦ 3.4K Β· πŸ“‹ 570 - 45% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/pytorch/text
  • PyPi (πŸ“₯ 97K / month Β· πŸ“¦ 680 Β· ⏱️ 10.12.2020):

    pip install torchtext
AllenNLP (🥈29 · ⭐ 9.5K · 📉) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 230 Β· πŸ”€ 2K Β· πŸ“₯ 36 Β· πŸ“¦ 1.3K Β· πŸ“‹ 2.2K - 4% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/allenai/allennlp
  • PyPi (πŸ“₯ 23K / month Β· πŸ“¦ 230 Β· ⏱️ 17.11.2020):

    pip install allennlp
TextBlob (🥈29 · ⭐ 7.5K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 33 Β· πŸ”€ 950 Β· πŸ“₯ 88 Β· πŸ“¦ 9.5K Β· πŸ“‹ 220 - 31% open Β· ⏱️ 25.12.2020):

    git clone https://github.com/sloria/TextBlob
  • PyPi (πŸ“₯ 270K / month Β· πŸ“¦ 2.5K Β· ⏱️ 24.02.2019):

    pip install textblob
  • Conda (πŸ“₯ 110K Β· ⏱️ 24.02.2019):

    conda install -c conda-forge textblob
Tokenizers (🥈28 · ⭐ 4.1K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 300 Β· πŸ“¦ 20 Β· πŸ“‹ 340 - 23% open Β· ⏱️ 15.12.2020):

    git clone https://github.com/huggingface/tokenizers
  • PyPi (πŸ“₯ 880K / month Β· ⏱️ 08.12.2020):

    pip install tokenizers
  • Conda (πŸ“₯ 15K Β· ⏱️ 19.11.2020):

    conda install -c conda-forge tokenizers
Dedupe (🥈28 · ⭐ 2.8K) - A Python library for accurate and scalable fuzzy matching, record.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 55 Β· πŸ”€ 400 Β· πŸ“¦ 170 Β· πŸ“‹ 620 - 7% open Β· ⏱️ 09.11.2020):

    git clone https://github.com/dedupeio/dedupe
  • PyPi (πŸ“₯ 52K / month Β· πŸ“¦ 78 Β· ⏱️ 06.09.2020):

    pip install dedupe
phonenumbers (🥈28 · ⭐ 2.6K) - Python port of Google's libphonenumber. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 320 Β· πŸ“‹ 110 - 2% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/daviddrysdale/python-phonenumbers
  • PyPi (πŸ“₯ 790K / month Β· πŸ“¦ 2.3K Β· ⏱️ 16.12.2020):

    pip install phonenumbers
  • Conda (πŸ“₯ 370K Β· ⏱️ 04.08.2019):

    conda install -c conda-forge phonenumbers
inflect (🥈28 · ⭐ 460) - Correctly generate plurals, ordinals, indefinite articles; convert.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 25 Β· πŸ”€ 57 Β· πŸ“‹ 71 - 18% open Β· ⏱️ 15.11.2020):

    git clone https://github.com/jaraco/inflect
  • PyPi (πŸ“₯ 910K / month Β· πŸ“¦ 1.4K Β· ⏱️ 15.11.2020):

    pip install inflect
  • Conda (πŸ“₯ 120K Β· ⏱️ 15.11.2020):

    conda install -c conda-forge inflect
OpenNMT (🥈27 · ⭐ 4.7K) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 1.7K Β· πŸ“¦ 63 Β· πŸ“‹ 1.2K - 7% open Β· ⏱️ 09.12.2020):

    git clone https://github.com/OpenNMT/OpenNMT-py
  • PyPi (πŸ“₯ 4K / month Β· πŸ“¦ 4 Β· ⏱️ 10.11.2020):

    pip install OpenNMT-py
GluonNLP (🥈27 · ⭐ 2.2K) - Toolkit that enables easy text preprocessing, datasets.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 72 Β· πŸ”€ 490 Β· πŸ“¦ 350 Β· πŸ“‹ 510 - 44% open Β· ⏱️ 08.10.2020):

    git clone https://github.com/dmlc/gluon-nlp
  • PyPi (πŸ“₯ 34K / month Β· πŸ“¦ 11 Β· ⏱️ 13.08.2020):

    pip install gluonnlp
textacy (🥈27 · ⭐ 1.6K) - NLP, before and after spaCy. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 210 Β· πŸ“‹ 220 - 13% open Β· ⏱️ 06.12.2020):

    git clone https://github.com/chartbeat-labs/textacy
  • PyPi (πŸ“₯ 29K / month Β· πŸ“¦ 77 Β· ⏱️ 29.08.2020):

    pip install textacy
  • Conda (πŸ“₯ 73K Β· ⏱️ 19.11.2020):

    conda install -c conda-forge textacy
DeepPavlov (🥈26 · ⭐ 4.9K) - An open source library for deep learning end-to-end dialog.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 61 Β· πŸ”€ 880 Β· πŸ“¦ 170 Β· πŸ“‹ 510 - 13% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/deepmipt/DeepPavlov
  • PyPi (πŸ“₯ 7.4K / month Β· πŸ“¦ 4 Β· ⏱️ 24.12.2020):

    pip install deeppavlov
TensorFlow Text (🥈26 · ⭐ 670) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 84 Β· πŸ“¦ 380 Β· πŸ“‹ 120 - 32% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/tensorflow/text
  • PyPi (πŸ“₯ 150K / month Β· πŸ“¦ 4 Β· ⏱️ 22.12.2020):

    pip install tensorflow-text
ftfy (🥈25 · ⭐ 2.9K) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 98 Β· πŸ“¦ 2.5K Β· πŸ“‹ 110 - 13% open Β· ⏱️ 17.07.2020):

    git clone https://github.com/LuminosoInsight/python-ftfy
  • PyPi (πŸ“₯ 250K / month Β· πŸ“¦ 760 Β· ⏱️ 20.07.2020):

    pip install ftfy
  • Conda (πŸ“₯ 93K Β· ⏱️ 03.02.2019):

    conda install -c conda-forge ftfy
fastNLP (🥈25 · ⭐ 1.9K · 📉) - fastNLP: A Modularized and Extensible NLP Framework. Currently.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 48 Β· πŸ”€ 340 Β· πŸ“₯ 54 Β· πŸ“¦ 21 Β· πŸ“‹ 140 - 9% open Β· ⏱️ 23.11.2020):

    git clone https://github.com/fastnlp/fastNLP
  • PyPi (πŸ“₯ 1.3K / month Β· πŸ“¦ 1 Β· ⏱️ 04.02.2019):

    pip install fastnlp
TextDistance (🥈25 · ⭐ 1.8K · 💤) - Compute distance between sequences. 30+ algorithms, pure.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 160 Β· πŸ“₯ 96 Β· πŸ“¦ 270 Β· ⏱️ 13.04.2020):

    git clone https://github.com/life4/textdistance
  • PyPi (πŸ“₯ 100K / month Β· πŸ“¦ 28 Β· ⏱️ 13.04.2020):

    pip install textdistance
  • Conda (πŸ“₯ 14K Β· ⏱️ 10.11.2020):

    conda install -c conda-forge textdistance
spark-nlp (🥈25 · ⭐ 1.8K · ➕) - State of the Art Natural Language Processing. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 71 Β· πŸ”€ 390 Β· πŸ“‹ 390 - 20% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/JohnSnowLabs/spark-nlp
  • PyPi (πŸ“₯ 360K / month Β· ⏱️ 15.12.2020):

    pip install spark-nlp
Jina (🥈25 · ⭐ 1.8K) - An easier way to build neural search in the cloud. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 74 Β· πŸ”€ 300 Β· πŸ“¦ 41 Β· πŸ“‹ 540 - 7% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/jina-ai/jina
  • PyPi (πŸ“₯ 1.3K / month Β· ⏱️ 23.12.2020):

    pip install jina
  • Dockerhub (πŸ“₯ 63K Β· ⏱️ 26.12.2020):

    docker pull jinaai/jina
jellyfish (🥈25 · ⭐ 1.4K) - A Python library for doing approximate and phonetic matching of.. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 120 Β· πŸ“¦ 2K Β· πŸ“‹ 95 - 9% open Β· ⏱️ 08.12.2020):

    git clone https://github.com/jamesturk/jellyfish
  • PyPi (πŸ“₯ 790K / month Β· πŸ“¦ 650 Β· ⏱️ 21.05.2020):

    pip install jellyfish
  • Conda (πŸ“₯ 110K Β· ⏱️ 10.10.2020):

    conda install -c conda-forge jellyfish
haystack (🥈25 · ⭐ 1.2K · ➕) - Transformers at scale for question answering & neural search... Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 210 Β· πŸ“¦ 23 Β· πŸ“‹ 390 - 13% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/deepset-ai/haystack
  • PyPi (πŸ“₯ 4.6K / month Β· πŸ“¦ 74 Β· ⏱️ 03.07.2017):

    pip install farm-haystack
CLTK (🥈25 · ⭐ 630) - The Classical Language Toolkit. MIT
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 300 Β· πŸ“₯ 22 Β· πŸ“¦ 150 Β· πŸ“‹ 480 - 26% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/cltk/cltk
  • PyPi (πŸ“₯ 1.3K / month Β· πŸ“¦ 68 Β· ⏱️ 03.12.2020):

    pip install cltk
ParlAI (🥉24 · ⭐ 6.9K) - A framework for training and evaluating AI models on a variety of.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 1.4K Β· πŸ“¦ 5 Β· πŸ“‹ 910 - 12% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/facebookresearch/ParlAI
  • PyPi (πŸ“₯ 780 / month Β· ⏱️ 16.12.2020):

    pip install parlai
PyText (🥉24 · ⭐ 6.1K) - A natural language modeling framework based on PyTorch. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 150 Β· πŸ”€ 760 Β· πŸ“₯ 210 Β· πŸ“¦ 52 Β· πŸ“‹ 190 - 60% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/facebookresearch/pytext
  • PyPi (πŸ“₯ 350 / month Β· πŸ“¦ 1 Β· ⏱️ 27.04.2020):

    pip install pytext-nlp
stanza (🥉24 · ⭐ 5K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 620 Β· πŸ“‹ 420 - 17% open Β· ⏱️ 13.08.2020):

    git clone https://github.com/stanfordnlp/stanza
  • PyPi (πŸ“₯ 14K / month Β· πŸ“¦ 2 Β· ⏱️ 13.08.2020):

    pip install stanza
  • Conda (πŸ“₯ 2.9K Β· ⏱️ 13.08.2020):

    conda install -c stanfordnlp stanza
T5 (🥉24 · ⭐ 3.1K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 410 Β· πŸ“¦ 18 Β· πŸ“‹ 310 - 12% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/google-research/text-to-text-transfer-transformer
  • PyPi (πŸ“₯ 6K / month Β· ⏱️ 18.12.2020):

    pip install t5
Sumy (🥉24 · ⭐ 2.4K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 430 Β· πŸ“¦ 610 Β· πŸ“‹ 88 - 11% open Β· ⏱️ 21.11.2020):

    git clone https://github.com/miso-belica/sumy
  • PyPi (πŸ“₯ 7.5K / month Β· πŸ“¦ 190 Β· ⏱️ 19.05.2019):

    pip install sumy
PyTextRank (🥉24 · ⭐ 1.4K) - Python implementation of TextRank for phrase extraction and.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 280 Β· πŸ“¦ 120 Β· πŸ“‹ 52 - 28% open Β· ⏱️ 21.09.2020):

    git clone https://github.com/DerwenAI/pytextrank
  • PyPi (πŸ“₯ 12K / month Β· πŸ“¦ 6 Β· ⏱️ 15.09.2020):

    pip install pytextrank
SciSpacy (🥉24 · ⭐ 760) - A full spaCy pipeline and models for scientific/biomedical.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 120 Β· πŸ“¦ 190 Β· πŸ“‹ 140 - 12% open Β· ⏱️ 10.11.2020):

    git clone https://github.com/allenai/scispacy
  • PyPi (πŸ“₯ 8.8K / month Β· πŸ“¦ 12 Β· ⏱️ 16.10.2020):

    pip install scispacy
pyahocorasick (🥉24 · ⭐ 560) - Python module (C extension and plain Python) implementing Aho-.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 85 Β· πŸ“¦ 460 Β· πŸ“‹ 93 - 32% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/WojciechMula/pyahocorasick
  • PyPi (πŸ“₯ 120K / month Β· πŸ“¦ 64 Β· ⏱️ 14.01.2019):

    pip install pyahocorasick
  • Conda (πŸ“₯ 110K Β· ⏱️ 13.10.2020):

    conda install -c conda-forge pyahocorasick
Ciphey (🥉23 · ⭐ 5.9K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 38 Β· πŸ”€ 330 Β· πŸ“‹ 210 - 21% open Β· ⏱️ 07.12.2020):

    git clone https://github.com/Ciphey/Ciphey
  • PyPi (πŸ“₯ 4.8K / month Β· ⏱️ 02.12.2020):

    pip install ciphey
  • Dockerhub (πŸ“₯ 7K Β· ⭐ 1 Β· ⏱️ 17.12.2020):

    docker pull remnux/ciphey
flashtext (🥉23 · ⭐ 4.6K · 💤) - Extract Keywords from sentence or Replace keywords in sentences. MIT
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 520 Β· πŸ“¦ 320 Β· πŸ“‹ 97 - 49% open Β· ⏱️ 03.05.2020):

    git clone https://github.com/vi3k6i5/flashtext
  • PyPi (πŸ“₯ 230K / month Β· πŸ“¦ 32 Β· ⏱️ 16.02.2018):

    pip install flashtext
textgenrnn (🥉23 · ⭐ 4.2K · ➕) - Easily train your own text-generating neural network of any.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 650 Β· πŸ“₯ 380 Β· πŸ“¦ 550 Β· πŸ“‹ 190 - 57% open Β· ⏱️ 14.07.2020):

    git clone https://github.com/minimaxir/textgenrnn
  • PyPi (πŸ“₯ 800 / month Β· πŸ“¦ 14 Β· ⏱️ 02.02.2020):

    pip install textgenrnn
Snips NLU (🥉23 · ⭐ 3.4K · 💤) - Snips Python library to extract meaning from text. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 21 Β· πŸ”€ 450 Β· πŸ“‹ 250 - 20% open Β· ⏱️ 08.02.2020):

    git clone https://github.com/snipsco/snips-nlu
  • PyPi (πŸ“₯ 1.5K / month Β· πŸ“¦ 8 Β· ⏱️ 15.01.2020):

    pip install snips-nlu
pytorch-nlp (🥉23 · ⭐ 1.8K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 220 Β· πŸ“¦ 160 Β· πŸ“‹ 63 - 22% open Β· ⏱️ 11.10.2020):

    git clone https://github.com/PetrochukM/PyTorch-NLP
  • PyPi (πŸ“₯ 3.9K / month Β· πŸ“¦ 18 Β· ⏱️ 04.11.2019):

    pip install pytorch-nlp
sense2vec (🥉23 · ⭐ 1.1K · 💤) - Contextually-keyed word vectors. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 200 Β· πŸ“₯ 12K Β· πŸ“¦ 47 Β· πŸ“‹ 93 - 16% open Β· ⏱️ 29.05.2020):

    git clone https://github.com/explosion/sense2vec
  • PyPi (πŸ“₯ 2.3K / month Β· πŸ“¦ 6 Β· ⏱️ 22.11.2019):

    pip install sense2vec
  • Conda (πŸ“₯ 14K Β· ⏱️ 16.03.2020):

    conda install -c conda-forge sense2vec
pySBD (🥉23 · ⭐ 240) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 25 Β· πŸ“¦ 90 Β· πŸ“‹ 47 - 8% open Β· ⏱️ 01.12.2020):

    git clone https://github.com/nipunsadvilkar/pySBD
  • PyPi (πŸ“₯ 12K / month Β· πŸ“¦ 2 Β· ⏱️ 08.10.2020):

    pip install pysbd
neuralcoref (🥉22 · ⭐ 2.2K · ➕) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 370 Β· πŸ“₯ 160 Β· πŸ“¦ 270 Β· πŸ“‹ 250 - 15% open Β· ⏱️ 07.09.2020):

    git clone https://github.com/huggingface/neuralcoref
  • PyPi (πŸ“₯ 3.7K / month Β· πŸ“¦ 9 Β· ⏱️ 08.04.2019):

    pip install neuralcoref
  • Conda (πŸ“₯ 5.5K Β· ⏱️ 21.02.2020):

    conda install -c conda-forge neuralcoref
scattertext (🥉22 · ⭐ 1.5K) - Beautiful visualizations of how language differs among.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 200 Β· πŸ“¦ 140 Β· πŸ“‹ 67 - 25% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/JasonKessler/scattertext
  • PyPi (πŸ“₯ 3.4K / month Β· πŸ“¦ 8 Β· ⏱️ 14.12.2020):

    pip install scattertext
  • Conda (πŸ“₯ 42K Β· ⏱️ 18.12.2020):

    conda install -c conda-forge scattertext
fast-bert (🥉22 · ⭐ 1.5K) - Super easy library for BERT based NLP models. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 34 Β· πŸ”€ 300 Β· πŸ“‹ 230 - 60% open Β· ⏱️ 17.12.2020):

    git clone https://github.com/kaushaltrivedi/fast-bert
  • PyPi (πŸ“₯ 2K / month Β· πŸ“¦ 2 Β· ⏱️ 09.12.2020):

    pip install fast-bert
spacy-transformers (🥉22 · ⭐ 860) - Use pretrained transformers like BERT, XLNet and GPT-2 in.. MIT spacy
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 97 Β· πŸ“¦ 65 Β· πŸ“‹ 180 - 26% open Β· ⏱️ 17.12.2020):

    git clone https://github.com/explosion/spacy-transformers
  • PyPi (πŸ“₯ 7.3K / month Β· πŸ“¦ 3 Β· ⏱️ 14.10.2020):

    pip install spacy-transformers
NLP Architect (🥉21 · ⭐ 2.6K) - A model library for exploring state-of-the-art deep learning.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 390 Β· πŸ“¦ 6 Β· πŸ“‹ 120 - 13% open Β· ⏱️ 26.11.2020):

    git clone https://github.com/IntelLabs/nlp-architect
  • PyPi (πŸ“₯ 270 / month Β· ⏱️ 12.04.2020):

    pip install nlp-architect
NeMo (🥉21 · ⭐ 2.3K) - NeMo: a toolkit for conversational AI. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 67 Β· πŸ”€ 360 Β· πŸ“₯ 710 Β· πŸ“‹ 390 - 40% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/NVIDIA/NeMo
  • PyPi (πŸ“₯ 2K / month Β· πŸ“¦ 4 Β· ⏱️ 11.12.2020):

    pip install nemo-toolkit
Texar (🥉21 · ⭐ 2.1K) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 41 Β· πŸ”€ 330 Β· πŸ“¦ 12 Β· πŸ“‹ 160 - 20% open Β· ⏱️ 29.07.2020):

    git clone https://github.com/asyml/texar
  • PyPi (πŸ“₯ 84 / month Β· πŸ“¦ 2 Β· ⏱️ 19.11.2019):

    pip install texar
Texthero (🥉21 · ⭐ 2K) - Text preprocessing, representation and visualization from zero to hero. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 180 Β· πŸ“₯ 57 Β· πŸ“‹ 120 - 60% open Β· ⏱️ 03.11.2020):

    git clone https://github.com/jbesomi/texthero
  • PyPi (πŸ“₯ 3.1K / month Β· ⏱️ 06.07.2020):

    pip install texthero
DELTA (🥉21 · ⭐ 1.4K) - DELTA is a deep learning based natural language and speech.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 41 Β· πŸ”€ 270 Β· πŸ“‹ 73 - 12% open Β· ⏱️ 17.12.2020):

    git clone https://github.com/Delta-ML/delta
  • PyPi (πŸ“₯ 31 / month Β· ⏱️ 27.03.2020):

    pip install delta-nlp
  • Dockerhub (πŸ“₯ 12K Β· ⏱️ 26.12.2020):

    docker pull zh794390558/delta
DeepMatcher (🥉20 · ⭐ 3.3K · 💤) - Python package for performing Entity and Text Matching using.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 1.3K Β· πŸ“¦ 9 Β· πŸ“‹ 62 - 75% open Β· ⏱️ 20.04.2020):

    git clone https://github.com/anhaidgroup/deepmatcher
  • PyPi (πŸ“₯ 270 / month Β· ⏱️ 30.12.2019):

    pip install deepmatcher
Kashgari (🥉20 · ⭐ 2K) - Kashgari is a production-level NLP Transfer learning framework.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 370 Β· πŸ“¦ 19 Β· πŸ“‹ 330 - 9% open Β· ⏱️ 29.10.2020):

    git clone https://github.com/BrikerMan/Kashgari
  • PyPi (πŸ“₯ 190 / month Β· πŸ“¦ 1 Β· ⏱️ 26.06.2019):

    pip install kashgari-tf
FARM (🥉20 · ⭐ 1.1K) - Fast & easy transfer learning for NLP. Harvesting language models.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 140 Β· πŸ“‹ 310 - 10% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/deepset-ai/FARM
  • PyPi (πŸ“₯ 6.7K / month Β· πŸ“¦ 1 Β· ⏱️ 30.10.2020):

    pip install farm
Sockeye (🥉20 · ⭐ 980) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 52 Β· πŸ”€ 290 Β· πŸ“‹ 240 - 7% open Β· ⏱️ 14.12.2020):

    git clone https://github.com/awslabs/sockeye
  • PyPi (πŸ“₯ 540 / month Β· πŸ“¦ 2 Β· ⏱️ 05.11.2020):

    pip install sockeye
YouTokenToMe (🥉20 · ⭐ 700 · 💤) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 42 Β· πŸ“¦ 91 Β· πŸ“‹ 41 - 51% open Β· ⏱️ 13.02.2020):

    git clone https://github.com/vkcom/youtokentome
  • PyPi (πŸ“₯ 7K / month Β· πŸ“¦ 6 Β· ⏱️ 12.02.2020):

    pip install youtokentome
finetune (🥉20 · ⭐ 630) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 67 Β· πŸ“¦ 8 Β· πŸ“‹ 130 - 13% open Β· ⏱️ 24.11.2020):

    git clone https://github.com/IndicoDataSolutions/finetune
  • PyPi (πŸ“₯ 230 / month Β· πŸ“¦ 2 Β· ⏱️ 10.01.2020):

    pip install finetune
textpipe (🥉19 · ⭐ 260) - Textpipe: clean and extract metadata from text. MIT
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 17 Β· πŸ“¦ 6 Β· πŸ“‹ 44 - 45% open Β· ⏱️ 12.10.2020):

    git clone https://github.com/textpipe/textpipe
  • PyPi (πŸ“₯ 410 / month Β· πŸ“¦ 1 Β· ⏱️ 12.10.2020):

    pip install textpipe
Camphr (🥉17 · ⭐ 320) - spaCy plugin for Transformers, Udify, ELMo, etc. Apache-2 spacy
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 15 Β· πŸ“‹ 27 - 48% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/PKSHATechnology-Research/camphr
  • PyPi (πŸ“₯ 570 / month Β· ⏱️ 25.12.2020):

    pip install camphr
VizSeq (🥉16 · ⭐ 300) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 39 Β· πŸ“¦ 1 Β· πŸ“‹ 15 - 46% open Β· ⏱️ 13.12.2020):

    git clone https://github.com/facebookresearch/vizseq
  • PyPi (πŸ“₯ 86 / month Β· ⏱️ 07.08.2020):

    pip install vizseq
Headliner (🥉16 · ⭐ 220 · 💤) - Easy training and deployment of seq2seq models. MIT
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 37 Β· πŸ“¦ 3 Β· πŸ“‹ 14 - 7% open Β· ⏱️ 14.02.2020):

    git clone https://github.com/as-ideas/headliner
  • PyPi (πŸ“₯ 390 / month Β· πŸ“¦ 1 Β· ⏱️ 24.01.2020):

    pip install headliner
Translate (🥉15 · ⭐ 670) - Translate - a PyTorch Language Library. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 83 Β· πŸ”€ 160 Β· πŸ“‹ 93 - 70% open Β· ⏱️ 16.11.2020):

    git clone https://github.com/pytorch/translate
  • PyPi (πŸ“₯ 14 / month Β· ⏱️ 01.05.2018):

    pip install pytorch-translate
NeuralQA (🥉15 · ⭐ 180) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 23 Β· πŸ“¦ 1 Β· πŸ“‹ 26 - 69% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/victordibia/neuralqa
  • PyPi (πŸ“₯ 120 / month Β· ⏱️ 29.07.2020):

    pip install neuralqa
OpenNRE (🥉14 · ⭐ 2.9K · ➕) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 830 Β· πŸ“‹ 300 - 4% open Β· ⏱️ 15.09.2020):

    git clone https://github.com/thunlp/OpenNRE
TransferNLP (🥉14 · ⭐ 280 · 💤) - NLP library designed for reproducible experimentation.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 18 Β· πŸ“‹ 23 - 17% open Β· ⏱️ 28.05.2020):

    git clone https://github.com/feedly/transfer-nlp
  • PyPi (πŸ“₯ 80 / month Β· ⏱️ 28.05.2020):

    pip install transfer-nlp
Show 10 hidden projects...
  • gensim (πŸ₯‡35 Β· ⭐ 12K) - Topic Modelling for Humans. ❗️LGPL-2.1
  • fuzzywuzzy (πŸ₯ˆ29 Β· ⭐ 7.7K Β· πŸ’€) - Fuzzy String Matching in Python. ❗️GPL-2.0
  • langid (πŸ₯ˆ26 Β· ⭐ 1.7K Β· πŸ’€) - Stand-alone language identification system. BSD-3
  • polyglot (πŸ₯ˆ25 Β· ⭐ 1.7K) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
  • anaGo (πŸ₯‰22 Β· ⭐ 1.4K Β· πŸ’€) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
  • MatchZoo (πŸ₯‰21 Β· ⭐ 3.3K Β· πŸ’€) - Facilitating the design, comparison and sharing of deep.. Apache-2
  • stop-words (πŸ₯‰20 Β· ⭐ 120 Β· πŸ’€) - Get list of common stop words in various languages in Python. BSD-3
  • pyfasttext (πŸ₯‰19 Β· ⭐ 230 Β· πŸ’€) - Yet another Python binding for fastText. ❗️GPL-3.0
  • NeuroNER (πŸ₯‰18 Β· ⭐ 1.5K Β· πŸ’€) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
  • ONNX-T5 (πŸ₯‰11 Β· ⭐ 130 Β· 🐣) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2

Image Data

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
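To make "augmentation" concrete, here is a minimal sketch of two common augmentations, a horizontal flip and a 90-degree rotation, written in plain Python on a grayscale image stored as nested lists. It does not use or reproduce the API of any library in this list; the type alias `Image` and both function names are assumptions made for illustration.

```python
from typing import List

# Hypothetical representation: a grayscale image as rows of pixel intensities.
Image = List[List[int]]


def flip_horizontal(img: Image) -> Image:
    """Mirror the image left to right by reversing every row."""
    return [row[::-1] for row in img]


def rotate_90_clockwise(img: Image) -> Image:
    """Rotate the image 90 degrees clockwise.

    Reversing the row order and then transposing turns the bottom row
    into the first column of the result.
    """
    return [list(column) for column in zip(*img[::-1])]


if __name__ == "__main__":
    sample = [[1, 2],
              [3, 4]]
    print(flip_horizontal(sample))      # [[2, 1], [4, 3]]
    print(rotate_90_clockwise(sample))  # [[3, 1], [4, 2]]
```

Real augmentation libraries generally operate on array types and add randomness, interpolation, and label-aware transforms; the sketch only shows the underlying pixel rearrangement.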

scikit-image (🥇36 · ⭐ 4.1K) - Image processing in Python. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 470 Β· πŸ”€ 1.7K Β· πŸ“¦ 57K Β· πŸ“‹ 2.1K - 30% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/scikit-image/scikit-image
  • PyPi (πŸ“₯ 1.8M / month Β· πŸ“¦ 15K Β· ⏱️ 23.12.2020):

    pip install scikit-image
  • Conda (πŸ“₯ 2M Β· ⏱️ 23.12.2020):

    conda install -c conda-forge scikit-image
torchvision (🥇35 · ⭐ 8K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 350 Β· πŸ”€ 4.1K Β· πŸ“¦ 38K Β· πŸ“‹ 1.5K - 30% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/pytorch/vision
  • PyPi (πŸ“₯ 720K / month Β· πŸ“¦ 4.6K Β· ⏱️ 10.12.2020):

    pip install torchvision
  • Conda (πŸ“₯ 34K Β· ⏱️ 14.10.2018):

    conda install -c conda-forge torchvision
imgaug (🥇32 · ⭐ 10K) - Image augmentation for machine learning experiments. MIT
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 1.9K Β· πŸ“¦ 4.9K Β· πŸ“‹ 420 - 51% open Β· ⏱️ 01.06.2020):

    git clone https://github.com/aleju/imgaug
  • PyPi (πŸ“₯ 150K / month Β· πŸ“¦ 280 Β· ⏱️ 05.02.2020):

    pip install imgaug
  • Conda (πŸ“₯ 30K Β· ⏱️ 14.02.2020):

    conda install -c conda-forge imgaug
imageio (🥇32 · ⭐ 820) - Python library for reading and writing image data. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 69 Β· πŸ”€ 160 Β· πŸ“¦ 34K Β· πŸ“‹ 330 - 18% open Β· ⏱️ 07.12.2020):

    git clone https://github.com/imageio/imageio
  • PyPi (πŸ“₯ 1.8M / month Β· πŸ“¦ 3.8K Β· ⏱️ 06.07.2020):

    pip install imageio
  • Conda (πŸ“₯ 1.6M Β· ⏱️ 06.07.2020):

    conda install -c conda-forge imageio
opencv-python (🥈30 · ⭐ 1.6K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 300 Β· πŸ“‹ 380 - 5% open Β· ⏱️ 11.12.2020):

    git clone https://github.com/skvark/opencv-python
  • PyPi (πŸ“₯ 2.2M / month Β· πŸ“¦ 13K Β· ⏱️ 02.11.2020):

    pip install opencv-python
Wand (🥈30 · ⭐ 1K · ➕) - The ctypes-based simple ImageMagick binding for Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 94 Β· πŸ”€ 180 Β· πŸ“₯ 5.2K Β· πŸ“¦ 3K Β· πŸ“‹ 330 - 3% open Β· ⏱️ 08.12.2020):

    git clone https://github.com/emcconville/wand
  • PyPi (πŸ“₯ 130K / month Β· πŸ“¦ 1.2K Β· ⏱️ 29.11.2020):

    pip install wand
Face Recognition (🥈29 · ⭐ 38K) - The world's simplest facial recognition api for Python and.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 45 Β· πŸ”€ 10K Β· πŸ“₯ 430 Β· πŸ“‹ 1.1K - 51% open Β· ⏱️ 26.09.2020):

    git clone https://github.com/ageitgey/face_recognition
  • PyPi (πŸ“₯ 32K / month Β· πŸ“¦ 410 Β· ⏱️ 21.08.2018):

    pip install face_recognition
MoviePy (🥈29 · ⭐ 7.2K) - Video editing with Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 1K Β· πŸ“¦ 5.5K Β· πŸ“‹ 970 - 34% open Β· ⏱️ 25.11.2020):

    git clone https://github.com/Zulko/moviepy
  • PyPi (πŸ“₯ 140K / month Β· πŸ“¦ 1.1K Β· ⏱️ 05.10.2020):

    pip install moviepy
  • Conda (πŸ“₯ 71K Β· ⏱️ 23.02.2020):

    conda install -c conda-forge moviepy
Albumentations (🥈29 · ⭐ 6.9K) - Fast image augmentation library and easy to use wrapper.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 71 Β· πŸ”€ 880 Β· πŸ“¦ 2.4K Β· πŸ“‹ 380 - 39% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/albumentations-team/albumentations
  • PyPi (πŸ“₯ 61K / month Β· πŸ“¦ 64 Β· ⏱️ 29.11.2020):

    pip install albumentations
  • Conda (πŸ“₯ 13K Β· ⏱️ 29.11.2020):

    conda install -c conda-forge albumentations
Kornia (🥈28 · ⭐ 3.4K) - Open Source Differentiable Computer Vision Library for PyTorch. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 78 Β· πŸ”€ 340 Β· πŸ“¦ 210 Β· πŸ“‹ 340 - 30% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/kornia/kornia
  • PyPi (πŸ“₯ 16K / month Β· ⏱️ 20.10.2020):

    pip install kornia
ImageHash (🥈28 · ⭐ 1.8K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 240 Β· πŸ“¦ 1.8K Β· πŸ“‹ 86 - 19% open Β· ⏱️ 27.11.2020):

    git clone https://github.com/JohannesBuchner/imagehash
  • PyPi (πŸ“₯ 370K / month Β· πŸ“¦ 530 Β· ⏱️ 19.11.2020):

    pip install ImageHash
  • Conda (πŸ“₯ 95K Β· ⏱️ 19.11.2020):

    conda install -c conda-forge imagehash
GluonCV (🥈27 · ⭐ 4.4K · 📉) - Gluon CV Toolkit. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 1K Β· πŸ“¦ 380 Β· πŸ“‹ 740 - 49% open Β· ⏱️ 19.12.2020):

    git clone https://github.com/dmlc/gluon-cv
  • PyPi (πŸ“₯ 45K / month Β· πŸ“¦ 30 Β· ⏱️ 26.12.2020):

    pip install gluoncv
detectron2 (🥈26 · ⭐ 14K) - Detectron2 is FAIR's next-generation platform for object.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 3.5K Β· πŸ“¦ 140 Β· πŸ“‹ 2.1K - 3% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/facebookresearch/detectron2
  • Conda (πŸ“₯ 12K Β· ⏱️ 05.08.2020):

    conda install -c conda-forge detectron2
InsightFace (🥈26 · ⭐ 8.3K) - Face Analysis Project on MXNet. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 2.8K Β· πŸ“¦ 46 Β· πŸ“‹ 1.3K - 51% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/deepinsight/insightface
  • PyPi (πŸ“₯ 6.2K / month Β· πŸ“¦ 1 Β· ⏱️ 07.09.2019):

    pip install insightface
imageai (🥈26 · ⭐ 5.6K · 💤) - A Python library built to empower developers to build applications.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 1.6K Β· πŸ“₯ 520K Β· πŸ“¦ 630 Β· πŸ“‹ 580 - 33% open Β· ⏱️ 14.01.2020):

    git clone https://github.com/OlafenwaMoses/ImageAI
  • PyPi (πŸ“₯ 6K / month Β· πŸ“¦ 12 Β· ⏱️ 13.09.2019):

    pip install imageai
MMDetection (🥉25 · ⭐ 13K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 4.4K Β· πŸ“¦ 18 Β· πŸ“‹ 3.2K - 11% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/open-mmlab/mmdetection
PyTorch Image Models (🥉25 · ⭐ 6.1K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 25 Β· πŸ”€ 860 Β· πŸ“₯ 150K Β· πŸ“¦ 220 Β· πŸ“‹ 220 - 9% open Β· ⏱️ 19.12.2020):

    git clone https://github.com/rwightman/pytorch-image-models
Augmentor (🥉25 · ⭐ 4.3K · 💤) - Image augmentation library in Python for machine learning. MIT
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 780 Β· πŸ“¦ 270 Β· πŸ“‹ 180 - 63% open Β· ⏱️ 09.03.2020):

    git clone https://github.com/mdbloice/Augmentor
  • PyPi (πŸ“₯ 2.6K / month Β· πŸ“¦ 42 Β· ⏱️ 17.01.2020):

    pip install Augmentor
chainercv (🥉25 · ⭐ 1.4K · 💤) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 320 Β· πŸ“¦ 210 Β· πŸ“‹ 220 - 24% open Β· ⏱️ 07.01.2020):

    git clone https://github.com/chainer/chainercv
  • PyPi (πŸ“₯ 3.8K / month Β· πŸ“¦ 24 Β· ⏱️ 31.05.2019):

    pip install chainercv
PyTorch3D (🥉24 · ⭐ 4.1K) - PyTorch3D is FAIR's library of reusable components for deep.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 450 Β· πŸ“¦ 38 Β· πŸ“‹ 440 - 12% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/facebookresearch/pytorch3d
  • PyPi (πŸ“₯ 4.7K / month Β· ⏱️ 12.11.2020):

    pip install pytorch3d
  • Conda (πŸ“₯ 5.2K Β· ⏱️ 12.11.2020):

    conda install -c pytorch3d pytorch3d
mtcnn (🥉24 · ⭐ 1.3K · ➕) - MTCNN face detection implementation for TensorFlow, as a PIP.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 360 Β· πŸ“¦ 770 Β· πŸ“‹ 87 - 58% open Β· ⏱️ 07.06.2020):

    git clone https://github.com/ipazc/mtcnn
  • PyPi (πŸ“₯ 14K / month Β· πŸ“¦ 21 Β· ⏱️ 14.11.2019):

    pip install mtcnn
segmentation_models (🥉23 · ⭐ 2.8K · 💤) - Segmentation models with pretrained backbones. Keras and.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 650 Β· πŸ“‹ 370 - 36% open Β· ⏱️ 17.04.2020):

    git clone https://github.com/qubvel/segmentation_models
  • PyPi (πŸ“₯ 15K / month Β· πŸ“¦ 12 Β· ⏱️ 10.01.2020):

    pip install segmentation_models
Face Alignment (🥉22 · ⭐ 4.6K) - 2D and 3D Face alignment library built using PyTorch. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 990 Β· πŸ“‹ 220 - 13% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/1adrianb/face-alignment
  • PyPi (πŸ“₯ 5.5K / month Β· πŸ“¦ 6 Β· ⏱️ 22.12.2020):

    pip install face-alignment
Image Deduplicator (🥉22 · ⭐ 3.3K) - Finding duplicate images made easy! Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 270 Β· πŸ“¦ 8 Β· πŸ“‹ 64 - 21% open Β· ⏱️ 23.11.2020):

    git clone https://github.com/idealo/imagededup
  • PyPi (πŸ“₯ 2.3K / month Β· πŸ“¦ 1 Β· ⏱️ 22.11.2020):

    pip install imagededup
pyvips (🥉22 · ⭐ 280 · ➕) - Python binding for libvips using cffi. MIT
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 25 Β· πŸ“¦ 130 Β· πŸ“‹ 200 - 34% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/libvips/pyvips
  • PyPi (πŸ“₯ 4.1K / month Β· πŸ“¦ 22 Β· ⏱️ 18.12.2020):

    pip install pyvips
  • Conda (πŸ“₯ 5.8K Β· ⏱️ 14.10.2020):

    conda install -c conda-forge pyvips
MMF (🥉21 · ⭐ 4K) - A modular framework for vision & language multimodal research from.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 590 Β· πŸ“¦ 6 Β· πŸ“‹ 400 - 21% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/facebookresearch/mmf
  • PyPi (πŸ“₯ 250 / month Β· ⏱️ 11.06.2020):

    pip install mmf
Image Super-Resolution (🥉21 · ⭐ 2.4K) - Super-scale your images and run experiments with Residual.. Apache-2
  • GitHub (👨‍💻 9 · 🔀 450 · 📦 35 · 📋 140 - 32% open · ⏱️ 11.11.2020):

    git clone https://github.com/idealo/image-super-resolution
  • PyPi (📥 2.7K / month · 📦 4 · ⏱️ 08.01.2020):

    pip install ISR
  • Dockerhub (📥 110 · ⏱️ 01.04.2019):

    docker pull idealo/image-super-resolution-gpu
tensorflow-graphics (🥉21 · ⭐ 2.4K) - TensorFlow Graphics: Differentiable Graphics Layers for.. Apache-2
  • GitHub (👨‍💻 30 · 🔀 260 · 📋 170 - 51% open · ⏱️ 30.10.2020):

    git clone https://github.com/tensorflow/graphics
  • PyPi (📥 850 / month · 📦 3 · ⏱️ 20.05.2020):

    pip install tensorflow-graphics
Luminoth (🥉21 · ⭐ 2.3K · 💤) - Deep Learning toolkit for Computer Vision. BSD-3
  • GitHub (👨‍💻 15 · 🔀 400 · 📥 11K · 📦 27 · 📋 180 - 28% open · ⏱️ 07.01.2020):

    git clone https://github.com/tryolabs/luminoth
  • PyPi (📥 550 / month · ⏱️ 09.11.2018):

    pip install luminoth
caer (🥉21 · ⭐ 280 · 🐣) - A lightweight, scalable, GPU-accelerated Computer Vision library for.. MIT
  • GitHub (👨‍💻 2 · 🔀 32 · 📥 10 · 📋 9 - 44% open · ⏱️ 15.12.2020):

    git clone https://github.com/jasmcaus/caer
  • PyPi (📥 600K / month · ⏱️ 09.12.2020):

    pip install caer
Classy Vision (🥉20 · ⭐ 1K) - An end-to-end PyTorch framework for image and video.. MIT
  • GitHub (👨‍💻 51 · 🔀 180 · 📋 85 - 54% open · ⏱️ 23.12.2020):

    git clone https://github.com/facebookresearch/ClassyVision
  • PyPi (📥 530 / month · ⏱️ 20.11.2020):

    pip install classy_vision
  • Conda (📥 5.4K · ⏱️ 11.12.2020):

    conda install -c conda-forge classy_vision
nude.py (🥉20 · ⭐ 780 · ➕) - Nudity detection with Python. MIT
  • GitHub (👨‍💻 12 · 🔀 130 · 📦 47 · 📋 8 - 75% open · ⏱️ 23.11.2020):

    git clone https://github.com/hhatto/nude.py
  • PyPi (📥 520 / month · 📦 9 · ⏱️ 23.11.2020):

    pip install nudepy
PaddleDetection (🥉17 · ⭐ 2.1K) - Object detection and instance segmentation toolkit based.. Apache-2
  • GitHub (👨‍💻 43 · 🔀 590 · 📋 1.1K - 25% open · ⏱️ 21.12.2020):

    git clone https://github.com/PaddlePaddle/PaddleDetection
Norfair (🥉17 · ⭐ 580 · 🐣) - Lightweight Python library for adding real-time 2D object tracking.. BSD-3
  • GitHub (👨‍💻 7 · 🔀 37 · 📋 16 - 31% open · ⏱️ 22.12.2020):

    git clone https://github.com/tryolabs/norfair
  • PyPi (📥 250 / month · ⏱️ 07.12.2020):

    pip install norfair
lightly (🥉16 · ⭐ 380 · 🐣) - A python library for self-supervised learning. MIT
  • GitHub (👨‍💻 3 · 🔀 18 · 📋 32 - 46% open · ⏱️ 23.12.2020):

    git clone https://github.com/lightly-ai/lightly
  • PyPi (📥 370 / month · ⏱️ 17.12.2020):

    pip install lightly
DE⫶TR (🥉14 · ⭐ 5.7K) - End-to-End Object Detection with Transformers. Apache-2
  • GitHub (👨‍💻 19 · 🔀 800 · 📋 250 - 22% open · ⏱️ 15.11.2020):

    git clone https://github.com/facebookresearch/detr
PySlowFast (🥉14 · ⭐ 3.2K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
  • GitHub (👨‍💻 18 · 🔀 580 · 📦 2 · 📋 330 - 45% open · ⏱️ 17.12.2020):

    git clone https://github.com/facebookresearch/SlowFast
pycls (🥉13 · ⭐ 1.4K) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub (👨‍💻 7 · 🔀 140 · 📦 1 · 📋 54 - 18% open · ⏱️ 20.11.2020):

    git clone https://github.com/facebookresearch/pycls
Show 5 hidden projects...
  • Pillow (🥇38 · ⭐ 8K) - The friendly PIL fork (Python Imaging Library). ❗️PIL
  • glfw (🥈29 · ⭐ 7.1K · ➕) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and.. ❗️Zlib
  • imutils (🥈27 · ⭐ 3.4K · 💀) - A series of convenience functions to make basic image processing.. MIT
  • Pillow-SIMD (🥉23 · ⭐ 1.5K) - The friendly PIL fork. ❗️PIL
  • Torch Points 3D (🥉17 · ⭐ 940) - Pytorch framework for doing deep learning on point clouds. ❗️Unlicensed

Graph Data

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (🥇37 · ⭐ 8.4K) - Network Analysis in Python. BSD-3
  • GitHub (👨‍💻 480 · 🔀 2.2K · 📥 51 · 📦 63K · 📋 2.5K - 9% open · ⏱️ 26.12.2020):

    git clone https://github.com/networkx/networkx
  • PyPi (📥 5.5M / month · 📦 21K · ⏱️ 22.08.2020):

    pip install networkx
  • Conda (📥 2.8M · ⏱️ 23.08.2020):

    conda install -c conda-forge networkx
PyTorch Geometric (🥇28 · ⭐ 9.8K) - Geometric Deep Learning Extension Library for PyTorch. MIT
  • GitHub (👨‍💻 140 · 🔀 1.6K · 📋 1.7K - 37% open · ⏱️ 25.12.2020):

    git clone https://github.com/rusty1s/pytorch_geometric
  • PyPi (📥 12K / month · 📦 24 · ⏱️ 27.11.2020):

    pip install torch-geometric
dgl (🥇27 · ⭐ 6.4K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
  • GitHub (👨‍💻 93 · 🔀 1.3K · 📋 870 - 24% open · ⏱️ 26.12.2020):

    git clone https://github.com/dmlc/dgl
  • PyPi (📥 15K / month · 📦 15 · ⏱️ 26.12.2020):

    pip install dgl
StellarGraph (🥈25 · ⭐ 1.7K) - StellarGraph - Machine Learning on Graphs. Apache-2
  • GitHub (👨‍💻 28 · 🔀 240 · 📦 40 · 📋 900 - 23% open · ⏱️ 21.07.2020):

    git clone https://github.com/stellargraph/stellargraph
  • PyPi (📥 8.4K / month · 📦 2 · ⏱️ 25.06.2020):

    pip install stellargraph
Spektral (🥈23 · ⭐ 1.6K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
  • GitHub (👨‍💻 16 · 🔀 180 · 📦 26 · 📋 110 - 20% open · ⏱️ 23.12.2020):

    git clone https://github.com/danielegrattarola/spektral
  • PyPi (📥 4K / month · ⏱️ 10.12.2020):

    pip install spektral
Node2Vec (🥈22 · ⭐ 610) - Implementation of the node2vec algorithm. MIT
  • GitHub (👨‍💻 6 · 🔀 160 · 📦 100 · 📋 47 - 2% open · ⏱️ 28.11.2020):

    git clone https://github.com/eliorc/node2vec
  • PyPi (📥 5.3K / month · 📦 5 · ⏱️ 28.11.2020):

    pip install node2vec
  • Conda (📥 15K · ⏱️ 25.04.2020):

    conda install -c conda-forge node2vec
ogb (🥈21 · ⭐ 670 · ➕) - Benchmark datasets, data loaders, and evaluators for graph machine.. MIT
  • GitHub (👨‍💻 12 · 🔀 91 · 📦 36 · 📋 86 - 2% open · ⏱️ 29.10.2020):

    git clone https://github.com/snap-stanford/ogb
  • PyPi (📥 1.6K / month · ⏱️ 12.08.2020):

    pip install ogb
torch-cluster (🥈21 · ⭐ 310) - PyTorch Extension Library of Optimized Graph Cluster Algorithms. MIT
  • GitHub (👨‍💻 17 · 🔀 61 · 📋 71 - 47% open · ⏱️ 07.12.2020):

    git clone https://github.com/rusty1s/pytorch_cluster
  • PyPi (📥 3K / month · 📦 16 · ⏱️ 31.10.2020):

    pip install torch-cluster
AmpliGraph (🥈20 · ⭐ 1.4K) - Python library for Representation Learning on Knowledge.. Apache-2
  • GitHub (👨‍💻 14 · 🔀 150 · 📦 9 · 📋 170 - 10% open · ⏱️ 25.08.2020):

    git clone https://github.com/Accenture/AmpliGraph
  • PyPi (📥 580 / month · ⏱️ 25.08.2020):

    pip install ampligraph
graph-nets (🥉19 · ⭐ 4.7K) - Build Graph Nets in Tensorflow. Apache-2
  • GitHub (👨‍💻 11 · 🔀 710 · 📋 110 - 11% open · ⏱️ 04.12.2020):

    git clone https://github.com/deepmind/graph_nets
  • PyPi (📥 830 / month · 📦 7 · ⏱️ 29.01.2020):

    pip install graph-nets
PyTorch-BigGraph (🥉19 · ⭐ 2.6K) - Generate embeddings from large-scale graph-structured data. BSD-3
  • GitHub (👨‍💻 19 · 🔀 360 · 📥 100 · 📋 130 - 24% open · ⏱️ 18.12.2020):

    git clone https://github.com/facebookresearch/PyTorch-BigGraph
  • PyPi (📥 260 / month · 📦 4 · ⏱️ 01.05.2019):

    pip install torchbiggraph
Paddle Graph Learning (🥉19 · ⭐ 850) - Paddle Graph Learning (PGL) is an efficient and flexible.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 110 · 📦 9 · 📋 41 - 34% open · ⏱️ 14.12.2020):

    git clone https://github.com/PaddlePaddle/PGL
  • PyPi (📥 760 / month · 📦 1 · ⏱️ 20.11.2020):

    pip install pgl
PyKEEN (🥉19 · ⭐ 270) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub (👨‍💻 17 · 🔀 41 · 📥 37 · 📋 90 - 36% open · ⏱️ 22.12.2020):

    git clone https://github.com/pykeen/pykeen
  • PyPi (📥 350 / month · 📦 1 · ⏱️ 25.08.2020):

    pip install pykeen
DeepGraph (🥉18 · ⭐ 230) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
  • GitHub (👨‍💻 2 · 🔀 33 · 📦 1 · 📋 12 - 66% open · ⏱️ 01.10.2020):

    git clone https://github.com/deepgraph/deepgraph
  • PyPi (📥 280 / month · ⏱️ 01.10.2020):

    pip install deepgraph
  • Conda (📥 75K · ⏱️ 13.10.2020):

    conda install -c conda-forge deepgraph
kglib (🥉17 · ⭐ 380 · ➕) - Grakn Knowledge Graph Library (ML R&D). Apache-2
  • GitHub (👨‍💻 7 · 🔀 68 · 📥 210 · 📋 59 - 16% open · ⏱️ 10.11.2020):

    git clone https://github.com/graknlabs/kglib
  • PyPi (📥 84 / month · ⏱️ 19.08.2020):

    pip install grakn-kglib
Euler (🥉15 · ⭐ 2.4K) - A distributed graph deep learning framework. Apache-2
  • GitHub (👨‍💻 5 · 🔀 500 · 📋 300 - 67% open · ⏱️ 29.07.2020):

    git clone https://github.com/alibaba/euler
  • PyPi (📥 16 / month · ⏱️ 10.04.2019):

    pip install euler-gl
GraphEmbedding (🥉15 · ⭐ 1.7K) - Implementation and experiments of graph embedding algorithms. MIT
  • GitHub (👨‍💻 6 · 🔀 530 · 📦 7 · 📋 39 - 66% open · ⏱️ 18.10.2020):

    git clone https://github.com/shenweichen/GraphEmbedding
OpenKE (🥉13 · ⭐ 2.3K · 💤) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub (👨‍💻 10 · 🔀 710 · 📋 270 - 17% open · ⏱️ 08.04.2020):

    git clone https://github.com/thunlp/OpenKE
GraphVite (🥉12 · ⭐ 820 · 💤) - GraphVite: A General and High-performance Graph Embedding.. Apache-2
  • GitHub (👨‍💻 1 · 🔀 110 · 📋 74 - 31% open · ⏱️ 18.03.2020):

    git clone https://github.com/DeepGraphLearning/graphvite
  • Conda (📥 3.3K · ⏱️ 19.03.2020):

    conda install -c milagraph graphvite
Show 7 hidden projects...
  • pygal (🥇27 · ⭐ 2.3K · ➕) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
  • igraph (🥇27 · ⭐ 750) - Python interface for igraph. ❗️GPL-2.0
  • Karate Club (🥈20 · ⭐ 1.1K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
  • DeepWalk (🥉19 · ⭐ 2.1K · 💀) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
  • Sematch (🥉17 · ⭐ 340 · 💀) - semantic similarity framework for knowledge graph. Apache-2
  • GraphSAGE (🥉14 · ⭐ 2K · 💀) - Representation learning on large graphs using stochastic graph.. MIT
  • OpenNE (🥉14 · ⭐ 1.4K · 💀) - An Open-Source Package for Network Embedding (NE). MIT

Audio Data

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

DeepSpeech (🥇31 · ⭐ 16K · 📈) - DeepSpeech is an open source embedded (offline, on-device).. MPL-2.0
  • GitHub (👨‍💻 150 · 🔀 3K · 📥 380K · 📦 370 · 📋 1.9K - 4% open · ⏱️ 19.12.2020):

    git clone https://github.com/mozilla/DeepSpeech
  • PyPi (📥 17K / month · 📦 29 · ⏱️ 19.12.2020):

    pip install deepspeech
Magenta (🥈29 · ⭐ 16K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub (👨‍💻 140 · 🔀 3.3K · 📦 250 · 📋 820 - 34% open · ⏱️ 01.12.2020):

    git clone https://github.com/magenta/magenta
  • PyPi (📥 4.9K / month · 📦 35 · ⏱️ 12.11.2020):

    pip install magenta
Pydub (🥈29 · ⭐ 5K) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub (👨‍💻 79 · 🔀 670 · 📦 5.4K · 📋 400 - 41% open · ⏱️ 14.12.2020):

    git clone https://github.com/jiaaro/pydub
  • PyPi (📥 130K / month · 📦 1.4K · ⏱️ 03.06.2020):

    pip install pydub
  • Conda (📥 12K · ⏱️ 02.02.2019):

    conda install -c conda-forge pydub
torchaudio (🥈29 · ⭐ 1.2K) - Data manipulation and transformation for audio signal.. BSD-2
  • GitHub (👨‍💻 84 · 🔀 260 · 📦 960 · 📋 380 - 39% open · ⏱️ 23.12.2020):

    git clone https://github.com/pytorch/audio
  • PyPi (📥 63K / month · 📦 9 · ⏱️ 27.10.2020):

    pip install torchaudio
spleeter (🥈28 · ⭐ 15K) - Deezer source separation library including pretrained models. MIT
  • GitHub (👨‍💻 17 · 🔀 1.5K · 📥 870K · 📦 120 · 📋 480 - 18% open · ⏱️ 07.12.2020):

    git clone https://github.com/deezer/spleeter
  • PyPi (📥 5.3K / month · ⏱️ 23.10.2020):

    pip install spleeter
  • Conda (📥 40K · ⏱️ 30.06.2020):

    conda install -c conda-forge spleeter
librosa (🥈28 · ⭐ 4.2K) - Python library for audio and music analysis. ISC
  • GitHub (👨‍💻 80 · 🔀 670 · 📦 8.6K · 📋 830 - 6% open · ⏱️ 11.09.2020):

    git clone https://github.com/librosa/librosa
  • PyPi (📥 270K / month · 📦 1.8K · ⏱️ 22.07.2020):

    pip install librosa
  • Conda (📥 240K · ⏱️ 22.07.2020):

    conda install -c conda-forge librosa
pyAudioAnalysis (🥉25 · ⭐ 3.6K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
  • GitHub (👨‍💻 23 · 🔀 920 · 📦 170 · 📋 260 - 60% open · ⏱️ 18.12.2020):

    git clone https://github.com/tyiannak/pyAudioAnalysis
  • PyPi (📥 7.2K / month · 📦 11 · ⏱️ 29.09.2020):

    pip install pyAudioAnalysis
espnet (🥉25 · ⭐ 3.2K · ➕) - End-to-End Speech Processing Toolkit. Apache-2
  • GitHub (👨‍💻 130 · 🔀 960 · 📥 59 · 📦 3 · 📋 1.1K - 10% open · ⏱️ 25.12.2020):

    git clone https://github.com/espnet/espnet
  • PyPi (📥 1.2K / month · ⏱️ 01.12.2020):

    pip install espnet
python-soundfile (🥉25 · ⭐ 350 · ➕) - SoundFile is an audio library based on libsndfile, CFFI,.. BSD-3
  • GitHub (👨‍💻 19 · 🔀 40 · 📥 2K · 📦 5.1K · 📋 140 - 40% open · ⏱️ 01.12.2020):

    git clone https://github.com/bastibe/python-soundfile
  • PyPi (📥 340K / month · 📦 740 · ⏱️ 19.04.2018):

    pip install soundfile
DDSP (🥉23 · ⭐ 1.7K) - DDSP: Differentiable Digital Signal Processing. Apache-2
  • GitHub (👨‍💻 22 · 🔀 150 · 📦 5 · 📋 97 - 13% open · ⏱️ 21.12.2020):

    git clone https://github.com/magenta/ddsp
  • PyPi (📥 1.4K / month · ⏱️ 21.08.2020):

    pip install ddsp
Dejavu (🥉22 · ⭐ 5.3K) - Audio fingerprinting and recognition in Python. MIT
  • GitHub (👨‍💻 23 · 🔀 1.2K · 📦 16 · 📋 210 - 39% open · ⏱️ 03.06.2020):

    git clone https://github.com/worldveil/dejavu
  • PyPi (📥 160 / month · 📦 1 · ⏱️ 19.04.2015):

    pip install PyDejavu
Porcupine (🥉21 · ⭐ 2.3K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub (👨‍💻 25 · 🔀 320 · 📦 4 · 📋 270 - 2% open · ⏱️ 24.12.2020):

    git clone https://github.com/Picovoice/Porcupine
  • PyPi (📥 830 / month · ⏱️ 02.10.2020):

    pip install pvporcupine
Madmom (🥉20 · ⭐ 690 · 💤) - Python audio and music signal processing library. BSD-3
  • GitHub (👨‍💻 19 · 🔀 120 · 📦 110 · 📋 220 - 19% open · ⏱️ 19.12.2019):

    git clone https://github.com/CPJKU/madmom
  • PyPi (📥 1.1K / month · 📦 8 · ⏱️ 14.11.2018):

    pip install madmom
Muda (🥉16 · ⭐ 170) - A library for augmenting annotated audio data. ISC
  • GitHub (👨‍💻 6 · 🔀 30 · 📦 7 · 📋 49 - 10% open · ⏱️ 20.07.2020):

    git clone https://github.com/bmcfee/muda
  • PyPi (📥 120 / month · 📦 1 · ⏱️ 15.11.2019):

    pip install muda
Julius (🥉13 · ⭐ 160 · 🐣) - Fast PyTorch based DSP for audio and 1D signals. MIT
  • GitHub (👨‍💻 2 · 🔀 6 · 📦 1 · ⏱️ 10.12.2020):

    git clone https://github.com/adefossez/julius
  • PyPi (📥 190 / month · ⏱️ 25.11.2020):

    pip install julius
Show 3 hidden projects...

Geospatial Data

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

folium (🥇31 · ⭐ 5.1K) - Python Data. Leaflet.js Maps. MIT
  • GitHub (👨‍💻 120 · 🔀 1.9K · 📦 7.8K · 📋 820 - 17% open · ⏱️ 23.12.2020):

    git clone https://github.com/python-visualization/folium
  • PyPi (📥 250K / month · 📦 970 · ⏱️ 07.05.2020):

    pip install folium
  • Conda (📥 310K · ⏱️ 07.05.2020):

    conda install -c conda-forge folium
geopy (🥇31 · ⭐ 3.1K) - Geocoding library for Python. MIT
  • GitHub (👨‍💻 120 · 🔀 520 · 📦 15K · 📋 240 - 11% open · ⏱️ 27.12.2020):

    git clone https://github.com/geopy/geopy
  • PyPi (📥 2.8M / month · 📦 7.2K · ⏱️ 27.06.2020):

    pip install geopy
  • Conda (📥 490K · ⏱️ 27.06.2020):

    conda install -c conda-forge geopy
GeoPandas (🥇31 · ⭐ 2.4K · 📉) - Python tools for geographic data. BSD-3
  • GitHub (👨‍💻 130 · 🔀 530 · 📥 830 · 📦 6.7K · 📋 940 - 31% open · ⏱️ 20.12.2020):

    git clone https://github.com/geopandas/geopandas
  • PyPi (📥 420K / month · 📦 1.2K · ⏱️ 24.06.2020):

    pip install geopandas
  • Conda (📥 790K · ⏱️ 16.07.2020):

    conda install -c conda-forge geopandas
Rasterio (🥈30 · ⭐ 1.4K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub (👨‍💻 110 · 🔀 390 · 📥 700 · 📦 2.5K · 📋 1.3K - 10% open · ⏱️ 15.12.2020):

    git clone https://github.com/mapbox/rasterio
  • PyPi (📥 160K / month · 📦 850 · ⏱️ 13.12.2020):

    pip install rasterio
  • Conda (📥 820K · ⏱️ 30.10.2020):

    conda install -c conda-forge rasterio
Fiona (🥈30 · ⭐ 760) - Fiona reads and writes geographic data files. BSD-3
  • GitHub (👨‍💻 65 · 🔀 160 · 📦 4.7K · 📋 620 - 11% open · ⏱️ 30.11.2020):

    git clone https://github.com/Toblerity/Fiona
  • PyPi (📥 490K / month · 📦 1.2K · ⏱️ 17.11.2020):

    pip install fiona
  • Conda (📥 1.6M · ⏱️ 17.11.2020):

    conda install -c conda-forge fiona
pyproj (🥈28 · ⭐ 560 · 📈) - Python interface to PROJ (cartographic projections and coordinate.. MIT
  • GitHub (👨‍💻 39 · 🔀 150 · 📦 7.9K · 📋 380 - 1% open · ⏱️ 20.12.2020):

    git clone https://github.com/pyproj4/pyproj
  • PyPi (📥 1.1M / month · 📦 2.5K · ⏱️ 05.11.2020):

    pip install pyproj
  • Conda (📥 1.8M · ⏱️ 06.11.2020):

    conda install -c conda-forge pyproj
ipyleaflet (🥈27 · ⭐ 1.1K) - A Jupyter - Leaflet.js bridge. MIT
  • GitHub (👨‍💻 63 · 🔀 260 · 📦 640 · 📋 370 - 33% open · ⏱️ 30.11.2020):

    git clone https://github.com/jupyter-widgets/ipyleaflet
  • PyPi (📥 16K / month · 📦 98 · ⏱️ 24.07.2020):

    pip install ipyleaflet
  • NPM (📥 150K / month · 📦 2 · ⏱️ 24.07.2020):

    npm install jupyter-leaflet
  • Conda (📥 570K · ⏱️ 29.07.2020):

    conda install -c conda-forge ipyleaflet
geojson (🥈27 · ⭐ 580) - Python bindings and utilities for GeoJSON. BSD-3
  • GitHub (👨‍💻 44 · 🔀 78 · 📦 5.4K · 📋 68 - 26% open · ⏱️ 25.11.2020):

    git clone https://github.com/jazzband/geojson
  • PyPi (📥 420K / month · 📦 1.6K · ⏱️ 09.08.2019):

    pip install geojson
  • Conda (📥 350K · ⏱️ 11.08.2019):

    conda install -c conda-forge geojson
ArcGIS API (🥉25 · ⭐ 930) - Documentation and samples for ArcGIS API for Python. Apache-2
  • GitHub (👨‍💻 60 · 🔀 680 · 📋 310 - 40% open · ⏱️ 18.12.2020):

    git clone https://github.com/Esri/arcgis-python-api
  • PyPi (📥 14K / month · 📦 10 · ⏱️ 30.11.2020):

    pip install arcgis
  • Dockerhub (📥 3.6K · ⭐ 29 · ⏱️ 06.03.2020):

    docker pull esridocker/arcgis-api-python-notebook
PySAL (🥉24 · ⭐ 800) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
  • GitHub (👨‍💻 69 · 🔀 240 · 📋 630 - 9% open · ⏱️ 01.08.2020):

    git clone https://github.com/pysal/pysal
  • PyPi (📥 11K / month · 📦 18 · ⏱️ 30.07.2020):

    pip install pysal
  • Conda (📥 390K · ⏱️ 30.07.2020):

    conda install -c conda-forge pysal
GeoViews (🥉23 · ⭐ 320) - Simple, concise geographical visualization in Python. BSD-3
  • GitHub (👨‍💻 21 · 🔀 59 · 📦 180 · 📋 250 - 31% open · ⏱️ 21.09.2020):

    git clone https://github.com/holoviz/geoviews
  • PyPi (📥 1.5K / month · 📦 10 · ⏱️ 30.03.2020):

    pip install geoviews
  • Conda (📥 53K · ⏱️ 23.09.2020):

    conda install -c conda-forge geoviews
EarthPy (🥉22 · ⭐ 210) - A package built to support working with spatial data using open source.. BSD-3
  • GitHub (👨‍💻 38 · 🔀 90 · 📦 72 · 📋 220 - 9% open · ⏱️ 03.12.2020):

    git clone https://github.com/earthlab/earthpy
  • PyPi (📥 2K / month · 📦 3 · ⏱️ 18.06.2020):

    pip install earthpy
  • Conda (📥 26K · ⏱️ 19.06.2020):

    conda install -c conda-forge earthpy
pymap3d (🥉21 · ⭐ 170) - pure-Python (Numpy optional) 3D coordinate conversions for geospace.. BSD-2
  • GitHub (👨‍💻 8 · 🔀 55 · 📋 24 - 8% open · ⏱️ 23.12.2020):

    git clone https://github.com/geospace-code/pymap3d
  • PyPi (📥 14K / month · 📦 3 · ⏱️ 21.09.2020):

    pip install pymap3d
  • Conda (📥 5.8K · ⏱️ 24.09.2020):

    conda install -c conda-forge pymap3d
Mapbox GL (🥉20 · ⭐ 550 · 💤) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
  • GitHub (👨‍💻 20 · 🔀 120 · 📦 85 · 📋 100 - 33% open · ⏱️ 03.12.2019):

    git clone https://github.com/mapbox/mapboxgl-jupyter
  • PyPi (📥 6.2K / month · 📦 18 · ⏱️ 02.06.2019):

    pip install mapboxgl
Show 6 hidden projects...
  • Geocoder (🥈29 · ⭐ 1.3K · 💀) - Python Geocoder. MIT
  • Cartopy (🥈27 · ⭐ 1.4K) - Cartopy - a cartographic python library with matplotlib support. ❗️LGPL-3.0
  • Satpy (🥉25 · ⭐ 660) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
  • gmaps (🥉22 · ⭐ 690 · 💀) - Google maps for Jupyter notebooks. BSD-3
  • Sentinelsat (🥉22 · ⭐ 530) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
  • geoplotlib (🥉20 · ⭐ 880 · 💀) - python toolbox for visualizing geographical data and making maps. MIT

Financial Data

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

zipline (🥇30 · ⭐ 13K) - Zipline, a Pythonic Algorithmic Trading Library. Apache-2
  • GitHub (👨‍💻 150 · 🔀 3.6K · 📦 660 · 📋 960 - 33% open · ⏱️ 14.10.2020):

    git clone https://github.com/quantopian/zipline
  • PyPi (📥 3.9K / month · 📦 79 · ⏱️ 05.10.2020):

    pip install zipline
yfinance (🥇29 · ⭐ 3.7K) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
  • GitHub (👨‍💻 25 · 🔀 940 · 📦 2.4K · 📋 460 - 63% open · ⏱️ 25.09.2020):

    git clone https://github.com/ranaroussi/yfinance
  • PyPi (📥 110K / month · 📦 26 · ⏱️ 05.10.2020):

    pip install yfinance
  • Conda (📥 28K · ⏱️ 27.12.2019):

    conda install -c ranaroussi yfinance
Alpha Vantage (🥈27 · ⭐ 2.9K) - A python wrapper for Alpha Vantage API for financial data. MIT
  • GitHub (👨‍💻 36 · 🔀 540 · 📋 210 - 7% open · ⏱️ 21.12.2020):

    git clone https://github.com/RomelTorres/alpha_vantage
  • PyPi (📥 26K / month · 📦 200 · ⏱️ 26.08.2018):

    pip install alpha_vantage
empyrical (🥈26 · ⭐ 670) - Common financial risk and performance metrics. Used by zipline.. Apache-2
  • GitHub (👨‍💻 22 · 🔀 210 · 📦 480 · 📋 53 - 50% open · ⏱️ 14.10.2020):

    git clone https://github.com/quantopian/empyrical
  • PyPi (📥 17K / month · 📦 220 · ⏱️ 13.10.2020):

    pip install empyrical
  • Conda (📥 8.8K · ⏱️ 14.10.2020):

    conda install -c conda-forge empyrical
pyfolio (🥈25 · ⭐ 3.3K) - Portfolio and risk analytics in Python. Apache-2
  • GitHub (👨‍💻 55 · 🔀 1K · 📦 180 · 📋 380 - 31% open · ⏱️ 15.07.2020):

    git clone https://github.com/quantopian/pyfolio
  • PyPi (📥 5.5K / month · 📦 27 · ⏱️ 15.04.2019):

    pip install pyfolio
  • Conda (📥 5.3K · ⏱️ 16.05.2020):

    conda install -c conda-forge pyfolio
Alphalens (🥈24 · ⭐ 1.7K · 💤) - Performance analysis of predictive (alpha) stock factors. Apache-2
  • GitHub (👨‍💻 25 · 🔀 600 · 📦 340 · 📋 180 - 20% open · ⏱️ 27.04.2020):

    git clone https://github.com/quantopian/alphalens
  • PyPi (📥 1.6K / month · 📦 14 · ⏱️ 27.04.2020):

    pip install alphalens
  • Conda (📥 10K · ⏱️ 16.05.2020):

    conda install -c conda-forge alphalens
IB-insync (🥈24 · ⭐ 1.2K) - Python sync/async framework for Interactive Brokers API. BSD-2
  • GitHub (👨‍💻 25 · 🔀 370 · 📋 280 - 1% open · ⏱️ 13.12.2020):

    git clone https://github.com/erdewit/ib_insync
  • PyPi (📥 3.1K / month · 📦 12 · ⏱️ 03.11.2020):

    pip install ib_insync
  • Conda (📥 6.9K · ⏱️ 07.11.2020):

    conda install -c conda-forge ib-insync
ffn (🥈24 · ⭐ 730) - ffn - a financial function library for Python. MIT
  • GitHub (👨‍💻 23 · 🔀 170 · 📦 94 · 📋 73 - 21% open · ⏱️ 18.12.2020):

    git clone https://github.com/pmorissette/ffn
  • PyPi (📥 11K / month · 📦 16 · ⏱️ 28.05.2018):

    pip install ffn
bt (🥉22 · ⭐ 910) - bt - flexible backtesting for Python. MIT
  • GitHub (👨‍💻 20 · 🔀 250 · 📦 39 · 📋 210 - 25% open · ⏱️ 18.12.2020):

    git clone https://github.com/pmorissette/bt
  • PyPi (📥 3.6K / month · 📦 20 · ⏱️ 28.05.2018):

    pip install bt
TensorTrade (🥉21 · ⭐ 2K) - An open source reinforcement learning framework for training,.. Apache-2
  • GitHub (👨‍💻 44 · 🔀 530 · 📦 9 · 📋 150 - 22% open · ⏱️ 20.12.2020):

    git clone https://github.com/tensortrade-org/tensortrade
  • PyPi (📥 720 / month · 📦 1 · ⏱️ 24.08.2020):

    pip install tensortrade
finmarketpy (🥉20 · ⭐ 2.4K) - Python library for backtesting trading strategies & analyzing.. Apache-2
  • GitHub (👨‍💻 14 · 🔀 410 · 📥 39 · 📦 3 · 📋 24 - 87% open · ⏱️ 26.12.2020):

    git clone https://github.com/cuemacro/finmarketpy
  • PyPi (📥 89 / month · ⏱️ 20.10.2020):

    pip install finmarketpy
Qlib (🥉19 · ⭐ 3.3K · 🐣) - Qlib is an AI-oriented quantitative investment platform, which aims.. MIT
  • GitHub (👨‍💻 24 · 🔀 480 · 📥 180 · 📋 72 - 54% open · ⏱️ 26.12.2020):

    git clone https://github.com/microsoft/qlib
  • PyPi (📥 2.5K / month · ⏱️ 12.12.2020):

    pip install pyqlib
Crypto Signals (🥉18 · ⭐ 2.4K) - Github.com/CryptoSignal - #1 Quant Trading & Technical Analysis.. MIT
  • GitHub (👨‍💻 25 · 🔀 640 · 📋 230 - 18% open · ⏱️ 03.09.2020):

    git clone https://github.com/CryptoSignal/crypto-signal
  • Dockerhub (📥 41K · ⭐ 7 · ⏱️ 03.09.2020):

    docker pull shadowreaver/crypto-signal
Show 6 hidden projects...

Time Series Data

Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.

Prophet (🥇30 · ⭐ 12K) - Tool for producing high quality forecasts for time series data that has.. MIT
  • GitHub (👨‍💻 120 · 🔀 3.4K · 📥 520 · 📋 1.5K - 6% open · ⏱️ 08.12.2020):

    git clone https://github.com/facebook/prophet
  • PyPi (📥 670K / month · 📦 150 · ⏱️ 05.09.2020):

    pip install fbprophet
tsfresh (🥇26 · ⭐ 5.2K) - Automatic extraction of relevant features from time series:. MIT
  • GitHub (👨‍💻 70 · 🔀 810 · 📋 420 - 6% open · ⏱️ 23.12.2020):

    git clone https://github.com/blue-yonder/tsfresh
  • PyPi (📥 160K / month · 📦 33 · ⏱️ 09.09.2020):

    pip install tsfresh
  • Conda (📥 25K · ⏱️ 10.09.2020):

    conda install -c conda-forge tsfresh
sktime (🥇26 · ⭐ 3.3K) - A unified framework for machine learning with time series. BSD-3
  • GitHub (👨‍💻 69 · 🔀 390 · 📥 41 · 📦 84 · 📋 300 - 35% open · ⏱️ 23.12.2020):

    git clone https://github.com/alan-turing-institute/sktime
  • PyPi (📥 6.4K / month · ⏱️ 19.12.2020):

    pip install sktime
tslearn (🥇26 · ⭐ 1.4K) - A machine learning toolkit dedicated to time-series data. BSD-2
  • GitHub (👨‍💻 27 · 🔀 210 · 📦 170 · 📋 210 - 24% open · ⏱️ 07.12.2020):

    git clone https://github.com/tslearn-team/tslearn
  • PyPi (📥 51K / month · 📦 11 · ⏱️ 18.06.2020):

    pip install tslearn
  • Conda (📥 180K · ⏱️ 19.06.2020):

    conda install -c conda-forge tslearn
Streamz (🥈24 · ⭐ 890) - Real-time stream processing for python. BSD-3
  • GitHub (👨‍💻 38 · 🔀 110 · 📦 180 · 📋 210 - 41% open · ⏱️ 10.12.2020):

    git clone https://github.com/python-streamz/streamz
  • PyPi (📥 2.8K / month · 📦 16 · ⏱️ 02.11.2020):

    pip install streamz
  • Conda (📥 99K · ⏱️ 03.11.2020):

    conda install -c conda-forge streamz
GluonTS (🥈23 · ⭐ 1.6K) - Probabilistic time series modeling in Python. Apache-2
  • GitHub (👨‍💻 66 · 🔀 350 · 📋 520 - 34% open · ⏱️ 23.12.2020):

    git clone https://github.com/awslabs/gluon-ts
  • PyPi (📥 22K / month · ⏱️ 08.12.2020):

    pip install gluonts
STUMPY (🥉22 · ⭐ 1.6K) - STUMPY is a powerful and scalable Python library for computing a.. BSD-3
  • GitHub (👨‍💻 20 · 🔀 150 · 📋 200 - 14% open · ⏱️ 23.12.2020):

    git clone https://github.com/TDAmeritrade/stumpy
  • PyPi (📥 16K / month · ⏱️ 10.12.2020):

    pip install stumpy
  • Conda (📥 14K · ⏱️ 10.12.2020):

    conda install -c conda-forge stumpy
Darts (🥉21 · ⭐ 680) - A python library for easy manipulation and forecasting of time series. Apache-2
  • GitHub (👨‍💻 23 · 🔀 81 · 📦 10 · 📋 55 - 27% open · ⏱️ 09.11.2020):

    git clone https://github.com/unit8co/darts
  • PyPi (📥 1K / month · ⏱️ 09.11.2020):

    pip install u8darts
  • Dockerhub (📥 75 · ⏱️ 06.10.2020):

    docker pull unit8/darts
seglearn (🥉19 · ⭐ 410) - Python module for machine learning time series:. BSD-3
  • GitHub (👨‍💻 13 · 🔀 45 · 📦 6 · 📋 24 - 25% open · ⏱️ 08.11.2020):

    git clone https://github.com/dmbee/seglearn
  • PyPi (📥 2.9K / month · 📦 1 · ⏱️ 02.04.2020):

    pip install seglearn
pytorch-forecasting (🥉18 · ⭐ 460 · 🐣) - Time series forecasting with PyTorch. MIT
  • GitHub (👨‍💻 9 · 🔀 48 · 📋 68 - 27% open · ⏱️ 26.12.2020):

    git clone https://github.com/jdb78/pytorch-forecasting
  • PyPi (📥 3.9K / month · ⏱️ 03.12.2020):

    pip install pytorch-forecasting
ADTK (🥉17 · ⭐ 580 · 💤) - A Python toolkit for rule-based/unsupervised anomaly detection in.. MPL-2.0
  • GitHub (👨‍💻 11 · 🔀 70 · 📋 50 - 40% open · ⏱️ 17.04.2020):

    git clone https://github.com/arundo/adtk
  • PyPi (📥 34K / month · ⏱️ 17.04.2020):

    pip install adtk
tick (🥉17 · ⭐ 320) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
  • GitHub (👨‍💻 16 · 🔀 73 · 📥 160 · 📦 28 · 📋 210 - 25% open · ⏱️ 15.06.2020):

    git clone https://github.com/X-DataInitiative/tick
  • PyPi (📥 730 / month · 📦 1 · ⏱️ 24.05.2020):

    pip install tick
Show 2 hidden projects...
  • PyFlux (🥉22 · ⭐ 1.8K · 💀) - Open source time series library for Python. BSD-3
  • luminol (🥉22 · ⭐ 860 · 💀) - Anomaly Detection and Correlation library. Apache-2

Medical Data

Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.

MNE (🥇31 · ⭐ 1.5K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in.. BSD-3
  • GitHub (👨‍💻 240 · 🔀 840 · 📦 760 · 📋 3.4K - 8% open · ⏱️ 25.12.2020):

    git clone https://github.com/mne-tools/mne-python
  • PyPi (📥 35K / month · 📦 200 · ⏱️ 17.12.2020):

    pip install mne
  • Conda (📥 94K · ⏱️ 20.12.2020):

    conda install -c conda-forge mne
Nilearn (πŸ₯‡30 Β· ⭐ 690) - Machine learning for NeuroImaging in Python. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 370 Β· πŸ“¦ 830 Β· πŸ“‹ 1.4K - 26% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/nilearn/nilearn
  • PyPi (πŸ“₯ 12K / month Β· πŸ“¦ 300 Β· ⏱️ 12.11.2020):

    pip install nilearn
  • Conda (πŸ“₯ 74K Β· ⏱️ 12.11.2020):

    conda install -c conda-forge nilearn
Lifelines (πŸ₯ˆ29 Β· ⭐ 1.5K) - Survival analysis in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 88 Β· πŸ”€ 400 Β· πŸ“¦ 470 Β· πŸ“‹ 770 - 23% open Β· ⏱️ 09.12.2020):

    git clone https://github.com/CamDavidsonPilon/lifelines
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 130 Β· ⏱️ 09.12.2020):

    pip install lifelines
  • Conda (πŸ“₯ 120K Β· ⏱️ 10.12.2020):

    conda install -c conda-forge lifelines
NIPYPE (πŸ₯ˆ29 Β· ⭐ 540) - Workflows and interfaces for neuroimaging packages. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 210 Β· πŸ”€ 440 Β· πŸ“¦ 470 Β· πŸ“‹ 1.2K - 26% open Β· ⏱️ 08.12.2020):

    git clone https://github.com/nipy/nipype
  • PyPi (πŸ“₯ 13K / month Β· πŸ“¦ 190 Β· ⏱️ 16.08.2020):

    pip install nipype
  • Conda (πŸ“₯ 340K Β· ⏱️ 28.11.2020):

    conda install -c conda-forge nipype
NiBabel (πŸ₯ˆ29 Β· ⭐ 380) - Python package to access a cacophony of neuro-imaging file formats. MIT
  • GitHub (πŸ‘¨β€πŸ’» 89 Β· πŸ”€ 210 Β· πŸ“¦ 3.4K Β· πŸ“‹ 390 - 28% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/nipy/nibabel
  • PyPi (πŸ“₯ 50K / month Β· πŸ“¦ 1.3K Β· ⏱️ 28.11.2020):

    pip install nibabel
  • Conda (πŸ“₯ 290K Β· ⏱️ 29.11.2020):

    conda install -c conda-forge nibabel
DIPY (πŸ₯ˆ28 Β· ⭐ 370) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 310 Β· πŸ“¦ 310 Β· πŸ“‹ 730 - 21% open Β· ⏱️ 10.12.2020):

    git clone https://github.com/dipy/dipy
  • PyPi (πŸ“₯ 7.7K / month Β· πŸ“¦ 94 Β· ⏱️ 05.11.2020):

    pip install dipy
  • Conda (πŸ“₯ 170K Β· ⏱️ 14.11.2020):

    conda install -c conda-forge dipy
Hail (πŸ₯ˆ24 Β· ⭐ 690) - Scalable genomic data analysis. MIT
  • GitHub (πŸ‘¨β€πŸ’» 68 Β· πŸ”€ 180 Β· πŸ“¦ 26 Β· πŸ“‹ 2.3K - 14% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/hail-is/hail
  • PyPi (πŸ“₯ 1.6K / month Β· ⏱️ 03.12.2020):

    pip install hail
MONAI (πŸ₯ˆ22 Β· ⭐ 1.6K) - AI Toolkit for Healthcare Imaging. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 48 Β· πŸ”€ 280 Β· πŸ“‹ 650 - 12% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/Project-MONAI/MONAI
  • PyPi (πŸ“₯ 3.5K / month Β· ⏱️ 15.12.2020):

    pip install monai
NiftyNet (πŸ₯ˆ22 Β· ⭐ 1.3K Β· πŸ’€) - [unmaintained] An open-source convolutional neural.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 58 Β· πŸ”€ 370 Β· πŸ“¦ 33 Β· πŸ“‹ 320 - 30% open Β· ⏱️ 21.04.2020):

    git clone https://github.com/NifTK/NiftyNet
  • PyPi (πŸ“₯ 260 / month Β· ⏱️ 10.10.2019):

    pip install niftynet
DeepVariant (πŸ₯‰21 Β· ⭐ 2.1K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 520 Β· πŸ“₯ 2.9K Β· πŸ“‹ 360 - 0% open Β· ⏱️ 14.12.2020):

    git clone https://github.com/google/deepvariant
  • Conda (πŸ“₯ 26K Β· ⏱️ 05.09.2020):

    conda install -c bioconda deepvariant
Brainiak (πŸ₯‰19 Β· ⭐ 230) - Brain Imaging Analysis Kit. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 32 Β· πŸ”€ 100 Β· πŸ“¦ 11 Β· πŸ“‹ 180 - 35% open Β· ⏱️ 15.10.2020):

    git clone https://github.com/brainiak/brainiak
  • PyPi (πŸ“₯ 130 / month Β· πŸ“¦ 1 Β· ⏱️ 15.10.2020):

    pip install brainiak
  • Dockerhub (πŸ“₯ 450 Β· ⭐ 1 Β· ⏱️ 15.10.2020):

    docker pull brainiak/brainiak
Medical Detection Toolkit (πŸ₯‰13 Β· ⭐ 880 Β· πŸ’€) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 230 Β· πŸ“‹ 110 - 24% open Β· ⏱️ 18.04.2020):

    git clone https://github.com/MIC-DKFZ/medicaldetectiontoolkit
MedicalNet (πŸ₯‰11 Β· ⭐ 1K) - Many studies have shown that the performance on deep learning is.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 1 Β· πŸ”€ 270 Β· πŸ“‹ 54 - 74% open Β· ⏱️ 27.08.2020):

    git clone https://github.com/Tencent/MedicalNet
Show 6 hidden projects...
  • NIPY (πŸ₯‰21 Β· ⭐ 290) - Neuroimaging in Python FMRI analysis package. ❗️DSDP
  • MedPy (πŸ₯‰20 Β· ⭐ 310 Β· πŸ’€) - Medical image processing in Python. ❗️GPL-3.0
  • DLTK (πŸ₯‰19 Β· ⭐ 1.2K Β· πŸ’€) - Deep Learning Toolkit for Medical Image Analysis. Apache-2
  • Glow (πŸ₯‰18 Β· ⭐ 150) - An open-source toolkit for large-scale genomic analysis. Apache-2
  • MedicalTorch (πŸ₯‰14 Β· ⭐ 700 Β· πŸ’€) - A medical imaging framework for Pytorch. Apache-2
  • DeepNeuro (🥉14 · ⭐ 96) - A deep learning Python package for neuroimaging data. MIT

Optical Character Recognition

Libraries for optical character recognition (OCR) and text extraction from images or videos.

Tesseract (πŸ₯‡29 Β· ⭐ 3.3K Β· πŸ“‰) - Python-tesseract is an optical character recognition (OCR).. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 500 Β· πŸ“‹ 240 - 2% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/madmaze/pytesseract
  • PyPi (πŸ“₯ 320K / month Β· πŸ“¦ 1.4K Β· ⏱️ 15.12.2020):

    pip install pytesseract
  • Conda (πŸ“₯ 37K Β· ⏱️ 20.11.2020):

    conda install -c conda-forge pytesseract
OCRmyPDF (πŸ₯ˆ27 Β· ⭐ 3.6K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing.. MPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 47 Β· πŸ”€ 410 Β· πŸ“¦ 100 Β· πŸ“‹ 640 - 14% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/jbarlow83/OCRmyPDF
  • PyPi (πŸ“₯ 13K / month Β· πŸ“¦ 12 Β· ⏱️ 12.12.2020):

    pip install ocrmypdf
EasyOCR (πŸ₯ˆ26 Β· ⭐ 9.6K) - Ready-to-use OCR with 80+ supported languages and all popular.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 73 Β· πŸ”€ 900 Β· πŸ“₯ 80K Β· πŸ“¦ 93 Β· πŸ“‹ 230 - 32% open Β· ⏱️ 19.12.2020):

    git clone https://github.com/JaidedAI/EasyOCR
  • PyPi (πŸ“₯ 8.6K / month Β· ⏱️ 21.11.2020):

    pip install easyocr
tesserocr (πŸ₯ˆ26 Β· ⭐ 1.4K) - A Python wrapper for the tesseract-ocr API. MIT
  • GitHub (πŸ‘¨β€πŸ’» 23 Β· πŸ”€ 180 Β· πŸ“¦ 440 Β· πŸ“‹ 200 - 29% open Β· ⏱️ 17.11.2020):

    git clone https://github.com/sirfz/tesserocr
  • PyPi (πŸ“₯ 31K / month Β· πŸ“¦ 50 Β· ⏱️ 17.03.2020):

    pip install tesserocr
  • Conda (πŸ“₯ 31K Β· ⏱️ 14.10.2020):

    conda install -c conda-forge tesserocr
PaddleOCR (πŸ₯‰23 Β· ⭐ 8K) - Awesome multilingual OCR toolkits based on PaddlePaddle.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 41 Β· πŸ”€ 1.5K Β· πŸ“¦ 10 Β· πŸ“‹ 1K - 25% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/PaddlePaddle/PaddleOCR
  • PyPi (πŸ“₯ 1.5K / month Β· ⏱️ 18.12.2020):

    pip install paddleocr
attention-ocr (πŸ₯‰21 Β· ⭐ 820) - A Tensorflow model for text recognition (CNN + seq2seq with.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 220 Β· πŸ“¦ 15 Β· πŸ“‹ 140 - 11% open Β· ⏱️ 31.10.2020):

    git clone https://github.com/emedvedev/attention-ocr
  • PyPi (πŸ“₯ 140 / month Β· ⏱️ 19.04.2019):

    pip install aocr
keras-ocr (πŸ₯‰20 Β· ⭐ 730) - A packaged and flexible version of the CRAFT text detector and Keras.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 160 Β· πŸ“₯ 11K Β· πŸ“‹ 120 - 22% open Β· ⏱️ 26.11.2020):

    git clone https://github.com/faustomorales/keras-ocr
  • PyPi (πŸ“₯ 1.8K / month Β· πŸ“¦ 1 Β· ⏱️ 26.11.2020):

    pip install keras-ocr
doc2text (πŸ₯‰19 Β· ⭐ 1.2K) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python module.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 91 Β· πŸ“¦ 32 Β· πŸ“‹ 21 - 57% open Β· ⏱️ 01.12.2020):

    git clone https://github.com/jlsutherland/doc2text
  • PyPi (πŸ“₯ 270 / month Β· ⏱️ 06.09.2016):

    pip install doc2text
calamari (πŸ₯‰18 Β· ⭐ 750) - Line based ATR Engine based on OCRopy. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 160 Β· πŸ“‹ 150 - 13% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/Calamari-OCR/calamari
  • PyPi (πŸ“₯ 540 / month Β· πŸ“¦ 2 Β· ⏱️ 13.11.2018):

    pip install calamari_ocr

Data Containers & Structures

General-purpose data containers & structures as well as utilities & extensions for pandas.

pandas (πŸ₯‡43 Β· ⭐ 28K) - Flexible and powerful data analysis / manipulation library for.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 2.5K Β· πŸ”€ 12K Β· πŸ“₯ 57K Β· πŸ“¦ 360K Β· πŸ“‹ 20K - 18% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/pandas-dev/pandas
  • PyPi (πŸ“₯ 25M / month Β· πŸ“¦ 77K Β· ⏱️ 26.12.2020):

    pip install pandas
  • Conda (πŸ“₯ 13M Β· ⏱️ 10.12.2020):

    conda install -c conda-forge pandas
numpy (πŸ₯‡42 Β· ⭐ 16K) - The fundamental package for scientific computing with Python. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 1.2K Β· πŸ”€ 5.1K Β· πŸ“₯ 290K Β· πŸ“¦ 580K Β· πŸ“‹ 9.4K - 23% open Β· ⏱️ 27.12.2020):

    git clone https://github.com/numpy/numpy
  • PyPi (πŸ“₯ 39M / month Β· πŸ“¦ 170K Β· ⏱️ 25.12.2020):

    pip install numpy
  • Conda (πŸ“₯ 15M Β· ⏱️ 14.12.2020):

    conda install -c conda-forge numpy
h5py (πŸ₯‡36 Β· ⭐ 1.5K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 390 Β· πŸ“₯ 760 Β· πŸ“¦ 96K Β· πŸ“‹ 1.1K - 16% open Β· ⏱️ 01.12.2020):

    git clone https://github.com/h5py/h5py
  • PyPi (πŸ“₯ 5.6M / month Β· πŸ“¦ 23K Β· ⏱️ 06.11.2020):

    pip install h5py
  • Conda (πŸ“₯ 4.4M Β· ⏱️ 07.11.2020):

    conda install -c conda-forge h5py
Arrow (πŸ₯ˆ35 Β· ⭐ 6.8K) - Apache Arrow is a cross-language development platform for in-memory.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 610 Β· πŸ”€ 1.7K Β· πŸ“¦ 26 Β· πŸ“‹ 650 - 17% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/apache/arrow
  • PyPi (πŸ“₯ 11M / month Β· πŸ“¦ 990 Β· ⏱️ 19.10.2020):

    pip install pyarrow
  • Conda (πŸ“₯ 440K Β· ⏱️ 12.10.2020):

    conda install -c conda-forge pyarrow
xarray (πŸ₯ˆ32 Β· ⭐ 1.9K) - N-D labeled arrays and datasets in Python. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 290 Β· πŸ”€ 610 Β· πŸ“¦ 4.7K Β· πŸ“‹ 2.6K - 29% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/pydata/xarray
  • PyPi (πŸ“₯ 290K / month Β· πŸ“¦ 1.1K Β· ⏱️ 30.11.2020):

    pip install xarray
  • Conda (πŸ“₯ 2.2M Β· ⏱️ 01.12.2020):

    conda install -c conda-forge xarray
numexpr (πŸ₯ˆ30 Β· ⭐ 1.5K) - Fast numerical array expression evaluator for Python, NumPy, PyTables,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 51 Β· πŸ”€ 160 Β· πŸ“‹ 300 - 17% open Β· ⏱️ 24.09.2020):

    git clone https://github.com/pydata/numexpr
  • PyPi (πŸ“₯ 830K / month Β· πŸ“¦ 5.5K Β· ⏱️ 05.01.2020):

    pip install numexpr
  • Conda (πŸ“₯ 2M Β· ⏱️ 14.10.2020):

    conda install -c conda-forge numexpr
TinyDB (πŸ₯ˆ29 Β· ⭐ 3.9K) - TinyDB is a lightweight document oriented database optimized for your.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 61 Β· πŸ”€ 350 Β· πŸ“‹ 240 - 5% open Β· ⏱️ 05.12.2020):

    git clone https://github.com/msiemens/tinydb
  • PyPi (πŸ“₯ 130K / month Β· πŸ“¦ 1.1K Β· ⏱️ 14.11.2020):

    pip install tinydb
  • Conda (πŸ“₯ 96K Β· ⏱️ 14.11.2020):

    conda install -c conda-forge tinydb
Koalas (πŸ₯ˆ29 Β· ⭐ 2.5K) - Koalas: pandas API on Apache Spark. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 47 Β· πŸ”€ 290 Β· πŸ“₯ 1K Β· πŸ“¦ 67 Β· πŸ“‹ 500 - 16% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/databricks/koalas
  • PyPi (πŸ“₯ 880K / month Β· πŸ“¦ 1 Β· ⏱️ 11.12.2020):

    pip install koalas
  • Conda (πŸ“₯ 73K Β· ⏱️ 11.12.2020):

    conda install -c conda-forge koalas
PyTables (πŸ₯ˆ29 Β· ⭐ 1K) - A Python package to manage extremely large amounts of data. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 96 Β· πŸ”€ 180 Β· πŸ“₯ 120 Β· πŸ“‹ 590 - 26% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/PyTables/PyTables
  • PyPi (πŸ“₯ 540K / month Β· πŸ“¦ 3.8K Β· ⏱️ 30.10.2019):

    pip install tables
  • Conda (πŸ“₯ 2.1M Β· ⏱️ 21.10.2020):

    conda install -c conda-forge pytables
Modin (πŸ₯ˆ28 Β· ⭐ 5.5K) - Modin: Speed up your Pandas workflows by changing a single line.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 64 Β· πŸ”€ 380 Β· πŸ“₯ 200K Β· πŸ“¦ 270 Β· πŸ“‹ 1.4K - 27% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/modin-project/modin
  • PyPi (πŸ“₯ 50K / month Β· πŸ“¦ 9 Β· ⏱️ 10.11.2020):

    pip install modin
Bottleneck (πŸ₯ˆ28 Β· ⭐ 570) - Fast NumPy array functions written in C. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 61 Β· πŸ“¦ 18K Β· πŸ“‹ 200 - 11% open Β· ⏱️ 25.11.2020):

    git clone https://github.com/pydata/bottleneck
  • PyPi (πŸ“₯ 190K / month Β· πŸ“¦ 2.9K Β· ⏱️ 21.02.2020):

    pip install Bottleneck
  • Conda (πŸ“₯ 1.4M Β· ⏱️ 12.10.2020):

    conda install -c conda-forge bottleneck
datasketch (πŸ₯‰27 Β· ⭐ 1.4K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 16 Β· πŸ”€ 200 Β· πŸ“₯ 16 Β· πŸ“¦ 240 Β· πŸ“‹ 110 - 17% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/ekzhu/datasketch
  • PyPi (πŸ“₯ 190K / month Β· πŸ“¦ 86 Β· ⏱️ 15.12.2020):

    pip install datasketch
zarr (πŸ₯‰26 Β· ⭐ 610) - An implementation of chunked, compressed, N-dimensional arrays for Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 100 Β· πŸ“¦ 500 Β· πŸ“‹ 400 - 43% open Β· ⏱️ 21.12.2020):

    git clone https://github.com/zarr-developers/zarr-python
  • PyPi (πŸ“₯ 20K / month Β· πŸ“¦ 72 Β· ⏱️ 02.12.2020):

    pip install zarr
  • Conda (πŸ“₯ 510K Β· ⏱️ 03.12.2020):

    conda install -c conda-forge zarr
swifter (πŸ₯‰25 Β· ⭐ 1.5K) - A package which efficiently applies any function to a pandas.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 68 Β· πŸ“¦ 270 Β· πŸ“‹ 89 - 14% open Β· ⏱️ 19.12.2020):

    git clone https://github.com/jmcarpenter2/swifter
  • PyPi (πŸ“₯ 54K / month Β· πŸ“¦ 16 Β· ⏱️ 11.10.2020):

    pip install swifter
  • Conda (πŸ“₯ 73K Β· ⏱️ 20.09.2020):

    conda install -c conda-forge swifter
Pandaral·lel (🥉23 · ⭐ 1.3K) - A simple and efficient tool to parallelize Pandas.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 89 Β· πŸ“¦ 130 Β· πŸ“‹ 110 - 52% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/nalepae/pandarallel
  • PyPi (πŸ“₯ 44K / month Β· ⏱️ 25.08.2020):

    pip install pandarallel
Vaex (πŸ₯‰22 Β· ⭐ 5.5K) - Out-of-Core DataFrames for Python, ML, visualize and explore big tabular.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 420 Β· πŸ“₯ 200 Β· πŸ“‹ 650 - 41% open Β· ⏱️ 09.12.2020):

    git clone https://github.com/vaexio/vaex
  • PyPi (πŸ“₯ 4.5K / month Β· πŸ“¦ 2 Β· ⏱️ 08.12.2020):

    pip install vaex
  • Conda (πŸ“₯ 93K Β· ⏱️ 01.06.2020):

    conda install -c conda-forge vaex
datatable (πŸ₯‰20 Β· ⭐ 1.1K) - A Python package for manipulating 2-dimensional tabular data.. MPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 95 Β· πŸ“₯ 710 Β· πŸ“‹ 1.2K - 9% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/h2oai/datatable
  • PyPi (πŸ“₯ 16K / month Β· πŸ“¦ 6 Β· ⏱️ 10.12.2020):

    pip install datatable
StaticFrame (πŸ₯‰20 Β· ⭐ 210) - The StaticFrame library defines the Series and Frame, immutable data.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 20 Β· πŸ“¦ 5 Β· πŸ“‹ 260 - 12% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/InvestmentSystems/static-frame
  • PyPi (πŸ“₯ 1.9K / month Β· ⏱️ 01.12.2020):

    pip install static-frame
  • Conda (πŸ“₯ 58K Β· ⏱️ 14.12.2020):

    conda install -c conda-forge static-frame
fletcher (πŸ₯‰20 Β· ⭐ 200) - Pandas ExtensionDType/Array backed by Apache Arrow. MIT
  • GitHub (πŸ‘¨β€πŸ’» 23 Β· πŸ”€ 32 Β· πŸ“₯ 12 Β· πŸ“¦ 1 Β· πŸ“‹ 72 - 45% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/xhochy/fletcher
  • PyPi (πŸ“₯ 480 / month Β· ⏱️ 07.12.2020):

    pip install fletcher
  • Conda (πŸ“₯ 18K Β· ⏱️ 07.12.2020):

    conda install -c conda-forge fletcher
Bounter (πŸ₯‰18 Β· ⭐ 890) - Efficient Counter that uses a limited (bounded) amount of memory.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 48 Β· πŸ“¦ 20 Β· πŸ“‹ 20 - 55% open Β· ⏱️ 16.08.2020):

    git clone https://github.com/RaRe-Technologies/bounter
  • PyPi (πŸ“₯ 330 / month Β· πŸ“¦ 14 Β· ⏱️ 17.08.2020):

    pip install bounter
PandaPy (πŸ₯‰15 Β· ⭐ 460) - PandaPy has the speed of NumPy and the usability of Pandas 10x to.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 44 Β· πŸ“¦ 1 Β· πŸ“‹ 3 - 66% open Β· ⏱️ 12.11.2020):

    git clone https://github.com/firmai/pandapy
  • PyPi (πŸ“₯ 97 / month Β· ⏱️ 25.01.2020):

    pip install pandapy
Show 6 hidden projects...
  • Blaze (πŸ₯ˆ28 Β· ⭐ 2.9K Β· πŸ’€) - NumPy and Pandas interface to Big Data. BSD-3
  • sklearn-pandas (πŸ₯ˆ28 Β· ⭐ 2.3K) - Pandas integration with sklearn. ❗️Zlib
  • Arctic (πŸ₯‰24 Β· ⭐ 2.1K) - Arctic is a high performance datastore for numeric data. ❗️LGPL-2.1
  • pandasql (πŸ₯‰22 Β· ⭐ 930 Β· πŸ’€) - sqldf for pandas. MIT
  • pickleDB (πŸ₯‰21 Β· ⭐ 530 Β· πŸ’€) - pickleDB is an open source key-value store using Python's json.. BSD-3
  • Pandas Summary (πŸ₯‰21 Β· ⭐ 360 Β· πŸ’€) - An extension to pandas dataframes describe function. MIT

Data Loading & Extraction

Libraries for loading, collecting, and extracting data from a variety of data sources and formats.

Faker (πŸ₯‡36 Β· ⭐ 12K) - Faker is a Python package that generates fake data for you. MIT
  • GitHub (πŸ‘¨β€πŸ’» 380 Β· πŸ”€ 1.3K Β· πŸ“¦ 22K Β· πŸ“‹ 470 - 27% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/joke2k/faker
  • PyPi (πŸ“₯ 2.1M / month Β· πŸ“¦ 4.7K Β· ⏱️ 15.12.2020):

    pip install Faker
  • Conda (πŸ“₯ 360K Β· ⏱️ 15.12.2020):

    conda install -c conda-forge faker
xlrd (πŸ₯‡33 Β· ⭐ 1.8K) - Please use openpyxl where you can... BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 410 Β· πŸ“¦ 60K Β· ⏱️ 12.12.2020):

    git clone https://github.com/python-excel/xlrd
  • PyPi (πŸ“₯ 5.6M / month Β· πŸ“¦ 14K Β· ⏱️ 11.12.2020):

    pip install xlrd
  • Conda (πŸ“₯ 1.1M Β· ⏱️ 25.07.2020):

    conda install -c conda-forge xlrd
xmltodict (πŸ₯ˆ32 Β· ⭐ 4.2K Β· πŸ’€) - Python module that makes working with XML feel like you are.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 41 Β· πŸ”€ 390 Β· πŸ“¦ 20K Β· πŸ“‹ 200 - 31% open Β· ⏱️ 26.04.2020):

    git clone https://github.com/martinblech/xmltodict
  • PyPi (πŸ“₯ 4.1M / month Β· πŸ“¦ 8.2K Β· ⏱️ 11.02.2019):

    pip install xmltodict
  • Conda (πŸ“₯ 570K Β· ⏱️ 11.02.2019):

    conda install -c conda-forge xmltodict
Tablib (πŸ₯ˆ32 Β· ⭐ 3.8K) - Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c. MIT
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 560 Β· πŸ“¦ 8.8K Β· πŸ“‹ 230 - 14% open Β· ⏱️ 11.12.2020):

    git clone https://github.com/jazzband/tablib
  • PyPi (πŸ“₯ 600K / month Β· πŸ“¦ 2.4K Β· ⏱️ 05.12.2020):

    pip install tablib
  • Conda (πŸ“₯ 56K Β· ⏱️ 05.12.2020):

    conda install -c conda-forge tablib
TensorFlow Datasets (πŸ₯ˆ32 Β· ⭐ 2.6K Β· πŸ“ˆ) - TFDS is a collection of datasets ready to use with.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 890 Β· πŸ“¦ 3.2K Β· πŸ“‹ 880 - 43% open Β· ⏱️ 18.12.2020):

    git clone https://github.com/tensorflow/datasets
  • PyPi (πŸ“₯ 540K / month Β· πŸ“¦ 100 Β· ⏱️ 09.10.2020):

    pip install tensorflow-datasets
smart-open (πŸ₯ˆ30 Β· ⭐ 1.9K) - Utils for streaming large files (S3, HDFS, gzip, bz2...). MIT
  • GitHub (πŸ‘¨β€πŸ’» 68 Β· πŸ”€ 260 Β· πŸ“‹ 290 - 23% open Β· ⏱️ 10.12.2020):

    git clone https://github.com/RaRe-Technologies/smart_open
  • PyPi (πŸ“₯ 6.9M / month Β· πŸ“¦ 640 Β· ⏱️ 27.11.2020):

    pip install smart-open
python-magic (πŸ₯ˆ30 Β· ⭐ 1.8K) - A python wrapper for libmagic. MIT
  • GitHub (πŸ‘¨β€πŸ’» 47 Β· πŸ”€ 210 Β· πŸ“¦ 11K Β· πŸ“‹ 150 - 18% open Β· ⏱️ 11.12.2020):

    git clone https://github.com/ahupp/python-magic
  • PyPi (πŸ“₯ 1.4M / month Β· πŸ“¦ 5.1K Β· ⏱️ 06.05.2020):

    pip install python-magic
  • Conda (πŸ“₯ 73K Β· ⏱️ 24.12.2020):

    conda install -c conda-forge python-magic
pandas-datareader (πŸ₯ˆ30 Β· ⭐ 1.8K) - Extract data from a wide range of Internet sources into a.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 76 Β· πŸ”€ 490 Β· πŸ“¦ 7.2K Β· πŸ“‹ 440 - 15% open Β· ⏱️ 02.09.2020):

    git clone https://github.com/pydata/pandas-datareader
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 1.4K Β· ⏱️ 10.07.2020):

    pip install pandas-datareader
  • Conda (πŸ“₯ 83K Β· ⏱️ 20.11.2019):

    conda install -c conda-forge pandas-datareader
csvkit (πŸ₯‰28 Β· ⭐ 4.4K) - A suite of utilities for converting to and working with CSV, the king of.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 91 Β· πŸ”€ 540 Β· πŸ“¦ 790 Β· πŸ“‹ 800 - 7% open Β· ⏱️ 30.10.2020):

    git clone https://github.com/wireservice/csvkit
  • PyPi (πŸ“₯ 27K / month Β· πŸ“¦ 700 Β· ⏱️ 03.03.2020):

    pip install csvkit
  • Conda (πŸ“₯ 45K Β· ⏱️ 28.05.2019):

    conda install -c conda-forge csvkit
snorkel (πŸ₯‰28 Β· ⭐ 4.4K) - A system for quickly generating training data with weak supervision. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 700 Β· πŸ“₯ 480 Β· πŸ“¦ 61 Β· πŸ“‹ 940 - 3% open Β· ⏱️ 05.09.2020):

    git clone https://github.com/snorkel-team/snorkel
  • PyPi (πŸ“₯ 74K / month Β· πŸ“¦ 4 Β· ⏱️ 07.04.2020):

    pip install snorkel
  • Conda (πŸ“₯ 15K Β· ⏱️ 10.04.2020):

    conda install -c conda-forge snorkel
PDFMiner (πŸ₯‰26 Β· ⭐ 4.5K Β· πŸ’€) - Python PDF Parser (Not actively maintained). Check out pdfminer.six. MIT
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 1K Β· πŸ“¦ 1.9K Β· πŸ“‹ 260 - 85% open Β· ⏱️ 18.01.2020):

    git clone https://github.com/euske/pdfminer
  • PyPi (πŸ“₯ 170K / month Β· πŸ“¦ 1.5K Β· ⏱️ 25.11.2019):

    pip install pdfminer
  • Conda (πŸ“₯ 13K Β· ⏱️ 03.11.2019):

    conda install -c conda-forge pdfminer
Intake (πŸ₯‰24 Β· ⭐ 510) - Intake is a lightweight package for finding, investigating, loading and.. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 51 Β· πŸ”€ 91 Β· πŸ“¦ 210 Β· πŸ“‹ 260 - 28% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/intake/intake
  • PyPi (πŸ“₯ 3.3K / month Β· πŸ“¦ 74 Β· ⏱️ 03.06.2020):

    pip install intake
  • Conda (πŸ“₯ 54K Β· ⏱️ 03.06.2020):

    conda install -c conda-forge intake
SDV (πŸ₯‰22 Β· ⭐ 290) - Synthetic Data Generation for tabular, relational and time series data. MIT
  • GitHub (πŸ‘¨β€πŸ’» 32 Β· πŸ”€ 52 Β· πŸ“¦ 12 Β· πŸ“‹ 170 - 24% open Β· ⏱️ 22.12.2020):

    git clone https://github.com/sdv-dev/SDV
  • PyPi (πŸ“₯ 1.5K / month Β· ⏱️ 22.12.2020):

    pip install sdv
Show 4 hidden projects...
  • textract (πŸ₯‰26 Β· ⭐ 2.9K Β· πŸ’€) - extract text from any document. no muss. no fuss. MIT
  • Camelot (πŸ₯‰24 Β· ⭐ 3K Β· πŸ’€) - Camelot: PDF Table Extraction for Humans. MIT
  • Singer (πŸ₯‰24 Β· ⭐ 650) - Standard for moving data between databases, web APIs, files,.. ❗️AGPL-3.0
  • pyexcel-xlsx (πŸ₯‰23 Β· ⭐ 83) - A wrapper library to read, manipulate and write data in xlsx and.. BSD-3

Web Scraping & Crawling

Libraries for web scraping, crawling, downloading, and mining.
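
To give a rough idea of the kind of work the libraries in this category automate, here is a minimal sketch that extracts links from an HTML string using only Python's standard library. The `LinkExtractor` class, the `extract_links` function, and the sample markup are hypothetical illustrations and do not come from any of the listed projects, which handle fetching, malformed markup, and crawling policies far more thoroughly.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect absolute URLs from <a href="..."> tags (hypothetical helper)."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html: str, base_url: str) -> List[str]:
    """Return all link targets found in the given HTML, in document order."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


# Example with an inline HTML snippet (no network access needed).
sample = '<p><a href="/docs">Docs</a> <a href="https://example.org/x">X</a></p>'
print(extract_links(sample, "https://example.com/start"))
```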

Scrapy (πŸ₯‡38 Β· ⭐ 39K) - Scrapy, a fast high-level web crawling & scraping framework for Python. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 460 Β· πŸ”€ 8.9K Β· πŸ“¦ 22K Β· πŸ“‹ 2.6K - 28% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/scrapy/scrapy
  • PyPi (πŸ“₯ 510K / month Β· πŸ“¦ 8.4K Β· ⏱️ 17.11.2020):

    pip install scrapy
  • Conda (πŸ“₯ 380K Β· ⏱️ 17.11.2020):

    conda install -c conda-forge scrapy
youtube-dl (πŸ₯‡37 Β· ⭐ 86K) - Command-line program to download videos from YouTube.com and.. Unlicense
  • GitHub (πŸ‘¨β€πŸ’» 910 Β· πŸ”€ 3.2K Β· πŸ“₯ 46M Β· πŸ“‹ 23K - 17% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/ytdl-org/youtube-dl
  • PyPi (πŸ“₯ 1.9M / month Β· πŸ“¦ 2.9K Β· ⏱️ 26.12.2020):

    pip install youtube_dl
  • Conda (πŸ“₯ 530K Β· ⏱️ 26.12.2020):

    conda install -c conda-forge youtube-dl
feedparser (πŸ₯‡31 Β· ⭐ 1.2K Β· βž•) - Parse feeds in Python. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 42 Β· πŸ”€ 250 Β· πŸ“‹ 170 - 37% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/kurtmckee/feedparser
  • PyPi (πŸ“₯ 530K / month Β· πŸ“¦ 14K Β· ⏱️ 25.10.2020):

    pip install feedparser
You-Get (πŸ₯ˆ29 Β· ⭐ 38K) - Dumb downloader that scrapes the web. MIT
  • GitHub (πŸ‘¨β€πŸ’» 220 Β· πŸ”€ 7.9K Β· πŸ“₯ 330K Β· ⏱️ 26.12.2020):

    git clone https://github.com/soimort/you-get
  • PyPi (πŸ“₯ 24K / month Β· πŸ“¦ 18 Β· ⏱️ 26.12.2020):

    pip install you-get
Requests-HTML (πŸ₯ˆ29 Β· ⭐ 11K Β· πŸ’€) - Pythonic HTML Parsing for Humans. MIT
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 770 Β· πŸ“¦ 3.5K Β· πŸ“‹ 320 - 41% open Β· ⏱️ 10.05.2020):

    git clone https://github.com/psf/requests-html
  • PyPi (πŸ“₯ 280K / month Β· πŸ“¦ 360 Β· ⏱️ 17.02.2019):

    pip install requests-html
newspaper3k (πŸ₯ˆ29 Β· ⭐ 11K) - News, full-text, and article metadata extraction in Python 3... MIT
  • GitHub (πŸ‘¨β€πŸ’» 100 Β· πŸ”€ 1.7K Β· πŸ“¦ 2.1K Β· πŸ“‹ 660 - 63% open Β· ⏱️ 02.09.2020):

    git clone https://github.com/codelucas/newspaper
  • PyPi (πŸ“₯ 27K / month Β· πŸ“¦ 440 Β· ⏱️ 28.09.2018):

    pip install newspaper3k
  • Conda (πŸ“₯ 35K Β· ⏱️ 14.10.2020):

    conda install -c conda-forge newspaper3k
Pattern (πŸ₯ˆ29 Β· ⭐ 7.7K Β· πŸ’€) - Web mining module for Python, with tools for scraping, natural.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 1.5K Β· πŸ“¦ 1K Β· πŸ“‹ 230 - 67% open Β· ⏱️ 25.04.2020):

    git clone https://github.com/clips/pattern
  • PyPi (πŸ“₯ 65K / month Β· πŸ“¦ 270 Β· ⏱️ 18.08.2018):

    pip install pattern
  • Conda (πŸ“₯ 5.5K Β· ⏱️ 05.05.2020):

    conda install -c conda-forge pattern
Datasets (πŸ₯ˆ29 Β· ⭐ 5.9K) - The largest hub of ready-to-use NLP datasets for ML models with.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 190 Β· πŸ”€ 610 Β· πŸ“¦ 220 Β· πŸ“‹ 550 - 47% open Β· ⏱️ 23.12.2020):

    git clone https://github.com/huggingface/datasets
  • PyPi (πŸ“₯ 23K / month Β· πŸ“¦ 18 Β· ⏱️ 19.11.2020):

    pip install datasets
pyspider (πŸ₯ˆ28 Β· ⭐ 15K) - A Powerful Spider(Web Crawler) System in Python. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 3.6K Β· πŸ“¦ 280 Β· πŸ“‹ 820 - 33% open Β· ⏱️ 02.08.2020):

    git clone https://github.com/binux/pyspider
  • PyPi (πŸ“₯ 1.8K / month Β· πŸ“¦ 30 Β· ⏱️ 18.04.2018):

    pip install pyspider
MechanicalSoup (πŸ₯‰26 Β· ⭐ 3.6K) - A Python library for automating interaction with websites. MIT
  • GitHub (πŸ‘¨β€πŸ’» 41 Β· πŸ”€ 310 Β· πŸ“₯ 39 Β· πŸ“‹ 140 - 15% open Β· ⏱️ 11.11.2020):

    git clone https://github.com/MechanicalSoup/MechanicalSoup
  • PyPi (πŸ“₯ 32K / month Β· πŸ“¦ 390 Β· ⏱️ 27.08.2019):

    pip install MechanicalSoup
  • Conda (πŸ“₯ 73K Β· ⏱️ 29.08.2019):

    conda install -c conda-forge mechanicalsoup
Grab (πŸ₯‰26 Β· ⭐ 2K) - Web Scraping Framework. MIT
  • GitHub (πŸ‘¨β€πŸ’» 65 Β· πŸ”€ 240 Β· πŸ“¦ 300 Β· πŸ“‹ 220 - 12% open Β· ⏱️ 08.12.2020):

    git clone https://github.com/lorien/grab
  • PyPi (πŸ“₯ 1.8K / month Β· πŸ“¦ 100 Β· ⏱️ 24.06.2018):

    pip install grab
micawber (πŸ₯‰25 Β· ⭐ 510) - a small library for extracting rich content from urls. MIT
  • GitHub (πŸ‘¨β€πŸ’» 26 Β· πŸ”€ 78 Β· πŸ“¦ 480 Β· ⏱️ 18.12.2020):

    git clone https://github.com/coleifer/micawber
  • PyPi (πŸ“₯ 4K / month Β· πŸ“¦ 370 Β· ⏱️ 17.12.2020):

    pip install micawber
beautifulsoup4 (πŸ₯‰22) - Screen-scraping library. MIT
  • PyPi (📥 9.5M / month · 📦 62K · ⏱️ 03.10.2020):

    pip install beautifulsoup4
  • Conda (📥 3.2M · ⏱️ 03.10.2020):

    conda install -c conda-forge beautifulsoup4
AutoScraper (πŸ₯‰20 Β· ⭐ 3.1K Β· 🐣) - A Smart, Automatic, Fast and Lightweight Web Scraper for Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 290 Β· πŸ“¦ 18 Β· πŸ“‹ 26 - 11% open Β· ⏱️ 15.12.2020):

    git clone https://github.com/alirezamika/autoscraper
  • PyPi (πŸ“₯ 830 / month Β· ⏱️ 29.11.2020):

    pip install autoscraper
Lassie (πŸ₯‰20 Β· ⭐ 520 Β· βž•) - Web Content Retrieval for Humans. MIT
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 42 Β· πŸ“¦ 20 Β· πŸ“‹ 39 - 25% open Β· ⏱️ 16.12.2020):

    git clone https://github.com/michaelhelmick/lassie
  • PyPi (πŸ“₯ 540 / month Β· πŸ“¦ 7 Β· ⏱️ 16.12.2020):

    pip install lassie
gazpacho (πŸ₯‰20 Β· ⭐ 500) - The simple, fast, and modern web scraping library. MIT
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 47 Β· πŸ“¦ 79 Β· πŸ“‹ 33 - 12% open Β· ⏱️ 31.10.2020):

    git clone https://github.com/maxhumber/gazpacho
  • PyPi (πŸ“₯ 2.5K / month Β· πŸ“¦ 2 Β· ⏱️ 24.09.2020):

    pip install gazpacho
Cola (πŸ₯‰18 Β· ⭐ 1.4K Β· πŸ’€) - A high-level distributed crawling framework. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 540 Β· πŸ“¦ 3 Β· πŸ“‹ 66 - 21% open Β· ⏱️ 01.03.2020):

    git clone https://github.com/qinxuye/cola
  • PyPi (πŸ“₯ 59 / month Β· πŸ“¦ 1 Β· ⏱️ 31.03.2016):

    pip install cola
Transistor (πŸ₯‰16 Β· ⭐ 210) - Transistor, a Python web scraping framework for intelligent use cases. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 17 Β· πŸ“₯ 52 Β· πŸ“‹ 63 - 95% open Β· ⏱️ 16.08.2020):

    git clone https://github.com/bomquote/transistor
  • PyPi (πŸ“₯ 100 / month Β· ⏱️ 03.08.2020):

    pip install transistor
Show 6 hidden projects...
  • soupsieve (πŸ₯ˆ29 Β· ⭐ 84) - A modern CSS selector implementation for BeautifulSoup. MIT
  • Google Images Download (πŸ₯‰27 Β· ⭐ 7.3K Β· πŸ’€) - Python Script to download hundreds of images from 'Google.. MIT
  • gallery-dl (πŸ₯‰23 Β· ⭐ 2.8K) - Command-line program to download image galleries and.. ❗️GPL-2.0
  • WikiExtractor (πŸ₯‰23 Β· ⭐ 2.5K) - A tool for extracting plain text from Wikipedia dumps. ❗️AGPL-3.0
  • Photon (πŸ₯‰20 Β· ⭐ 7.4K Β· πŸ’€) - Incredibly fast crawler designed for OSINT. ❗️GPL-3.0
  • lazynlp (πŸ₯‰14 Β· ⭐ 1.9K Β· πŸ’€) - Library to scrape and clean web pages to create massive datasets. MIT

Data Pipelines & Streaming

Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
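
As a rough illustration of the core idea behind workflow tools, namely running tasks in dependency order and passing results downstream, here is a minimal sketch built on the standard library's `graphlib` (Python 3.9+). The task names, the `tasks` mapping, and `run_pipeline` are hypothetical; the libraries in this category add scheduling, retries, distribution, and monitoring on top of this basic pattern.

```python
from graphlib import TopologicalSorter  # available in Python 3.9+


def extract():
    """Pretend to read raw records from a source."""
    return [" 3", "1 ", "2"]


def transform(rows):
    """Clean the raw records and sort them."""
    return sorted(int(r) for r in rows)


def load(values):
    """Pretend to store the values; here we just aggregate them."""
    return sum(values)


# Each hypothetical task maps to (function, name of its upstream task or None).
tasks = {
    "extract": (extract, None),
    "transform": (transform, "extract"),
    "load": (load, "transform"),
}


def run_pipeline(tasks):
    """Run every task after its upstream task and collect the results."""
    graph = {name: ({up} if up else set()) for name, (_, up) in tasks.items()}
    results = {}
    for name in TopologicalSorter(graph).static_order():
        func, upstream = tasks[name]
        results[name] = func(results[upstream]) if upstream else func()
    return results


results = run_pipeline(tasks)
print(results["load"])
```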

Celery (πŸ₯‡37 Β· ⭐ 16K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 1.1K Β· πŸ”€ 3.9K Β· πŸ“¦ 47K Β· πŸ“‹ 4.4K - 11% open Β· ⏱️ 27.12.2020):

    git clone https://github.com/celery/celery
  • PyPi (πŸ“₯ 2.2M / month Β· πŸ“¦ 28K Β· ⏱️ 16.12.2020):

    pip install celery
  • Conda (πŸ“₯ 360K Β· ⏱️ 11.11.2020):

    conda install -c conda-forge celery
Airflow (πŸ₯‡35 Β· ⭐ 20K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1.7K Β· πŸ”€ 7.6K Β· πŸ“₯ 68K Β· πŸ“‹ 2.7K - 34% open Β· ⏱️ 26.12.2020):

    git clone https://github.com/apache/airflow
  • PyPi (πŸ“₯ 580K / month Β· πŸ“¦ 290 Β· ⏱️ 14.12.2020):

    pip install apache-airflow
  • Conda (πŸ“₯ 250K Β· ⏱️ 26.11.2020):

    conda install -c conda-forge airflow
  • Dockerhub (πŸ“₯ 3.3M Β· ⭐ 190 Β· ⏱️ 26.12.2020):

    docker pull apache/airflow
luigi (πŸ₯‡33 Β· ⭐ 14K) - Luigi is a Python module that helps you build complex pipelines of.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 550 Β· πŸ”€ 2.2K Β· πŸ“¦ 1.3K Β· πŸ“‹ 900 - 7% open Β· ⏱️ 10.12.2020):

    git clone https://github.com/spotify/luigi
  • PyPi (πŸ“₯ 390K / month Β· πŸ“¦ 680 Β· ⏱️ 23.09.2020):

    pip install luigi
  • Conda (πŸ“₯ 6.2K Β· ⏱️ 21.07.2020):

    conda install -c anaconda luigi
Beam (πŸ₯ˆ32 Β· ⭐ 4.5K) - Unified programming model to define and execute data processing.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 980 Β· πŸ”€ 2.7K Β· ⏱️ 25.12.2020):

    git clone https://github.com/apache/beam
  • PyPi (πŸ“₯ 1.7M / month Β· πŸ“¦ 190 Β· ⏱️ 11.12.2020):

    pip install apache-beam
Prefect (πŸ₯ˆ30 Β· ⭐ 5.6K) - The easiest way to automate your data. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 170 Β· πŸ”€ 470 Β· πŸ“¦ 220 Β· πŸ“‹ 1.5K - 15% open Β· ⏱️ 24.12.2020):

    git clone https://github.com/PrefectHQ/prefect
  • PyPi (πŸ“₯ 34K / month Β· πŸ“¦ 2 Β· ⏱️ 16.12.2020):

    pip install prefect
  • Conda (πŸ“₯ 63K Β· ⏱️ 17.12.2020):

    conda install -c conda-forge prefect
mrjob (πŸ₯ˆ30 Β· ⭐ 2.5K) - Run MapReduce jobs on Hadoop or Amazon Web Services. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 590 Β· πŸ“¦ 640 Β· πŸ“‹ 1.3K - 15% open Β· ⏱️ 16.11.2020):

    git clone https://github.com/Yelp/mrjob
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 110 Β· ⏱️ 17.09.2020):

    pip install mrjob
  • Conda (πŸ“₯ 300K Β· ⏱️ 24.12.2020):

    conda install -c conda-forge mrjob
faust (🥈28 · ⭐ 5.1K) - Python Stream Processing. BSD-3
  • GitHub (👨‍💻 90 · 🔀 420 · 📦 560 · 📋 440 - 50% open · ⏱️ 09.10.2020):

    git clone https://github.com/robinhood/faust
  • PyPi (📥 75K / month · 📦 28 · ⏱️ 25.02.2020):

    pip install faust
dbt (🥈28 · ⭐ 2.3K) - dbt (data build tool) enables data analysts and engineers to transform.. Apache-2
  • GitHub (👨‍💻 130 · 🔀 460 · 📦 160 · 📋 1.7K - 15% open · ⏱️ 22.12.2020):

    git clone https://github.com/fishtown-analytics/dbt
  • PyPi (📥 88K / month · 📦 9 · ⏱️ 21.10.2020):

    pip install dbt
  • Conda (📥 150K · ⏱️ 14.05.2020):

    conda install -c conda-forge dbt
Kedro (🥈27 · ⭐ 3.3K · 📉) - A Python framework for creating reproducible, maintainable and.. Apache-2
  • GitHub (👨‍💻 89 · 🔀 380 · 📦 340 · 📋 410 - 19% open · ⏱️ 17.12.2020):

    git clone https://github.com/quantumblacklabs/kedro
  • PyPi (📥 59K / month · 📦 16 · ⏱️ 17.12.2020):

    pip install kedro
Dagster (🥈26 · ⭐ 2.4K) - A data orchestrator for machine learning, analytics, and ETL. Apache-2
  • GitHub (👨‍💻 110 · 🔀 250 · 📦 130 · 📋 2.3K - 25% open · ⏱️ 24.12.2020):

    git clone https://github.com/dagster-io/dagster
  • PyPi (📥 26K / month · 📦 4 · ⏱️ 11.12.2020):

    pip install dagster
  • Conda (📥 120K · ⏱️ 04.12.2020):

    conda install -c conda-forge dagster
PyFunctional (🥈26 · ⭐ 1.8K) - Python library for creating data pipelines with chain functional.. MIT
  • GitHub (👨‍💻 22 · 🔀 96 · 📦 240 · 📋 120 - 2% open · ⏱️ 16.12.2020):

    git clone https://github.com/EntilZha/PyFunctional
  • PyPi (📥 33K / month · ⏱️ 17.09.2020):

    pip install pyfunctional
petl (🥈26 · ⭐ 800) - Python Extract Transform and Load Tables of Data. MIT
  • GitHub (👨‍💻 43 · 🔀 150 · 📦 300 · 📋 410 - 16% open · ⏱️ 24.12.2020):

    git clone https://github.com/petl-developers/petl
  • PyPi (📥 15K / month · 📦 110 · ⏱️ 06.10.2020):

    pip install petl
  • Conda (📥 19K · ⏱️ 25.09.2020):

    conda install -c conda-forge petl
streamparse (🥉25 · ⭐ 1.4K) - Run Python in Apache Storm topologies. Pythonic API, CLI.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 220 · 📦 46 · 📋 330 - 20% open · ⏱️ 18.12.2020):

    git clone https://github.com/Parsely/streamparse
  • PyPi (📥 1.7K / month · 📦 26 · ⏱️ 07.10.2020):

    pip install streamparse
TFX (🥉25 · ⭐ 1.3K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
  • GitHub (👨‍💻 96 · 🔀 360 · 📋 500 - 40% open · ⏱️ 24.12.2020):

    git clone https://github.com/tensorflow/tfx
  • PyPi (📥 64K / month · 📦 2 · ⏱️ 19.11.2020):

    pip install tfx
Optimus (🥉24 · ⭐ 960) - Agile Data Preparation Workflows made easy with dask, cudf,.. Apache-2
  • GitHub (👨‍💻 21 · 🔀 190 · 📦 11 · 📋 300 - 45% open · ⏱️ 19.11.2020):

    git clone https://github.com/ironmussa/Optimus
  • PyPi (📥 7.7K / month · ⏱️ 30.05.2019):

    pip install optimuspyspark
Great Expectations (🥉23 · ⭐ 3.3K) - Always know what to expect from your data. Apache-2
  • GitHub (👨‍💻 140 · 🔀 370 · 📋 730 - 20% open · ⏱️ 23.12.2020):

    git clone https://github.com/great-expectations/great_expectations
  • PyPi (📥 350K / month · 📦 6 · ⏱️ 23.12.2020):

    pip install great_expectations
pysparkling (🥉22 · ⭐ 230) - A pure Python implementation of Apache Spark's RDD and DStream.. MIT
  • GitHub (👨‍💻 9 · 🔀 43 · 📦 54 · 📋 22 - 9% open · ⏱️ 02.11.2020):

    git clone https://github.com/svenkreiss/pysparkling
  • PyPi (📥 6.4K / month · 📦 7 · ⏱️ 13.07.2019):

    pip install pysparkling
Pypeline (🥉21 · ⭐ 1.2K) - Concurrent data pipelines in Python. MIT
  • GitHub (👨‍💻 8 · 🔀 64 · 📋 45 - 24% open · ⏱️ 13.11.2020):

    git clone https://github.com/cgarciae/pypeln
  • PyPi (📥 2.4K / month · 📦 5 · ⏱️ 11.10.2020):

    pip install pypeln
pdpipe (🥉20 · ⭐ 570) - Easy pipelines for pandas DataFrames. MIT
  • GitHub (👨‍💻 8 · 🔀 29 · 📦 21 · 📋 20 - 30% open · ⏱️ 17.11.2020):

    git clone https://github.com/pdpipe/pdpipe
  • PyPi (📥 1.1K / month · ⏱️ 09.11.2020):

    pip install pdpipe
riko (🥉19 · ⭐ 1.5K) - A Python stream processing engine modeled after Yahoo! Pipes. MIT
  • GitHub (👨‍💻 18 · 🔀 70 · 📋 30 - 73% open · ⏱️ 14.08.2020):

    git clone https://github.com/nerevu/riko
  • PyPi (📥 170 / month · 📦 1 · ⏱️ 14.08.2020):

    pip install riko
TaskTiger (🥉19 · ⭐ 1K) - Python task queue using Redis. MIT
  • GitHub (👨‍💻 19 · 🔀 58 · 📦 19 · 📋 61 - 49% open · ⏱️ 17.12.2020):

    git clone https://github.com/closeio/tasktiger
  • PyPi (📥 420 / month · 📦 9 · ⏱️ 18.06.2020):

    pip install tasktiger
Databolt Flow (🥉19 · ⭐ 890) - Python library for building highly effective data science workflows. MIT
  • GitHub (👨‍💻 10 · 🔀 64 · 📦 12 · 📋 16 - 31% open · ⏱️ 19.12.2020):

    git clone https://github.com/d6t/d6tflow
  • PyPi (📥 230 / month · ⏱️ 19.12.2020):

    pip install d6tflow
Mara Pipelines (🥉18 · ⭐ 1.6K) - A lightweight opinionated ETL framework, halfway between plain.. MIT
  • GitHub (👨‍💻 15 · 🔀 76 · 📦 7 · 📋 17 - 35% open · ⏱️ 31.07.2020):

    git clone https://github.com/mara/mara-pipelines
  • PyPi (📥 42 / month · ⏱️ 31.07.2020):

    pip install mara-pipelines
BatchFlow (🥉18 · ⭐ 150) - BatchFlow helps you conveniently work with random or sequential.. Apache-2
  • GitHub (👨‍💻 28 · 🔀 33 · 📋 100 - 49% open · ⏱️ 08.12.2020):

    git clone https://github.com/analysiscenter/batchflow
  • PyPi (📥 24 / month · ⏱️ 23.08.2019):

    pip install batchflow
Show 2 hidden projects...
  • bonobo (🥉23 · ⭐ 1.4K · 💀) - Extract Transform Load for Python 3.5+. Apache-2
  • Botflow (🥉15 · ⭐ 1.2K · 💀) - Python Fast Dataflow programming framework for Data pipeline.. BSD-3

Distributed Machine Learning

Back to top

Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
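
As a rough illustration of the data-parallel pattern that many of the libraries below build on, here is a minimal standard-library sketch. It is not the API of any listed project: `partial_gradient` and `data_parallel_gradient` are hypothetical names, and threads stand in for the cluster workers a real framework would manage.

```python
from concurrent.futures import ThreadPoolExecutor


def partial_gradient(shard, w):
    """Sum of d/dw (w*x - y)^2 over one shard of (x, y) pairs."""
    return sum(2 * (w * x - y) * x for x, y in shard)


def data_parallel_gradient(data, w, n_workers=4):
    """Split the data into shards, compute gradients in parallel, then average."""
    shards = [data[i::n_workers] for i in range(n_workers)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(lambda shard: partial_gradient(shard, w), shards))
    return sum(partials) / len(data)


# Toy dataset generated from y = 3x, so the fitted weight should approach 3.
data = [(float(x), 3.0 * x) for x in range(1, 9)]
w = 0.0
for _ in range(200):
    w -= 0.01 * data_parallel_gradient(data, w)
```

The frameworks in this category add what the sketch leaves out, such as scheduling across machines, fault tolerance, and efficient communication of gradients or tasks.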

dask (🥇35 · ⭐ 7.7K) - Parallel computing with task scheduling. BSD-3
  • GitHub (👨‍💻 420 · 🔀 1.2K · 📦 24K · 📋 3.5K - 19% open · ⏱️ 21.12.2020):

    git clone https://github.com/dask/dask
  • PyPi (📥 1.4M / month · 📦 3.9K · ⏱️ 11.12.2020):

    pip install dask
  • Conda (📥 2.8M · ⏱️ 11.12.2020):

    conda install -c conda-forge dask
Ray (🥇33 · ⭐ 14K) - An open source framework that provides a simple, universal API for.. Apache-2
  • GitHub (👨‍💻 410 · 🔀 2.3K · 📦 1.4K · 📋 5.5K - 21% open · ⏱️ 27.12.2020):

    git clone https://github.com/ray-project/ray
  • PyPi (📥 220K / month · 📦 160 · ⏱️ 24.12.2020):

    pip install ray
horovod (🥇30 · ⭐ 11K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
  • GitHub (👨‍💻 110 · 🔀 1.7K · 📦 300 · 📋 1.6K - 15% open · ⏱️ 22.12.2020):

    git clone https://github.com/horovod/horovod
  • PyPi (📥 110K / month · 📦 20 · ⏱️ 01.10.2020):

    pip install horovod
ipyparallel (🥈29 · ⭐ 1.8K) - Interactive Parallel Computing in Python. BSD-3
  • GitHub (👨‍💻 94 · 🔀 720 · 📦 1.3K · 📋 250 - 56% open · ⏱️ 24.08.2020):

    git clone https://github.com/ipython/ipyparallel
  • PyPi (📥 61K / month · 📦 490 · ⏱️ 05.05.2020):

    pip install ipyparallel
  • Conda (📥 360K · ⏱️ 14.12.2020):

    conda install -c conda-forge ipyparallel
dask-ml (🥈26 · ⭐ 680) - Scalable Machine Learning with Dask. BSD-3
  • GitHub (👨‍💻 60 · 🔀 180 · 📦 320 · 📋 370 - 46% open · ⏱️ 25.11.2020):

    git clone https://github.com/dask/dask-ml
  • PyPi (📥 45K / month · 📦 42 · ⏱️ 24.09.2020):

    pip install dask-ml
  • Conda (📥 190K · ⏱️ 24.09.2020):

    conda install -c conda-forge dask-ml
Mesh (🥈26 · ⭐ 600) - Mesh TensorFlow: Model Parallelism Made Easier. Apache-2
  • GitHub (👨‍💻 38 · 🔀 110 · 📦 470 · 📋 84 - 85% open · ⏱️ 21.12.2020):

    git clone https://github.com/tensorflow/mesh
  • PyPi (📥 78K / month · 📦 28 · ⏱️ 17.12.2020):

    pip install mesh-tensorflow
mpi4py (🥈26 · ⭐ 370) - Python bindings for MPI. BSD-3
  • GitHub (👨‍💻 13 · 🔀 60 · 📥 200 · 📋 4 - 25% open · ⏱️ 22.12.2020):

    git clone https://github.com/mpi4py/mpi4py
  • PyPi (📥 160K / month · 📦 700 · ⏱️ 04.11.2019):

    pip install mpi4py
  • Conda (📥 490K · ⏱️ 21.11.2020):

    conda install -c conda-forge mpi4py
BigDL (🥈25 · ⭐ 3.7K) - BigDL: Distributed Deep Learning Framework for Apache Spark. Apache-2
  • GitHub (👨‍💻 71 · 🔀 900 · 📦 19 · 📋 910 - 19% open · ⏱️ 23.12.2020):

    git clone https://github.com/intel-analytics/BigDL
  • PyPi (📥 1.2K / month · 📦 3 · ⏱️ 06.12.2020):

    pip install bigdl
  • Maven (⏱️ 05.12.2020):

     <dependency>
     	<groupId>com.intel.analytics.bigdl</groupId>
     	<artifactId>bigdl-SPARK_2.4</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
TensorFlowOnSpark (🥉24 · ⭐ 3.6K) - TensorFlowOnSpark brings TensorFlow programs to Apache.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 930 · 📋 340 - 1% open · ⏱️ 21.12.2020):

    git clone https://github.com/yahoo/TensorFlowOnSpark
  • PyPi (📥 34K / month · 📦 5 · ⏱️ 18.12.2020):

    pip install tensorflowonspark
DeepSpeed (🥉23 · ⭐ 3.9K) - DeepSpeed is a deep learning optimization library that makes.. MIT
  • GitHub (👨‍💻 34 · 🔀 340 · 📦 5 · 📋 230 - 39% open · ⏱️ 23.12.2020):

    git clone https://github.com/microsoft/DeepSpeed
  • PyPi (📥 1K / month · ⏱️ 23.12.2020):

    pip install deepspeed
  • Dockerhub (📥 6.5K · ⭐ 2 · ⏱️ 20.11.2020):

    docker pull deepspeed/deepspeed
MMLSpark (🥉23 · ⭐ 2.2K) - Microsoft Machine Learning for Apache Spark. MIT
  • GitHub (👨‍💻 61 · 🔀 470 · 📋 380 - 44% open · ⏱️ 03.12.2020):

    git clone https://github.com/Azure/mmlspark
  • PyPi (📥 19K / month · ⏱️ 18.03.2020):

    pip install mmlspark
Elephas (🥉23 · ⭐ 1.4K) - Distributed Deep learning with Keras & Spark. MIT
  • GitHub (👨‍💻 25 · 🔀 280 · 📋 130 - 38% open · ⏱️ 22.12.2020):

    git clone https://github.com/maxpumperla/elephas
  • PyPi (📥 11K / month · 📦 3 · ⏱️ 10.02.2020):

    pip install elephas
analytics-zoo (🥉22 · ⭐ 2.2K · ➕) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
  • GitHub (👨‍💻 81 · 🔀 620 · 📦 2 · 📋 890 - 47% open · ⏱️ 26.12.2020):

    git clone https://github.com/intel-analytics/analytics-zoo
  • PyPi (📥 780 / month · ⏱️ 17.12.2020):

    pip install analytics-zoo
BytePS (🥉20 · ⭐ 2.6K) - A high performance and generic framework for distributed DNN.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 360 · 📋 200 - 35% open · ⏱️ 06.12.2020):

    git clone https://github.com/bytedance/byteps
  • PyPi (📥 150 / month · ⏱️ 04.11.2020):

    pip install byteps
  • Dockerhub (📥 930 · ⏱️ 03.03.2020):

    docker pull bytepsimage/tensorflow
Fiber (🥉20 · ⭐ 830) - Distributed Computing for AI Made Simple. Apache-2
  • GitHub (👨‍💻 5 · 🔀 87 · 📦 17 · 📋 17 - 70% open · ⏱️ 06.10.2020):

    git clone https://github.com/uber/fiber
  • PyPi (📥 1.5K / month · 📦 1 · ⏱️ 09.07.2020):

    pip install fiber
Submit it (🥉20 · ⭐ 270) - Python 3.6+ toolbox for submitting jobs to Slurm. MIT
  • GitHub (👨‍💻 8 · 🔀 16 · 📦 69 · 📋 21 - 42% open · ⏱️ 04.12.2020):

    git clone https://github.com/facebookincubator/submitit
  • PyPi (📥 1.7K / month · ⏱️ 01.12.2020):

    pip install submitit
  • Conda (📥 630 · ⏱️ 19.11.2020):

    conda install -c conda-forge submitit
Apache Singa (🥉19 · ⭐ 2.2K) - a distributed deep learning platform. Apache-2
  • GitHub (👨‍💻 70 · 🔀 570 · 📋 81 - 60% open · ⏱️ 11.12.2020):

    git clone https://github.com/apache/singa
  • Conda (📥 240 · ⏱️ 19.10.2020):

    conda install -c nusdbsystem singa
  • Dockerhub (📥 160 · ⭐ 2 · ⏱️ 04.06.2019):

    docker pull apache/singa
sk-dist (🥉18 · ⭐ 250) - Distributed scikit-learn meta-estimators in PySpark. Apache-2
  • GitHub (👨‍💻 6 · 🔀 42 · 📦 6 · 📋 15 - 40% open · ⏱️ 03.12.2020):

    git clone https://github.com/Ibotta/sk-dist
  • PyPi (📥 4.7K / month · 📦 2 · ⏱️ 14.05.2020):

    pip install sk-dist
Hivemind (🥉17 · ⭐ 630) - Decentralized deep learning in PyTorch. Built to train models on.. MIT
  • GitHub (👨‍💻 8 · 🔀 28 · 📦 1 · 📋 31 - 45% open · ⏱️ 26.12.2020):

    git clone https://github.com/learning-at-home/hivemind
  • PyPi (📥 38 / month · 📦 1 · ⏱️ 28.08.2020):

    pip install hivemind
FairScale (🥉17 · ⭐ 590 · 🐣) - PyTorch extensions for high performance and large scale.. BSD-3
  • GitHub (👨‍💻 16 · 🔀 39 · 📦 1 · 📋 72 - 31% open · ⏱️ 24.12.2020):

    git clone https://github.com/facebookresearch/fairscale
  • PyPi (📥 500 / month · ⏱️ 02.12.2020):

    pip install fairscale
Show 3 hidden projects...
  • DEAP (🥈28 · ⭐ 4K) - Distributed Evolutionary Algorithms in Python. ❗️LGPL-3.0
  • TensorFrames (🥉19 · ⭐ 770 · 💀) - [DEPRECATED] Tensorflow wrapper for DataFrames on Apache.. Apache-2
  • LazyCluster (🥉12 · ⭐ 31) - Distributed machine learning made simple. Apache-2

Hyperparameter Optimization & AutoML

Back to top

Libraries for hyperparameter optimization, AutoML, and neural architecture search.
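
For readers new to the topic, the loop that these libraries automate can be sketched with plain random search. This is only a standard-library illustration under stated assumptions: `objective` is a made-up stand-in for a real validation loss, and `random_search` is a hypothetical helper, not a function from any project listed here.

```python
import random


def objective(lr, depth):
    """Hypothetical validation loss, lowest at lr=0.1 and depth=5."""
    return (lr - 0.1) ** 2 + 0.01 * (depth - 5) ** 2


def random_search(n_trials, seed=0):
    """Sample hyperparameters at random and keep the best trial seen so far."""
    rng = random.Random(seed)
    best_params, best_loss = None, float("inf")
    for _ in range(n_trials):
        params = {"lr": rng.uniform(0.001, 1.0), "depth": rng.randint(1, 10)}
        loss = objective(**params)
        if loss < best_loss:
            best_params, best_loss = params, loss
    return best_params, best_loss


best_params, best_loss = random_search(n_trials=200)
```

The projects below typically replace the uniform sampling step with smarter strategies (for example Bayesian or evolutionary methods) and add pruning, parallel trials, and result tracking.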

Hyperopt (🥇31 · ⭐ 5.3K) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
  • GitHub (👨‍💻 84 · 🔀 840 · 📦 2.4K · 📋 540 - 59% open · ⏱️ 24.12.2020):

    git clone https://github.com/hyperopt/hyperopt
  • PyPi (📥 470K / month · 📦 500 · ⏱️ 07.10.2020):

    pip install hyperopt
  • Conda (📥 150K · ⏱️ 14.10.2020):

    conda install -c conda-forge hyperopt
Optuna (🥇31 · ⭐ 3.8K) - A hyperparameter optimization framework. MIT
  • GitHub (👨‍💻 120 · 🔀 430 · 📦 810 · 📋 610 - 26% open · ⏱️ 25.12.2020):

    git clone https://github.com/optuna/optuna
  • PyPi (📥 170K / month · 📦 52 · ⏱️ 04.11.2020):

    pip install optuna
  • Conda (📥 16K · ⏱️ 11.11.2020):

    conda install -c conda-forge optuna
scikit-optimize (🥇31 · ⭐ 2K) - Sequential model-based optimization with a `scipy.optimize`.. BSD-3
  • GitHub (👨‍💻 67 · 🔀 370 · 📦 1.3K · 📋 510 - 31% open · ⏱️ 29.09.2020):

    git clone https://github.com/scikit-optimize/scikit-optimize
  • PyPi (📥 500K / month · 📦 160 · ⏱️ 04.09.2020):

    pip install scikit-optimize
  • Conda (📥 180K · ⏱️ 04.09.2020):

    conda install -c conda-forge scikit-optimize
Keras Tuner (🥇29 · ⭐ 2.2K) - Hyperparameter tuning for humans. Apache-2
  • GitHub (👨‍💻 28 · 🔀 250 · 📦 330 · 📋 280 - 52% open · ⏱️ 09.12.2020):

    git clone https://github.com/keras-team/keras-tuner
  • PyPi (📥 830K / month · 📦 2 · ⏱️ 20.11.2020):

    pip install keras-tuner
featuretools (🥈28 · ⭐ 5.3K) - An open source python library for automated feature engineering. BSD-3
  • GitHub (👨‍💻 49 · 🔀 680 · 📦 630 · 📋 490 - 21% open · ⏱️ 21.12.2020):

    git clone https://github.com/alteryx/featuretools
  • PyPi (📥 66K / month · 📦 70 · ⏱️ 30.11.2020):

    pip install featuretools
  • Conda (📥 42K · ⏱️ 30.11.2020):

    conda install -c conda-forge featuretools
AutoKeras (🥈27 · ⭐ 7.7K) - AutoML library for deep learning. Apache-2
  • GitHub (👨‍💻 120 · 🔀 1.2K · 📦 140 · 📋 710 - 8% open · ⏱️ 11.12.2020):

    git clone https://github.com/keras-team/autokeras
  • PyPi (📥 6.3K / month · 📦 7 · ⏱️ 30.11.2020):

    pip install autokeras
NNI (🥈26 · ⭐ 8.6K) - An open source AutoML toolkit for automate machine learning lifecycle,.. MIT
  • GitHub (👨‍💻 110 · 🔀 1.1K · 📦 76 · 📋 1.1K - 21% open · ⏱️ 21.12.2020):

    git clone https://github.com/microsoft/nni
  • PyPi (📥 2.1K / month · 📦 8 · ⏱️ 22.10.2020):

    pip install nni
Bayesian Optimization (🥈26 · ⭐ 4.7K) - A Python implementation of global optimization with.. MIT
  • GitHub (👨‍💻 27 · 🔀 1K · 📥 44 · 📦 550 · 📋 200 - 20% open · ⏱️ 19.12.2020):

    git clone https://github.com/fmfn/BayesianOptimization
  • PyPi (📥 58K / month · 📦 92 · ⏱️ 16.05.2020):

    pip install bayesian-optimization
BoTorch (🥈26 · ⭐ 1.8K) - Bayesian optimization in PyTorch. MIT
  • GitHub (👨‍💻 49 · 🔀 170 · 📦 70 · 📋 150 - 27% open · ⏱️ 23.12.2020):

    git clone https://github.com/pytorch/botorch
  • PyPi (📥 29K / month · ⏱️ 08.12.2020):

    pip install botorch
SMAC3 (🥈26 · ⭐ 540) - Sequential Model-based Algorithm Configuration. BSD-3
  • GitHub (👨‍💻 30 · 🔀 150 · 📋 330 - 23% open · ⏱️ 29.10.2020):

    git clone https://github.com/automl/SMAC3
  • PyPi (📥 10K / month · 📦 58 · ⏱️ 29.10.2020):

    pip install smac
auto-sklearn (🥈25 · ⭐ 5.1K) - Automated Machine Learning with scikit-learn. BSD-3
  • GitHub (👨‍💻 63 · 🔀 960 · 📦 120 · 📋 670 - 17% open · ⏱️ 22.12.2020):

    git clone https://github.com/automl/auto-sklearn
  • PyPi (📥 7K / month · 📦 15 · ⏱️ 22.12.2020):

    pip install auto-sklearn
AutoGluon (🥈25 · ⭐ 2.8K) - AutoGluon: AutoML for Text, Image, and Tabular Data. Apache-2
  • GitHub (👨‍💻 50 · 🔀 370 · 📦 42 · 📋 360 - 40% open · ⏱️ 25.12.2020):

    git clone https://github.com/awslabs/autogluon
  • PyPi (📥 12K / month · ⏱️ 26.12.2020):

    pip install autogluon
nevergrad (🥈25 · ⭐ 2.8K) - A Python toolbox for performing gradient-free optimization. MIT
  • GitHub (👨‍💻 40 · 🔀 260 · 📦 110 · 📋 180 - 37% open · ⏱️ 25.12.2020):

    git clone https://github.com/facebookresearch/nevergrad
  • PyPi (📥 7.1K / month · 📦 14 · ⏱️ 10.12.2020):

    pip install nevergrad
  • Conda (📥 5.4K · ⏱️ 14.12.2020):

    conda install -c conda-forge nevergrad
Ax (🥈25 · ⭐ 1.4K) - Adaptive Experimentation Platform. MIT
  • GitHub (👨‍💻 89 · 🔀 140 · 📦 110 · 📋 200 - 15% open · ⏱️ 24.12.2020):

    git clone https://github.com/facebook/Ax
  • PyPi (📥 18K / month · 📦 4 · ⏱️ 08.12.2020):

    pip install ax-platform
Hyperas (🥈24 · ⭐ 2.1K · 📉) - Keras + Hyperopt: A very simple wrapper for convenient.. MIT
  • GitHub (👨‍💻 21 · 🔀 300 · 📋 250 - 35% open · ⏱️ 22.12.2020):

    git clone https://github.com/maxpumperla/hyperas
  • PyPi (📥 14K / month · 📦 21 · ⏱️ 28.02.2019):

    pip install hyperas
GPyOpt (🥈24 · ⭐ 700) - Gaussian Process Optimization using GPy. BSD-3
  • GitHub (👨‍💻 49 · 🔀 210 · 📦 160 · 📋 290 - 34% open · ⏱️ 05.11.2020):

    git clone https://github.com/SheffieldML/GPyOpt
  • PyPi (📥 5.2K / month · 📦 36 · ⏱️ 19.03.2020):

    pip install gpyopt
Orion (🥉23 · ⭐ 180) - Asynchronous Distributed Hyperparameter Optimization. BSD-3
  • GitHub (👨‍💻 18 · 🔀 35 · 📦 39 · 📋 110 - 13% open · ⏱️ 23.12.2020):

    git clone https://github.com/Epistimio/orion
  • PyPi (📥 1.1K / month · 📦 8 · ⏱️ 04.12.2020):

    pip install orion
AdaNet (🥉22 · ⭐ 3.2K) - Fast and flexible AutoML with learning guarantees. Apache-2
  • GitHub (👨‍💻 26 · 🔀 490 · 📦 33 · 📋 110 - 57% open · ⏱️ 09.07.2020):

    git clone https://github.com/tensorflow/adanet
  • PyPi (📥 690 / month · 📦 2 · ⏱️ 09.07.2020):

    pip install adanet
Talos (🥉22 · ⭐ 1.3K) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch. MIT
  • GitHub (👨‍💻 19 · 🔀 220 · 📦 100 · 📋 380 - 10% open · ⏱️ 21.11.2020):

    git clone https://github.com/autonomio/talos
  • PyPi (📥 1.1K / month · 📦 6 · ⏱️ 09.11.2020):

    pip install talos
Neuraxle (🥉21 · ⭐ 340 · ➕) - A Sklearn-like Framework for Hyperparameter Tuning and AutoML.. Apache-2
  • GitHub (👨‍💻 7 · 🔀 39 · 📦 17 · 📋 240 - 48% open · ⏱️ 09.12.2020):

    git clone https://github.com/Neuraxio/Neuraxle
  • PyPi (📥 450 / month · ⏱️ 20.09.2020):

    pip install neuraxle
mljar-supervised (🥉20 · ⭐ 600 · ➕) - Automates Machine Learning Pipeline with Feature.. MIT
  • GitHub (👨‍💻 7 · 🔀 92 · 📦 8 · 📋 250 - 19% open · ⏱️ 17.12.2020):

    git clone https://github.com/mljar/mljar-supervised
  • PyPi (📥 1.2K / month · ⏱️ 17.12.2020):

    pip install mljar-supervised
Test Tube (🥉18 · ⭐ 640 · 💤) - Python library to easily log experiments and parallelize.. MIT
  • GitHub (👨‍💻 16 · 🔀 59 · 📥 7 · 📋 43 - 51% open · ⏱️ 17.03.2020):

    git clone https://github.com/williamFalcon/test-tube
  • PyPi (📥 7.7K / month · 📦 5 · ⏱️ 12.12.2018):

    pip install test_tube
Dragonfly (🥉18 · ⭐ 550) - An open source python library for scalable Bayesian optimisation. MIT
  • GitHub (👨‍💻 12 · 🔀 72 · 📋 38 - 55% open · ⏱️ 03.07.2020):

    git clone https://github.com/dragonfly/dragonfly
  • PyPi (📥 3.9K / month · 📦 2 · ⏱️ 03.07.2020):

    pip install dragonfly-opt
HyperparameterHunter (🥉16 · ⭐ 630) - Easy hyperparameter optimization and automatic result.. MIT
  • GitHub (👨‍💻 3 · 🔀 77 · 📥 240 · 📋 120 - 30% open · ⏱️ 21.10.2020):

    git clone https://github.com/HunterMcGushion/hyperparameter_hunter
  • PyPi (📥 95 / month · 📦 1 · ⏱️ 06.08.2019):

    pip install hyperparameter-hunter
AlphaPy (🥉16 · ⭐ 530) - Automated Machine Learning [AutoML] with Python, scikit-learn,.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 110 · 📦 2 · 📋 39 - 25% open · ⏱️ 06.09.2020):

    git clone https://github.com/ScottfreeLLC/AlphaPy
  • PyPi (📥 110 / month · ⏱️ 25.08.2020):

    pip install alphapy
Auto Tune Models (🥉16 · ⭐ 500 · 💤) - Auto Tune Models - A multi-tenant, multi-data system for.. MIT
  • GitHub (👨‍💻 16 · 🔀 120 · 📦 6 · 📋 88 - 19% open · ⏱️ 21.02.2020):

    git clone https://github.com/HDI-Project/ATM
  • PyPi (📥 100 / month · ⏱️ 24.06.2019):

    pip install atm
Parfit (🥉15 · ⭐ 200 · 💤) - A package for parallelizing the fit and flexibly scoring of.. MIT
  • GitHub (👨‍💻 2 · 🔀 26 · 📦 6 · 📋 10 - 50% open · ⏱️ 04.04.2020):

    git clone https://github.com/jmcarpenter2/parfit
  • PyPi (📥 4.1K / month · ⏱️ 11.10.2018):

    pip install parfit
ENAS (🥉14 · ⭐ 2.4K) - PyTorch implementation of Efficient Neural Architecture Search via.. Apache-2
  • GitHub (👨‍💻 6 · 🔀 440 · 📋 44 - 84% open · ⏱️ 16.06.2020):

    git clone https://github.com/carpedm20/ENAS-pytorch
Devol (🥉11 · ⭐ 920) - Genetic neural architecture search with Keras. MIT
  • GitHub (👨‍💻 18 · 🔀 110 · 📋 27 - 25% open · ⏱️ 05.07.2020):

    git clone https://github.com/joeddav/devol
Show 11 hidden projects...
  • TPOT (🥈28 · ⭐ 7.7K) - A Python Automated Machine Learning tool that optimizes machine.. ❗️LGPL-3.0
  • MLBox (🥉23 · ⭐ 1.2K) - MLBox is a powerful Automated Machine Learning python library. ❗️BSD-1-Clause
  • auto_ml (🥉20 · ⭐ 1.5K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
  • HpBandSter (🥉19 · ⭐ 430 · 💀) - a distributed Hyperband implementation on Steroids. BSD-3
  • Sherpa (🥉19 · ⭐ 280) - Hyperparameter optimization that enables researchers to experiment,.. ❗️GPL-3.0
  • Advisor (🥉17 · ⭐ 1.3K · 💀) - Open-source implementation of Google Vizier for hyper.. Apache-2
  • automl-gs (🥉16 · ⭐ 1.7K · 💀) - Provide an input CSV and a target field to predict, generate a.. MIT
  • Xcessiv (🥉16 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated.. Apache-2
  • Hypermax (🥉14 · ⭐ 94) - Better, faster hyper-parameter optimization. BSD-3
  • Auptimizer (🥉13 · ⭐ 150) - An automatic ML model optimization tool. ❗️GPL-3.0
  • Hypertunity (🥉11 · ⭐ 120 · 💀) - A toolset for black-box hyperparameter optimisation. Apache-2

Reinforcement Learning

Back to top

Libraries for building and evaluating reinforcement learning & agent-based systems.
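
To make the agent-environment loop concrete, here is a minimal tabular Q-learning sketch on a toy five-cell corridor, written with the standard library only. The environment, the constants, and the helper names (`step`, `q_learning`) are all assumptions made for illustration; none of them come from the libraries listed below.

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # step left or step right
GAMMA, ALPHA = 0.9, 0.5


def step(state, action):
    """Move within the corridor; reaching the last cell pays 1 and ends the episode."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def q_learning(episodes=500, seed=0):
    """Off-policy Q-learning driven by a uniformly random behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            a = rng.randrange(len(ACTIONS))
            nxt, reward, done = step(state, ACTIONS[a])
            target = reward if done else reward + GAMMA * max(q[nxt])
            q[state][a] += ALPHA * (target - q[state][a])
            state = nxt
    return q


q = q_learning()
# Greedy policy for the non-terminal cells: the action with the larger Q-value.
policy = [ACTIONS[row.index(max(row))] for row in q[:-1]]
```

In this toy setting the greedy policy moves right in every cell. The libraries in this category supply what the sketch omits: standard environments, neural function approximators, replay buffers, and tested implementations of more advanced algorithms.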

OpenAI Gym (🥇35 · ⭐ 23K) - A toolkit for developing and comparing reinforcement learning.. MIT
  • GitHub (👨‍💻 270 · 🔀 6.6K · 📦 17K · 📋 1.3K - 17% open · ⏱️ 18.12.2020):

    git clone https://github.com/openai/gym
  • PyPi (📥 320K / month · 📦 3.6K · ⏱️ 19.12.2020):

    pip install gym
Dopamine (🥇27 · ⭐ 9.2K) - Dopamine is a research framework for fast prototyping of.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 1.2K · 📋 140 - 51% open · ⏱️ 24.11.2020):

    git clone https://github.com/google/dopamine
  • PyPi (📥 66K / month · 📦 31 · ⏱️ 16.10.2020):

    pip install dopamine-rl
TF-Agents (🥇27 · ⭐ 1.7K) - TF-Agents is a library for Reinforcement Learning in.. Apache-2
  • GitHub (👨‍💻 84 · 🔀 440 · 📦 330 · 📋 400 - 24% open · ⏱️ 21.12.2020):

    git clone https://github.com/tensorflow/agents
  • PyPi (📥 6.9K / month · 📦 6 · ⏱️ 15.12.2020):

    pip install tf-agents
TensorLayer (🥈26 · ⭐ 6.4K) - Deep Learning and Reinforcement Learning Library for.. Apache-2
  • GitHub (👨‍💻 130 · 🔀 1.4K · 📥 1.2K · 📋 450 - 10% open · ⏱️ 29.10.2020):

    git clone https://github.com/tensorlayer/tensorlayer
  • PyPi (📥 2.7K / month · 📦 68 · ⏱️ 19.06.2020):

    pip install tensorlayer
Stable Baselines (🥈25 · ⭐ 2.7K) - A fork of OpenAI Baselines, implementations of reinforcement.. MIT
  • GitHub (👨‍💻 110 · 🔀 540 · 📋 820 - 14% open · ⏱️ 23.12.2020):

    git clone https://github.com/hill-a/stable-baselines
  • PyPi (📥 8.9K / month · 📦 25 · ⏱️ 05.08.2020):

    pip install stable-baselines
ViZDoom (🥈25 · ⭐ 1.1K) - Doom-based AI Research Platform for Reinforcement Learning from Raw.. MIT
  • GitHub (👨‍💻 43 · 🔀 280 · 📥 10K · 📦 96 · 📋 410 - 21% open · ⏱️ 16.12.2020):

    git clone https://github.com/mwydmuch/ViZDoom
  • PyPi (📥 390 / month · 📦 12 · ⏱️ 29.10.2020):

    pip install vizdoom
TensorForce (🥉24 · ⭐ 2.8K) - Tensorforce: a TensorFlow library for applied.. Apache-2
  • GitHub (👨‍💻 76 · 🔀 470 · 📋 540 - 1% open · ⏱️ 22.12.2020):

    git clone https://github.com/tensorforce/tensorforce
  • PyPi (📥 920 / month · 📦 23 · ⏱️ 07.09.2019):

    pip install tensorforce
Acme (🥉23 · ⭐ 1.8K) - A library of reinforcement learning components and agents. Apache-2
  • GitHub (👨‍💻 29 · 🔀 170 · 📦 16 · 📋 80 - 46% open · ⏱️ 17.12.2020):

    git clone https://github.com/deepmind/acme
  • PyPi (📥 600 / month · ⏱️ 24.07.2020):

    pip install dm-acme
garage (🥉23 · ⭐ 990) - A toolkit for reproducible reinforcement learning research. MIT
  • GitHub (👨‍💻 73 · 🔀 180 · 📦 13 · 📋 950 - 19% open · ⏱️ 23.12.2020):

    git clone https://github.com/rlworkgroup/garage
  • PyPi (📥 260 / month · ⏱️ 02.10.2020):

    pip install garage
ChainerRL (🥉23 · ⭐ 920) - ChainerRL is a deep reinforcement learning library built on top of.. MIT
  • GitHub (👨‍💻 29 · 🔀 210 · 📦 88 · 📋 220 - 33% open · ⏱️ 09.12.2020):

    git clone https://github.com/chainer/chainerrl
  • PyPi (📥 870 / month · 📦 14 · ⏱️ 14.02.2020):

    pip install chainerrl
TRFL (🥉20 · ⭐ 3.1K · 💤) - TensorFlow Reinforcement Learning. Apache-2
  • GitHub (👨‍💻 12 · 🔀 360 · 📦 36 · 📋 20 - 20% open · ⏱️ 20.04.2020):

    git clone https://github.com/deepmind/trfl
  • PyPi (📥 690 / month · 📦 2 · ⏱️ 20.04.2020):

    pip install trfl
Coach (🥉19 · ⭐ 1.9K) - Reinforcement Learning Coach by Intel AI Lab enables easy.. Apache-2
  • GitHub (👨‍💻 34 · 🔀 370 · 📋 260 - 30% open · ⏱️ 15.12.2020):

    git clone https://github.com/IntelLabs/coach
  • PyPi (📥 140 / month · 📦 1 · ⏱️ 10.10.2019):

    pip install rl_coach
PARL (🥉19 · ⭐ 1.7K) - A high-performance distributed training framework for.. Apache-2
  • GitHub (👨‍💻 20 · 🔀 400 · 📦 65 · 📋 190 - 24% open · ⏱️ 25.12.2020):

    git clone https://github.com/PaddlePaddle/PARL
  • PyPi (📥 470 / month · ⏱️ 20.12.2020):

    pip install parl
PFRL (🥉19 · ⭐ 460 · 🐣) - PFRL: a PyTorch-based deep reinforcement learning library. MIT
  • GitHub (👨‍💻 10 · 🔀 56 · 📦 4 · 📋 41 - 53% open · ⏱️ 17.12.2020):

    git clone https://github.com/pfnet/pfrl
  • PyPi (📥 610 / month · ⏱️ 17.12.2020):

    pip install pfrl
ReAgent (🥉16 · ⭐ 2.7K) - A platform for Reasoning systems (Reinforcement Learning,.. BSD-3
  • GitHub (👨‍💻 82 · 🔀 370 · 📋 93 - 21% open · ⏱️ 25.12.2020):

    git clone https://github.com/facebookresearch/ReAgent
RLax (🥉16 · ⭐ 520) - A library of reinforcement learning building blocks in JAX. Apache-2 jax
  • GitHub (👨‍💻 10 · 🔀 36 · 📦 9 · 📋 6 - 50% open · ⏱️ 04.12.2020):

    git clone https://github.com/deepmind/rlax
  • PyPi (📥 130 / month · ⏱️ 19.02.2020):

    pip install rlax
Show 2 hidden projects...
  • keras-rl (🥈26 · ⭐ 4.9K · 💀) - Deep Reinforcement Learning for Keras. MIT
  • DeepMind Lab (🥉17 · ⭐ 6.4K) - A customisable 3D platform for agent-based AI research. ❗️GPL-2.0

Recommender Systems

Back to top

Libraries for building and evaluating recommendation systems.
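
As a small illustration of what such libraries compute, the sketch below implements user-based collaborative filtering with cosine similarity in plain Python. The `ratings` data and the `cosine` and `recommend` helpers are invented for this example and are not taken from any project in the list.

```python
from math import sqrt

# Hypothetical user -> {item: rating} data, made up for illustration.
ratings = {
    "alice": {"matrix": 5, "inception": 4, "titanic": 1},
    "bob": {"matrix": 4, "inception": 5},
    "carol": {"titanic": 5, "notebook": 4},
    "dave": {"matrix": 1, "titanic": 4, "notebook": 5},
}


def cosine(u, v):
    """Cosine similarity between two sparse rating vectors."""
    shared = set(u) & set(v)
    if not shared:
        return 0.0
    dot = sum(u[item] * v[item] for item in shared)
    norm_u = sqrt(sum(r * r for r in u.values()))
    norm_v = sqrt(sum(r * r for r in v.values()))
    return dot / (norm_u * norm_v)


def recommend(user, k=1):
    """Rank items the user has not rated by similarity-weighted ratings of others."""
    scores = {}
    for other, their_ratings in ratings.items():
        if other == user:
            continue
        sim = cosine(ratings[user], their_ratings)
        for item, rating in their_ratings.items():
            if item not in ratings[user]:
                scores[item] = scores.get(item, 0.0) + sim * rating
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

For example, `recommend("carol", k=2)` ranks "matrix" ahead of "inception", mostly because dave, whose tastes are closest to carol's, has rated "matrix". The libraries below go further with matrix factorization, implicit-feedback models, and evaluation metrics that scale to real datasets.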

scikit-surprise (🥇28 · ⭐ 4.6K) - A Python scikit for building and analyzing recommender.. BSD-3
  • GitHub (👨‍💻 38 · 🔀 810 · 📦 840 · 📋 320 - 10% open · ⏱️ 05.08.2020):

    git clone https://github.com/NicolasHug/Surprise
  • PyPi (📥 41K / month · 📦 24 · ⏱️ 19.07.2020):

    pip install scikit-surprise
  • Conda (📥 150K · ⏱️ 13.10.2020):

    conda install -c conda-forge scikit-surprise
implicit (🥇28 · ⭐ 2.2K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
  • GitHub (👨‍💻 28 · 🔀 450 · 📦 330 · 📋 330 - 22% open · ⏱️ 15.11.2020):

    git clone https://github.com/benfred/implicit
  • PyPi (📥 94K / month · 📦 22 · ⏱️ 15.09.2020):

    pip install implicit
  • Conda (📥 170K · ⏱️ 24.11.2020):

    conda install -c conda-forge implicit
lightfm (🥈27 · ⭐ 3.4K) - A Python implementation of LightFM, a hybrid recommendation.. Apache-2
  • GitHub (👨‍💻 42 · 🔀 560 · 📦 370 · 📋 400 - 31% open · ⏱️ 27.11.2020):

    git clone https://github.com/lyst/lightfm
  • PyPi (📥 97K / month · 📦 28 · ⏱️ 27.11.2020):

    pip install lightfm
  • Conda (📥 71K · ⏱️ 07.12.2020):

    conda install -c conda-forge lightfm
Recommenders (🥈21 · ⭐ 8.9K) - Best Practices on Recommendation Systems. MIT
  • GitHub (👨‍💻 86 · 🔀 1.5K · 📦 1 · 📋 530 - 19% open · ⏱️ 27.11.2020):

    git clone https://github.com/microsoft/recommenders
TF Ranking (🥈21 · ⭐ 2K) - Learning to Rank in TensorFlow. Apache-2
  • GitHub (👨‍💻 17 · 🔀 350 · 📋 220 - 10% open · ⏱️ 19.08.2020):

    git clone https://github.com/tensorflow/ranking
  • PyPi (📥 21K / month · 📦 1 · ⏱️ 19.08.2020):

    pip install tensorflow_ranking
tensorrec (🥈21 · ⭐ 1.1K · 💤) - A TensorFlow recommendation algorithm and framework in.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 200 · 📦 22 · 📋 120 - 26% open · ⏱️ 04.02.2020):

    git clone https://github.com/jfkirk/tensorrec
  • PyPi (📥 1.1K / month · 📦 1 · ⏱️ 02.04.2019):

    pip install tensorrec
RecBole (🥉20 · ⭐ 650 · 🐣) - A unified, comprehensive and efficient recommendation library. MIT
  • GitHub (👨‍💻 21 · 🔀 86 · 📋 42 - 40% open · ⏱️ 24.12.2020):

    git clone https://github.com/RUCAIBox/RecBole
  • PyPi (📥 140 / month · ⏱️ 06.12.2020):

    pip install recbole
  • Conda (📥 140 · ⏱️ 06.12.2020):

    conda install -c aibox recbole
recmetrics (🥉20 · ⭐ 220) - A library of metrics for evaluating recommender systems. MIT
  • GitHub (👨‍💻 12 · 🔀 52 · 📦 16 · 📋 14 - 50% open · ⏱️ 03.12.2020):

    git clone https://github.com/statisticianinstilettos/recmetrics
  • PyPi (📥 1.1K / month · ⏱️ 10.02.2019):

    pip install recmetrics
TF Recommenders (🥉19 · ⭐ 660 · 🐣) - TensorFlow Recommenders is a library for building.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 65 · 📦 10 · 📋 64 - 37% open · ⏱️ 22.12.2020):

    git clone https://github.com/tensorflow/recommenders
  • PyPi (📥 3.1K / month · ⏱️ 22.12.2020):

    pip install tensorflow-recommenders
Spotlight (🥉18 · ⭐ 2.4K · 💤) - Deep recommender models using PyTorch. MIT
  • GitHub (👨‍💻 11 · 🔀 370 · 📋 110 - 57% open · ⏱️ 09.02.2020):

    git clone https://github.com/maciejkula/spotlight
  • Conda (📥 5.9K · ⏱️ 27.05.2018):

    conda install -c maciejkula spotlight
Case Recommender (🥉17 · ⭐ 300 · 💤) - Case Recommender: A Flexible and Extensible Python.. MIT
  • GitHub (👨‍💻 10 · 🔀 66 · 📦 5 · 📋 22 - 27% open · ⏱️ 26.05.2020):

    git clone https://github.com/caserec/CaseRecommender
  • PyPi (📥 190 / month · ⏱️ 20.08.2019):

    pip install caserecommender
OpenRec (🥉16 · ⭐ 350 · 💤) - OpenRec is an open-source and modular library for neural.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 79 · 📦 1 · 📋 16 - 25% open · ⏱️ 19.02.2020):

    git clone https://github.com/ylongqi/openrec
  • PyPi (📥 63 / month · 📦 1 · ⏱️ 11.09.2019):

    pip install openrec

Privacy Machine Learning

Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
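
As a rough illustration of the differential privacy idea mentioned above, the sketch below adds Laplace noise to a count query using only the Python standard library. The names `laplace_noise` and `private_count` and the example data are hypothetical and do not come from any library in this list; the listed libraries provide far more complete mechanisms (privacy accounting, gradient clipping, and so on).

```python
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two Exp(1) draws."""
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def private_count(values, predicate, epsilon, rng):
    """Return a noisy count of the values that satisfy predicate.

    A count changes by at most 1 when one record is added or removed,
    so the Laplace noise scale is 1 / epsilon.
    """
    true_count = sum(1 for v in values if predicate(v))
    return true_count + laplace_noise(1.0 / epsilon, rng)


# Hypothetical example data: ages of ten people.
ages = [23, 35, 41, 29, 52, 61, 19, 44, 38, 57]
rng = random.Random(0)  # fixed seed only to make the sketch reproducible
noisy = private_count(ages, lambda age: age >= 40, epsilon=0.5, rng=rng)
print(f"noisy count of people aged 40+: {noisy:.2f}")
```

A smaller `epsilon` gives stronger privacy but a noisier answer; averaged over many independent runs, the result stays centered on the true count.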

PySyft (🥇26 · ⭐ 6.7K) - A library for answering questions using data you cannot see. MIT
  • GitHub (👨‍💻 330 · 🔀 1.5K · 📋 2.6K - 6% open · ⏱️ 24.12.2020):

    git clone https://github.com/OpenMined/PySyft
  • PyPi (📥 2.3K / month · 📦 3 · ⏱️ 14.09.2020):

    pip install syft
TensorFlow Privacy (🥈21 · ⭐ 1.3K) - Library for training machine learning models with privacy.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 260 · 📥 41 · 📋 110 - 34% open · ⏱️ 23.12.2020):

    git clone https://github.com/tensorflow/privacy
  • PyPi (📥 4.4K / month · 📦 1 · ⏱️ 03.09.2020):

    pip install tensorflow-privacy
TFEncrypted (🥈21 · ⭐ 800) - A Framework for Encrypted Machine Learning in TensorFlow. Apache-2
  • GitHub (👨‍💻 28 · 🔀 130 · 📦 46 · 📋 390 - 41% open · ⏱️ 19.08.2020):

    git clone https://github.com/tf-encrypted/tf-encrypted
  • PyPi (📥 530 / month · 📦 8 · ⏱️ 21.10.2019):

    pip install tf-encrypted
FATE (🥉20 · ⭐ 2.6K) - An Industrial Grade Federated Learning Framework. Apache-2
  • GitHub (👨‍💻 50 · 🔀 760 · 📋 770 - 32% open · ⏱️ 22.12.2020):

    git clone https://github.com/FederatedAI/FATE
Opacus (🥉20 · ⭐ 690) - Training PyTorch models with differential privacy. Apache-2
  • GitHub (👨‍💻 23 · 🔀 87 · 📥 28 · 📦 9 · 📋 53 - 13% open · ⏱️ 22.12.2020):

    git clone https://github.com/pytorch/opacus
  • PyPi (📥 690 / month · ⏱️ 17.12.2020):

    pip install opacus
CrypTen (🥉17 · ⭐ 680) - A framework for Privacy Preserving Machine Learning. MIT
  • GitHub (👨‍💻 23 · 🔀 87 · 📦 5 · 📋 46 - 32% open · ⏱️ 23.12.2020):

    git clone https://github.com/facebookresearch/CrypTen
  • PyPi (📥 190 / month · ⏱️ 05.02.2020):

    pip install crypten

Workflow & Experiment Tracking

Libraries to organize, track, and visualize machine learning experiments.

Tensorboard (🥇36 · ⭐ 5.1K) - TensorFlow's Visualization Toolkit. Apache-2
  • GitHub (👨‍💻 250 · 🔀 1.3K · 📦 49K · 📋 1.4K - 37% open · ⏱️ 24.12.2020):

    git clone https://github.com/tensorflow/tensorboard
  • PyPi (📥 5.3M / month · 📦 3.6K · ⏱️ 12.11.2020):

    pip install tensorboard
  • Conda (📥 1.5M · ⏱️ 12.11.2020):

    conda install -c conda-forge tensorboard
mlflow (🥇33 · ⭐ 8.1K) - Open source platform for the machine learning lifecycle. Apache-2
  • GitHub (👨‍💻 270 · 🔀 1.8K · 📦 1.7K · 📋 1.7K - 39% open · ⏱️ 25.12.2020):

    git clone https://github.com/mlflow/mlflow
  • PyPi (📥 2.6M / month · 📦 150 · ⏱️ 25.12.2020):

    pip install mlflow
  • Conda (📥 190K · ⏱️ 16.12.2020):

    conda install -c conda-forge mlflow
DVC (🥇30 · ⭐ 7K) - Data Version Control | Git for Data & Models. Apache-2
  • GitHub (👨‍💻 200 · 🔀 660 · 📥 16K · 📦 510 · 📋 2.6K - 18% open · ⏱️ 25.12.2020):

    git clone https://github.com/iterative/dvc
  • PyPi (📥 59K / month · 📦 46 · ⏱️ 23.12.2020):

    pip install dvc
  • Conda (📥 430K · ⏱️ 23.12.2020):

    conda install -c conda-forge dvc
SageMaker SDK (🥇30 · ⭐ 1.3K) - A library for training and deploying machine learning.. Apache-2
  • GitHub (👨‍💻 160 · 🔀 550 · 📦 540 · 📋 710 - 27% open · ⏱️ 24.12.2020):

    git clone https://github.com/aws/sagemaker-python-sdk
  • PyPi (📥 420K / month · 📦 28 · ⏱️ 23.12.2020):

    pip install sagemaker
tensorboardX (🥈29 · ⭐ 6.7K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
  • GitHub (👨‍💻 64 · 🔀 770 · 📥 290 · 📦 9.5K · 📋 410 - 17% open · ⏱️ 05.07.2020):

    git clone https://github.com/lanpa/tensorboardX
  • PyPi (📥 280K / month · 📦 1.3K · ⏱️ 31.12.2019):

    pip install tensorboardX
  • Conda (📥 250K · ⏱️ 06.07.2020):

    conda install -c conda-forge tensorboardx
kaggle (🥈29 · ⭐ 3.7K) - Official Kaggle API. Apache-2
  • GitHub (👨‍💻 35 · 🔀 760 · 📦 4.1K · 📋 270 - 60% open · ⏱️ 30.11.2020):

    git clone https://github.com/Kaggle/kaggle-api
  • PyPi (📥 240K / month · 📦 560 · ⏱️ 30.11.2020):

    pip install kaggle
  • Conda (📥 39K · ⏱️ 30.11.2020):

    conda install -c conda-forge kaggle
sacred (🥈29 · ⭐ 3.2K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
  • GitHub (👨‍💻 90 · 🔀 310 · 📦 750 · 📋 480 - 19% open · ⏱️ 15.12.2020):

    git clone https://github.com/IDSIA/sacred
  • PyPi (📥 14K / month · 📦 150 · ⏱️ 14.12.2020):

    pip install sacred
snakemake (🥈29 · ⭐ 780) - This is the development home of the workflow management system.. MIT
  • GitHub (👨‍💻 190 · 🔀 180 · 📦 710 · 📋 510 - 60% open · ⏱️ 21.12.2020):

    git clone https://github.com/snakemake/snakemake
  • PyPi (📥 10K / month · 📦 290 · ⏱️ 21.12.2020):

    pip install snakemake
  • Conda (📥 260K · ⏱️ 22.12.2020):

    conda install -c bioconda snakemake
PyCaret (🥈28 · ⭐ 2.7K) - An open-source, low-code machine learning library in Python. MIT
  • GitHub (👨‍💻 38 · 🔀 570 · 📥 340 · 📦 550 · 📋 660 - 22% open · ⏱️ 22.12.2020):

    git clone https://github.com/pycaret/pycaret
  • PyPi (📥 27K / month · ⏱️ 22.12.2020):

    pip install pycaret
AzureML SDK (🥈28 · ⭐ 2K) - Python notebooks with ML and deep learning examples with Azure.. MIT
  • GitHub (👨‍💻 52 · 🔀 1.5K · 📥 380 · 📋 900 - 5% open · ⏱️ 12.12.2020):

    git clone https://github.com/Azure/MachineLearningNotebooks
  • PyPi (📥 290K / month · 📦 60 · ⏱️ 07.12.2020):

    pip install azureml-sdk
Catalyst (🥈27 · ⭐ 2.3K · 📉) - Accelerated deep learning R&D. Apache-2
  • GitHub (👨‍💻 80 · 🔀 260 · 📦 250 · 📋 280 - 9% open · ⏱️ 25.12.2020):

    git clone https://github.com/catalyst-team/catalyst
  • PyPi (📥 9.2K / month · 📦 38 · ⏱️ 20.12.2020):

    pip install catalyst
Metaflow (🥈26 · ⭐ 3.9K) - Build and manage real-life data science projects with ease. Apache-2
  • GitHub (👨‍💻 28 · 🔀 310 · 📦 98 · 📋 240 - 44% open · ⏱️ 01.12.2020):

    git clone https://github.com/Netflix/metaflow
  • PyPi (📥 39K / month · 📦 1 · ⏱️ 29.10.2020):

    pip install metaflow
  • Conda (📥 9.7K · ⏱️ 12.11.2020):

    conda install -c conda-forge metaflow
TRAINS (🥉25 · ⭐ 2K · 📉) - ClearML - Auto-Magical Suite of tools to streamline your ML.. Apache-2
  • GitHub (👨‍💻 24 · 🔀 240 · 📥 250 · 📦 3 · 📋 230 - 35% open · ⏱️ 25.12.2020):

    git clone https://github.com/allegroai/trains
  • PyPi (📥 160K / month · ⏱️ 10.12.2020):

    pip install trains
  • Dockerhub (📥 30K · ⏱️ 05.10.2020):

    docker pull allegroai/trains
ml-metadata (🥉25 · ⭐ 210 · ➕) - For recording and retrieving metadata associated with ML.. Apache-2
  • GitHub (👨‍💻 11 · 🔀 56 · 📥 1.2K · 📦 60 · 📋 52 - 30% open · ⏱️ 16.12.2020):

    git clone https://github.com/google/ml-metadata
  • PyPi (📥 640K / month · ⏱️ 15.12.2020):

    pip install ml-metadata
TNT (🥉24 · ⭐ 1.3K · 💤) - Simple tools for logging and visualizing, loading and training. BSD-3
  • GitHub (👨‍💻 34 · 🔀 180 · 📦 500 · 📋 64 - 46% open · ⏱️ 15.01.2020):

    git clone https://github.com/pytorch/tnt
  • PyPi (📥 8.3K / month · 📦 50 · ⏱️ 29.07.2018):

    pip install torchnet
VisualDL (🥉23 · ⭐ 3.2K) - Deep Learning Visualization Toolkit. Apache-2
  • GitHub (👨‍💻 26 · 🔀 430 · 📥 57 · 📦 120 · 📋 330 - 6% open · ⏱️ 24.12.2020):

    git clone https://github.com/PaddlePaddle/VisualDL
  • PyPi (📥 4.2K / month · 📦 5 · ⏱️ 26.11.2020):

    pip install visualdl
TensorWatch (🥉22 · ⭐ 3K) - Debugging, monitoring and visualization for Python Machine Learning.. MIT
  • GitHub (👨‍💻 11 · 🔀 310 · 📦 42 · 📋 62 - 77% open · ⏱️ 16.09.2020):

    git clone https://github.com/microsoft/tensorwatch
  • PyPi (📥 1.4K / month · 📦 6 · ⏱️ 04.03.2020):

    pip install tensorwatch
Guild AI (🥉22 · ⭐ 510) - Experiment tracking, ML developer tools. Apache-2
  • GitHub (👨‍💻 16 · 🔀 44 · 📦 25 · 📋 240 - 39% open · ⏱️ 16.12.2020):

    git clone https://github.com/guildai/guildai
  • PyPi (📥 5K / month · ⏱️ 17.12.2020):

    pip install guildai
knockknock (🥉21 · ⭐ 1.9K · 💤) - Knock Knock: Get notified when your training ends with only two.. MIT
  • GitHub (👨‍💻 18 · 🔀 150 · 📦 120 · 📋 33 - 36% open · ⏱️ 16.03.2020):

    git clone https://github.com/huggingface/knockknock
  • PyPi (📥 1.2K / month · 📦 3 · ⏱️ 16.03.2020):

    pip install knockknock
  • Conda (📥 5.2K · ⏱️ 17.03.2020):

    conda install -c conda-forge knockknock
Studio.ml (🥉21 · ⭐ 370) - Studio: Simplify and expedite model building process. Apache-2
  • GitHub (👨‍💻 21 · 🔀 47 · 📦 2 · 📋 240 - 23% open · ⏱️ 17.12.2020):

    git clone https://github.com/studioml/studio
  • PyPi (📥 1.3K / month · ⏱️ 17.12.2020):

    pip install studioml
gokart (🥉21 · ⭐ 150 · ➕) - A wrapper of the data pipeline library luigi. MIT
  • GitHub (👨‍💻 24 · 🔀 30 · 📦 28 · 📋 27 - 33% open · ⏱️ 21.12.2020):

    git clone https://github.com/m3dev/gokart
  • PyPi (📥 870 / month · 📦 2 · ⏱️ 25.02.2020):

    pip install gokart
Labml (🥉20 · ⭐ 320) - Monitor PyTorch & TensorFlow model training on mobile phones. MIT
  • GitHub (👨‍💻 5 · 🔀 27 · 📦 20 · ⏱️ 26.12.2020):

    git clone https://github.com/lab-ml/labml
  • PyPi (📥 1K / month · ⏱️ 22.12.2020):

    pip install labml
MXBoard (🥉19 · ⭐ 330 · 💤) - Logging MXNet data for visualization in TensorBoard. Apache-2
  • GitHub (👨‍💻 9 · 🔀 46 · 📦 90 · 📋 31 - 51% open · ⏱️ 24.01.2020):

    git clone https://github.com/awslabs/mxboard
  • PyPi (📥 2.7K / month · 📦 1 · ⏱️ 22.05.2018):

    pip install mxboard
Show 6 hidden projects...
  • TensorBoard Logger (🥉21 · ⭐ 610 · 💀) - Log TensorBoard events without touching TensorFlow. MIT
  • SKLL (🥉17 · ⭐ 520) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine.. ❗️BSD-1-Clause
  • datmo (🥉17 · ⭐ 330 · 💀) - Open source production model management tool for data scientists. MIT
  • steppy (🥉15 · ⭐ 120 · 💀) - Lightweight, Python library for fast and reproducible experimentation. MIT
  • ModelChimp (🥉14 · ⭐ 120 · 💀) - Experiment tracking for machine and deep learning projects. BSD-2
  • traintool (🥉10 · ⭐ 8 · 🐣) - Train off-the-shelf machine learning models in one line of.. Apache-2

Model Serialization & Conversion

Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.

onnx (🥇33 · ⭐ 9.5K) - Open standard for machine learning interoperability. MIT
  • GitHub (👨‍💻 190 · 🔀 1.7K · 📥 9.4K · 📦 2.3K · 📋 1.4K - 35% open · ⏱️ 20.12.2020):

    git clone https://github.com/onnx/onnx
  • PyPi (📥 330K / month · 📦 300 · ⏱️ 06.11.2020):

    pip install onnx
  • Conda (📥 180K · ⏱️ 10.11.2020):

    conda install -c conda-forge onnx
Core ML Tools (🥈26 · ⭐ 2K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
  • GitHub (👨‍💻 110 · 🔀 320 · 📥 2.2K · 📦 440 · 📋 590 - 48% open · ⏱️ 17.12.2020):

    git clone https://github.com/apple/coremltools
  • PyPi (📥 39K / month · 📦 130 · ⏱️ 10.10.2020):

    pip install coremltools
mmdnn (🥈24 · ⭐ 5.1K) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
  • GitHub (👨‍💻 85 · 🔀 920 · 📥 3.3K · 📦 47 · 📋 590 - 51% open · ⏱️ 14.08.2020):

    git clone https://github.com/Microsoft/MMdnn
  • PyPi (📥 750 / month · 📦 3 · ⏱️ 24.07.2020):

    pip install mmdnn
TorchServe (🥈24 · ⭐ 1.5K) - Model Serving on PyTorch. Apache-2
  • GitHub (👨‍💻 61 · 🔀 210 · 📥 140 · 📦 24 · 📋 520 - 25% open · ⏱️ 24.12.2020):

    git clone https://github.com/pytorch/serve
  • PyPi (📥 1.5K / month · ⏱️ 17.12.2020):

    pip install torchserve
  • Conda (📥 5.5K · ⏱️ 17.12.2020):

    conda install -c pytorch torchserve
  • Dockerhub (📥 14K · ⭐ 3 · ⏱️ 18.12.2020):

    docker pull pytorch/torchserve
cortex (🥉21 · ⭐ 7.1K) - Run inference at scale. Apache-2
  • GitHub (👨‍💻 18 · 🔀 540 · 📋 850 - 23% open · ⏱️ 25.12.2020):

    git clone https://github.com/cortexlabs/cortex
  • PyPi (📥 800 / month · ⏱️ 23.12.2020):

    pip install cortex
Hummingbird (🥉19 · ⭐ 1.9K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
  • GitHub (👨‍💻 17 · 🔀 140 · 📥 84 · 📦 12 · 📋 170 - 26% open · ⏱️ 23.12.2020):

    git clone https://github.com/microsoft/hummingbird
  • PyPi (📥 700 / month · ⏱️ 30.10.2020):

    pip install hummingbird-ml
pytorch2keras (🥉18 · ⭐ 640 · 💤) - PyTorch to Keras model convertor. MIT
  • GitHub (👨‍💻 11 · 🔀 110 · 📦 16 · 📋 100 - 37% open · ⏱️ 14.05.2020):

    git clone https://github.com/nerox8664/pytorch2keras
  • PyPi (📥 580 / month · 📦 1 · ⏱️ 14.05.2020):

    pip install pytorch2keras
sklearn-porter (🥉17 · ⭐ 940 · 💤) - Transpile trained scikit-learn estimators to C, Java,.. MIT
  • GitHub (👨‍💻 11 · 🔀 120 · 📋 63 - 53% open · ⏱️ 18.12.2019):

    git clone https://github.com/nok/sklearn-porter
  • PyPi (📥 520 / month · 📦 1 · ⏱️ 18.12.2019):

    pip install sklearn-porter
Show 1 hidden projects...

Model Interpretability

Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.

shap (🥇33 · ⭐ 11K · 📈) - A game theoretic approach to explain the output of any machine learning.. MIT
  • GitHub (👨‍💻 140 · 🔀 1.6K · 📦 1.8K · 📋 1.4K - 63% open · ⏱️ 24.12.2020):

    git clone https://github.com/slundberg/shap
  • PyPi (📥 710K / month · 📦 140 · ⏱️ 04.11.2020):

    pip install shap
  • Conda (📥 340K · ⏱️ 16.12.2020):

    conda install -c conda-forge shap
Lime (🥇29 · ⭐ 8.3K) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
  • GitHub (👨‍💻 56 · 🔀 1.3K · 📦 960 · 📋 480 - 9% open · ⏱️ 18.12.2020):

    git clone https://github.com/marcotcr/lime
  • PyPi (📥 170K / month · 📦 130 · ⏱️ 03.04.2020):

    pip install lime
  • Conda (📥 61K · ⏱️ 28.06.2020):

    conda install -c conda-forge lime
eli5 (🥇28 · ⭐ 2.2K · 💤) - A library for debugging/inspecting machine learning classifiers and.. MIT
  • GitHub (👨‍💻 14 · 🔀 280 · 📦 800 · 📋 230 - 53% open · ⏱️ 22.01.2020):

    git clone https://github.com/TeamHG-Memex/eli5
  • PyPi (📥 240K / month · 📦 96 · ⏱️ 29.08.2019):

    pip install eli5
  • Conda (📥 85K · ⏱️ 15.06.2020):

    conda install -c conda-forge eli5
pyLDAvis (🥇28 · ⭐ 1.4K · 📈) - Python library for interactive topic model visualization... BSD-3
  • GitHub (👨‍💻 31 · 🔀 290 · 📦 1.6K · 📋 150 - 61% open · ⏱️ 02.12.2020):

    git clone https://github.com/bmabey/pyLDAvis
  • PyPi (📥 67K / month · 📦 99 · ⏱️ 05.06.2018):

    pip install pyldavis
  • Conda (📥 21K · ⏱️ 22.06.2018):

    conda install -c conda-forge pyldavis
InterpretML (🥈27 · ⭐ 3.3K) - Fit interpretable models. Explain blackbox machine learning. MIT
  • GitHub (👨‍💻 21 · 🔀 410 · 📦 58 · 📋 170 - 29% open · ⏱️ 21.12.2020):

    git clone https://github.com/interpretml/interpret
  • PyPi (📥 12K / month · 📦 2 · ⏱️ 20.10.2020):

    pip install interpret
Model Analysis (🥈27 · ⭐ 1K) - Model analysis tools for TensorFlow. Apache-2
  • GitHub (👨‍💻 33 · 🔀 200 · 📋 54 - 40% open · ⏱️ 23.12.2020):

    git clone https://github.com/tensorflow/model-analysis
  • PyPi (📥 5.1M / month · 📦 10 · ⏱️ 16.12.2020):

    pip install tensorflow-model-analysis
yellowbrick (🥈26 · ⭐ 3K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2
  • GitHub (👨‍💻 95 · 🔀 460 · 📋 590 - 14% open · ⏱️ 01.11.2020):

    git clone https://github.com/DistrictDataLabs/yellowbrick
  • PyPi (📥 63K / month · 📦 62 · ⏱️ 09.10.2020):

    pip install yellowbrick
Captum (🥈26 · ⭐ 2K) - Model interpretability and understanding for PyTorch. BSD-3
  • GitHub (👨‍💻 66 · 🔀 190 · 📦 94 · 📋 170 - 26% open · ⏱️ 23.12.2020):

    git clone https://github.com/pytorch/captum
  • PyPi (📥 8.2K / month · ⏱️ 13.11.2020):

    pip install captum
Lucid (🥈24 · ⭐ 4K) - A collection of infrastructure and tools for research in neural.. Apache-2
  • GitHub (👨‍💻 34 · 🔀 550 · 📦 370 · 📋 160 - 41% open · ⏱️ 24.11.2020):

    git clone https://github.com/tensorflow/lucid
  • PyPi (📥 600 / month · 📦 6 · ⏱️ 20.12.2018):

    pip install lucid
DoWhy (🥈24 · ⭐ 2.5K) - DoWhy is a Python library for causal inference that supports explicit.. MIT
  • GitHub (👨‍💻 37 · 🔀 350 · 📥 17 · 📦 23 · 📋 90 - 14% open · ⏱️ 24.12.2020):

    git clone https://github.com/Microsoft/dowhy
  • PyPi (📥 7.9K / month · ⏱️ 12.12.2020):

    pip install dowhy
  • Conda (📥 820 · ⏱️ 13.12.2020):

    conda install -c conda-forge dowhy
Fairness 360 (🥈24 · ⭐ 1.2K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 380 · 📦 60 · 📋 81 - 43% open · ⏱️ 28.10.2020):

    git clone https://github.com/Trusted-AI/AIF360
  • PyPi (📥 3.2K / month · 📦 2 · ⏱️ 02.06.2020):

    pip install aif360
keras-vis (🥈23 · ⭐ 2.8K · 💤) - Neural network visualization toolkit for keras. MIT
  • GitHub (👨‍💻 10 · 🔀 560 · 📦 630 · 📋 210 - 52% open · ⏱️ 20.04.2020):

    git clone https://github.com/raghakot/keras-vis
  • PyPi (📥 2.6K / month · 📦 28 · ⏱️ 06.07.2017):

    pip install keras-vis
keract (🥈23 · ⭐ 840) - Activation Maps (Layers Outputs) and Gradients in Keras. MIT
  • GitHub (👨‍💻 15 · 🔀 170 · 📦 63 · 📋 74 - 2% open · ⏱️ 22.12.2020):

    git clone https://github.com/philipperemy/keract
  • PyPi (📥 1.4K / month · 📦 5 · ⏱️ 22.12.2020):

    pip install keract
Alibi (🥉22 · ⭐ 820) - Algorithms for monitoring and explaining machine learning models. Apache-2
  • GitHub (👨‍💻 11 · 🔀 98 · 📦 53 · 📋 130 - 46% open · ⏱️ 02.12.2020):

    git clone https://github.com/SeldonIO/alibi
  • PyPi (📥 3.1K / month · 📦 9 · ⏱️ 20.10.2020):

    pip install alibi
Explainability 360 (🥉22 · ⭐ 740) - Interpretability and explainability of data and machine.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 160 · 📦 12 · 📋 44 - 59% open · ⏱️ 04.12.2020):

    git clone https://github.com/Trusted-AI/AIX360
  • PyPi (📥 560 / month · 📦 1 · ⏱️ 28.10.2020):

    pip install aix360
TreeInterpreter (🥉22 · ⭐ 630) - Package for interpreting scikit-learn's decision tree and.. BSD-3
  • GitHub (👨‍💻 10 · 🔀 130 · 📦 110 · 📋 19 - 94% open · ⏱️ 22.06.2020):

    git clone https://github.com/andosa/treeinterpreter
  • PyPi (📥 69K / month · 📦 4 · ⏱️ 16.12.2018):

    pip install treeinterpreter
tf-explain (🥉21 · ⭐ 740) - Interpretability Methods for tf.keras models with Tensorflow 2.x. MIT
  • GitHub (👨‍💻 11 · 🔀 66 · 📦 56 · 📋 74 - 44% open · ⏱️ 24.07.2020):

    git clone https://github.com/sicara/tf-explain
  • PyPi (📥 1.1K / month · 📦 1 · ⏱️ 14.02.2020):

    pip install tf-explain
iNNvestigate (🥉20 · ⭐ 740) - A toolbox to iNNvestigate neural networks' predictions!. BSD-2
  • GitHub (👨‍💻 18 · 🔀 170 · 📦 41 · 📋 200 - 36% open · ⏱️ 16.10.2020):

    git clone https://github.com/albermax/innvestigate
  • PyPi (📥 480 / month · 📦 1 · ⏱️ 14.11.2020):

    pip install innvestigate
sklearn-evaluation (🥉20 · ⭐ 280) - Machine learning model evaluation made easy: plots,.. MIT
  • GitHub (👨‍💻 5 · 🔀 25 · 📦 26 · 📋 34 - 20% open · ⏱️ 24.12.2020):

    git clone https://github.com/edublancas/sklearn-evaluation
  • PyPi (📥 290 / month · 📦 2 · ⏱️ 15.12.2020):

    pip install sklearn-evaluation
What-If Tool (🥉18 · ⭐ 400) - Source code/webpage/demos for the What-If Tool. Apache-2
  • GitHub (👨‍💻 18 · 🔀 87 · 📋 68 - 52% open · ⏱️ 22.12.2020):

    git clone https://github.com/PAIR-code/what-if-tool
  • PyPi (📥 2.2K / month · ⏱️ 28.06.2020):

    pip install witwidget
  • NPM (📥 2K / month · ⏱️ 03.11.2020):

    npm install wit-widget
DiCE (🥉17 · ⭐ 410) - Generate Diverse Counterfactual Explanations for any machine.. MIT
  • GitHub (👨‍💻 8 · 🔀 63 · 📋 30 - 40% open · ⏱️ 15.12.2020):

    git clone https://github.com/interpretml/DiCE
  • PyPi (📥 170 / month · ⏱️ 22.09.2020):

    pip install dice-ml
LOFO (🥉17 · ⭐ 290) - Leave One Feature Out Importance. MIT
  • GitHub (👨‍💻 3 · 🔀 40 · 📦 3 · 📋 11 - 9% open · ⏱️ 02.07.2020):

    git clone https://github.com/aerdem4/lofo-importance
  • PyPi (📥 220 / month · ⏱️ 02.07.2020):

    pip install lofo-importance
ExplainX.ai (🥉17 · ⭐ 150 · 🐣) - Explainable AI framework for data scientists. Explain & debug.. MIT
  • GitHub (👨‍💻 4 · 🔀 27 · 📋 19 - 36% open · ⏱️ 23.12.2020):

    git clone https://github.com/explainX/explainx
  • PyPi (📥 830 / month · ⏱️ 07.12.2020):

    pip install explainx
LIT (🥉16 · ⭐ 2.3K · 🐣) - The Language Interpretability Tool: Interactively analyze NLP.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 220 · 📋 52 - 48% open · ⏱️ 17.11.2020):

    git clone https://github.com/PAIR-code/lit
  • PyPi (📥 260 / month · ⏱️ 17.11.2020):

    pip install lit-nlp
FlashTorch (🥉16 · ⭐ 520 · 💤) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT
  • GitHub (👨‍💻 2 · 🔀 65 · 📦 6 · 📋 27 - 22% open · ⏱️ 29.05.2020):

    git clone https://github.com/MisaOgura/flashtorch
  • PyPi (📥 210 / month · ⏱️ 29.05.2020):

    pip install flashtorch
Anchor (🥉14 · ⭐ 610) - Code for High-Precision Model-Agnostic Explanations paper. BSD-2
  • GitHub (👨‍💻 8 · 🔀 87 · 📋 52 - 15% open · ⏱️ 10.09.2020):

    git clone https://github.com/marcotcr/anchor
  • PyPi (📥 610 / month · ⏱️ 26.06.2020):

    pip install anchor_exp
Show 4 hidden projects...
  • scikit-plot (🥈23 · ⭐ 2K · 💀) - An intuitive library to add plotting functionality to scikit-.. MIT
  • Skater (🥉21 · ⭐ 960) - Python Library for Model Interpretation/Explanations. ❗️UPL-1.0
  • XAI (🥉16 · ⭐ 540 · 💀) - XAI - An eXplainability toolbox for machine learning. MIT
  • Attribution Priors (🥉12 · ⭐ 71) - Tools for training explainable models using attribution.. MIT

Vector Similarity Search (ANN)

Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search. For an awesome comparison, refer to this benchmark.
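
For orientation, the sketch below shows the exact (brute-force) search that approximate nearest neighbor libraries trade accuracy against. It uses only the Python standard library; the function names `cosine_similarity` and `exact_top_k` and the toy vectors are hypothetical and are not part of any library listed here. The libraries below avoid this linear scan by building an index.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def exact_top_k(query, vectors, k=2):
    """Return the indices of the k vectors most similar to query, best first.

    This scans every vector, so the cost grows linearly with the collection size.
    """
    scored = ((cosine_similarity(query, v), i) for i, v in enumerate(vectors))
    return [i for _, i in heapq.nlargest(k, scored)]


# Hypothetical toy data: four 2-dimensional vectors.
vectors = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]]
nearest = exact_top_k([1.0, 0.05], vectors, k=2)
print(nearest)
```

An exact baseline like this is also a common way to measure the recall of an approximate index on a small sample.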

Faiss (🥇29 · ⭐ 12K) - A library for efficient similarity search and clustering of dense vectors. MIT
  • GitHub (👨‍💻 71 · 🔀 2.1K · 📦 290 · 📋 1.3K - 7% open · ⏱️ 25.12.2020):

    git clone https://github.com/facebookresearch/faiss
  • Conda (📥 19K · ⏱️ 12.12.2020):

    conda install -c conda-forge faiss
Annoy (🥇29 · ⭐ 8K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
  • GitHub (👨‍💻 68 · 🔀 850 · 📦 1.4K · 📋 290 - 8% open · ⏱️ 03.12.2020):

    git clone https://github.com/spotify/annoy
  • PyPi (📥 210K / month · 📦 360 · ⏱️ 18.09.2020):

    pip install annoy
NMSLIB (🥈27 · ⭐ 2.2K) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 330 · 📦 300 · 📋 350 - 12% open · ⏱️ 03.11.2020):

    git clone https://github.com/nmslib/nmslib
  • PyPi (📥 45K / month · 📦 52 · ⏱️ 16.04.2020):

    pip install nmslib
  • Conda (📥 7.9K · ⏱️ 27.11.2020):

    conda install -c conda-forge nmslib
Milvus (🥈25 · ⭐ 4.8K · ➕) - An open source embedding vector similarity search engine.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 740 · 📋 2K - 11% open · ⏱️ 26.12.2020):

    git clone https://github.com/milvus-io/milvus
  • PyPi (📥 5.5K / month · 📦 6 · ⏱️ 16.10.2020):

    pip install pymilvus
  • Dockerhub (📥 210K · ⭐ 8 · ⏱️ 02.12.2020):

    docker pull milvusdb/milvus
hnswlib (🥈23 · ⭐ 1.3K) - Header-only C++/python library for fast approximate nearest.. Apache-2
  • GitHub (👨‍💻 32 · 🔀 230 · 📋 180 - 43% open · ⏱️ 18.08.2020):

    git clone https://github.com/nmslib/hnswlib
  • PyPi (📥 12K / month · 📦 2 · ⏱️ 13.07.2020):

    pip install hnswlib
Magnitude (🥉21 · ⭐ 1.4K) - A fast, efficient universal vector embedding utility package. MIT
  • GitHub (👨‍💻 4 · 🔀 98 · 📦 170 · 📋 74 - 32% open · ⏱️ 17.07.2020):

    git clone https://github.com/plasticityai/magnitude
  • PyPi (📥 5.5K / month · 📦 14 · ⏱️ 25.05.2020):

    pip install pymagnitude
PyNNDescent (🥉21 · ⭐ 350) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
  • GitHub (👨‍💻 10 · 🔀 43 · 📋 53 - 49% open · ⏱️ 28.11.2020):

    git clone https://github.com/lmcinnes/pynndescent
  • PyPi (📥 7.7K / month · 📦 3 · ⏱️ 19.11.2020):

    pip install pynndescent
  • Conda (📥 20K · ⏱️ 19.11.2020):

    conda install -c conda-forge pynndescent
N2 (🥉20 · ⭐ 440) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 51 · 📦 12 · 📋 38 - 44% open · ⏱️ 21.10.2020):

    git clone https://github.com/kakao/n2
  • PyPi (📥 440 / month · 📦 2 · ⏱️ 16.10.2020):

    pip install n2
NGT (🥉19 · ⭐ 600) - Nearest Neighbor Search with Neighborhood Graph and Tree for High-.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 67 · 📋 73 - 9% open · ⏱️ 17.11.2020):

    git clone https://github.com/yahoojapan/NGT
  • PyPi (📥 3.3K / month · 📦 3 · ⏱️ 23.09.2020):

    pip install ngt
Show 2 hidden projects...
  • NearPy (🥉20 · ⭐ 660 · 💀) - Python framework for fast (approximated) nearest neighbour search in.. MIT
  • PySparNN (🥉12 · ⭐ 840 · 💀) - Approximate Nearest Neighbor Search for Sparse Data in Python!. BSD-3

Probabilistic Methods

Libraries providing capabilities for probabilistic programming/reasoning, Bayesian inference, and Gaussian processes.

PyMC3 (🥇32 · ⭐ 5.4K) - Probabilistic Programming in Python: Bayesian Modeling and.. Apache-2
  • GitHub (👨‍💻 300 · 🔀 1.3K · 📥 140 · 📦 1.9K · 📋 2.1K - 7% open · ⏱️ 26.12.2020):

    git clone https://github.com/pymc-devs/pymc3
  • PyPi (📥 120K / month · 📦 290 · ⏱️ 07.12.2020):

    pip install pymc3
  • Conda (📥 240K · ⏱️ 07.12.2020):

    conda install -c conda-forge pymc3
tensorflow-probability (🥇31 · ⭐ 3.2K · 📈) - Probabilistic reasoning and statistical analysis in.. Apache-2
  • GitHub (👨‍💻 400 · 🔀 840 · 📦 1 · 📋 930 - 45% open · ⏱️ 23.12.2020):

    git clone https://github.com/tensorflow/probability
  • PyPi (📥 270K / month · 📦 250 · ⏱️ 09.12.2020):

    pip install tensorflow-probability
  • Conda (📥 28K · ⏱️ 13.03.2020):

    conda install -c conda-forge tensorflow-probability
Pyro (🥈28 · ⭐ 6.6K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
  • GitHub (👨‍💻 100 · 🔀 780 · 📦 350 · 📋 810 - 16% open · ⏱️ 21.12.2020):

    git clone https://github.com/pyro-ppl/pyro
  • PyPi (📥 10K / month · 📦 36 · ⏱️ 17.11.2020):

    pip install pyro-ppl
GPyTorch (🥈28 · ⭐ 2.2K · 📈) - A highly efficient and modular implementation of Gaussian.. MIT
  • GitHub (👨‍💻 62 · 🔀 310 · 📦 180 · 📋 800 - 23% open · ⏱️ 18.12.2020):

    git clone https://github.com/cornellius-gp/gpytorch
  • PyPi (📥 33K / month · 📦 8 · ⏱️ 30.11.2020):

    pip install gpytorch
pomegranate (🥈27 · ⭐ 2.5K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
  • GitHub (👨‍💻 60 · 🔀 460 · 📦 360 · 📋 560 - 6% open · ⏱️ 12.12.2020):

    git clone https://github.com/jmschrei/pomegranate
  • PyPi (📥 20K / month · 📦 28 · ⏱️ 12.07.2020):

    pip install pomegranate
  • Conda (📥 41K · ⏱️ 01.11.2020):

    conda install -c conda-forge pomegranate
GPflow (🥈27 · ⭐ 1.4K) - Gaussian processes in TensorFlow. Apache-2
  • GitHub (👨‍💻 68 · 🔀 390 · 📦 210 · 📋 680 - 12% open · ⏱️ 01.12.2020):

    git clone https://github.com/GPflow/GPflow
  • PyPi (📥 4.9K / month · 📦 17 · ⏱️ 01.12.2020):

    pip install gpflow
  • Conda (📥 7.9K · ⏱️ 06.11.2018):

    conda install -c conda-forge gpflow
pgmpy (🥉24 · ⭐ 1.7K) - Python Library for learning (Structure and Parameter) and inference.. MIT
  • GitHub (👨‍💻 88 · 🔀 540 · 📥 36 · 📦 170 · 📋 710 - 31% open · ⏱️ 19.12.2020):

    git clone https://github.com/pgmpy/pgmpy
  • PyPi (📥 20K / month · 📦 6 · ⏱️ 30.06.2020):

    pip install pgmpy
pyhsmm (🥉18 · ⭐ 470) - Bayesian inference in HSMMs and HMMs. MIT
  • GitHub (👨‍💻 13 · 🔀 150 · 📦 17 · 📋 93 - 35% open · ⏱️ 24.08.2020):

    git clone https://github.com/mattjj/pyhsmm
  • PyPi (📥 75 / month · 📦 4 · ⏱️ 10.05.2017):

    pip install pyhsmm
Baal (🥉18 · ⭐ 300) - Using approximate bayesian posteriors in deep nets for active learning. Apache-2
  • GitHub (👨‍💻 8 · 🔀 24 · 📦 6 · 📋 34 - 35% open · ⏱️ 11.12.2020):

    git clone https://github.com/ElementAI/baal
  • PyPi (📥 180 / month · ⏱️ 03.11.2020):

    pip install baal
Funsor (🥉18 · ⭐ 160) - Functional tensors for probabilistic programming. Apache-2
  • GitHub (👨‍💻 9 · 🔀 15 · 📦 7 · 📋 110 - 50% open · ⏱️ 21.12.2020):

    git clone https://github.com/pyro-ppl/funsor
  • PyPi (📥 200 / month · ⏱️ 15.10.2020):

    pip install funsor
Orbit (🥉17 · ⭐ 290) - Bayesian forecasting with object-oriented design and probabilistic.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 12 · 📦 2 · 📋 160 - 13% open · ⏱️ 25.11.2020):

    git clone https://github.com/uber/orbit
  • PyPi (📥 170 / month · ⏱️ 15.11.2020):

    pip install orbit-ml
Show 3 hidden projects...
  • PyStan (🥈27 · ⭐ 900) - PyStan, the Python interface to Stan. ❗️GPL-3.0
  • Edward (🥉24 · ⭐ 4.6K · 💀) - A probabilistic programming language in TensorFlow. Deep.. Apache-2
  • ZhuSuan (🥉14 · ⭐ 2K · 💀) - A probabilistic programming library for Bayesian deep learning,.. MIT

Adversarial Robustness

Back to top

Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.

Foolbox (🥇26 · ⭐ 1.7K) - A Python toolbox to create adversarial examples that fool neural.. MIT
  • GitHub (👨‍💻 30 · 🔀 310 · 📦 160 · 📋 300 - 12% open · ⏱️ 20.12.2020):

    git clone https://github.com/bethgelab/foolbox
  • PyPi (📥 1.3K / month · 📦 18 · ⏱️ 26.09.2020):

    pip install foolbox
CleverHans (🥈25 · ⭐ 4.9K) - An adversarial example library for constructing attacks,.. MIT
  • GitHub (👨‍💻 110 · 🔀 1.2K · 📦 170 · 📋 450 - 16% open · ⏱️ 09.07.2020):

    git clone https://github.com/cleverhans-lab/cleverhans
  • PyPi (📥 540 / month · 📦 8 · ⏱️ 27.11.2018):

    pip install cleverhans
TextAttack (🥈24 · ⭐ 1.1K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
  • GitHub (👨‍💻 30 · 🔀 120 · 📦 11 · 📋 95 - 25% open · ⏱️ 24.12.2020):

    git clone https://github.com/QData/TextAttack
  • PyPi (📥 1.3K / month · ⏱️ 18.11.2020):

    pip install textattack
ART (🥉23 · ⭐ 1.9K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
  • GitHub (👨‍💻 58 · 🔀 550 · 📦 94 · 📋 370 - 13% open · ⏱️ 19.12.2020):

    git clone https://github.com/Trusted-AI/adversarial-robustness-toolbox
  • PyPi (📥 1.9K / month · ⏱️ 01.12.2020):

    pip install adversarial-robustness-toolbox
AdvBox (🥉17 · ⭐ 1K) - Advbox is a toolbox to generate adversarial examples that fool neural.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 220 · 📋 32 - 18% open · ⏱️ 26.08.2020):

    git clone https://github.com/advboxes/AdvBox
  • PyPi (📥 21 / month · ⏱️ 05.12.2018):

    pip install advbox
Show 2 hidden projects...
  • advertorch (🥉18 · ⭐ 780) - A Toolbox for Adversarial Robustness Research. ❗️GPL-3.0
  • Adversary (🥉13 · ⭐ 340 · 💀) - Tool to generate adversarial text examples and test machine.. MIT

GPU Utilities

Back to top

Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.

CuPy (🥇30 · ⭐ 4.7K) - A NumPy-compatible array library accelerated by CUDA. MIT
  • GitHub (👨‍💻 250 · 🔀 430 · 📥 3.7K · 📦 630 · 📋 1.2K - 30% open · ⏱️ 26.12.2020):

    git clone https://github.com/cupy/cupy
  • PyPi (📥 8.6K / month · 📦 190 · ⏱️ 25.12.2020):

    pip install cupy
  • Conda (📥 320K · ⏱️ 29.10.2020):

    conda install -c conda-forge cupy
  • Dockerhub (📥 48K · ⭐ 6 · ⏱️ 26.12.2020):

    docker pull cupy/cupy
PyCUDA (🥇27 · ⭐ 1K) - CUDA integration for Python, plus shiny features. MIT
  • GitHub (👨‍💻 68 · 🔀 210 · 📦 700 · 📋 160 - 24% open · ⏱️ 20.12.2020):

    git clone https://github.com/inducer/pycuda
  • PyPi (📥 15K / month · 📦 290 · ⏱️ 07.10.2020):

    pip install pycuda
Apex (🥈23 · ⭐ 4.9K) - A PyTorch Extension: Tools for easy mixed precision and distributed.. BSD-3
  • GitHub (👨‍💻 77 · 🔀 660 · 📦 410 · 📋 780 - 54% open · ⏱️ 17.12.2020):

    git clone https://github.com/NVIDIA/apex
  • Conda (📥 28K · ⏱️ 23.11.2020):

    conda install -c conda-forge nvidia-apex
py3nvml (🥈23 · ⭐ 160 · 💀) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside.. BSD-3
  • GitHub (👨‍💻 6 · 🔀 24 · 📦 210 · 📋 10 - 20% open · ⏱️ 23.04.2020):

    git clone https://github.com/fbcotter/py3nvml
  • PyPi (📥 81K / month · 📦 20 · ⏱️ 06.04.2020):

    pip install py3nvml
  • Conda (📥 11K · ⏱️ 10.10.2020):

    conda install -c conda-forge py3nvml
ArrayFire (🥈22 · ⭐ 3.3K) - ArrayFire: a general purpose GPU library. BSD-3
  • GitHub (👨‍💻 80 · 🔀 450 · 📥 480 · 📋 1.4K - 17% open · ⏱️ 23.12.2020):

    git clone https://github.com/arrayfire/arrayfire
  • PyPi (📥 500 / month · 📦 2 · ⏱️ 18.10.2018):

    pip install arrayfire
scikit-cuda (🥈21 · ⭐ 790) - Python interface to GPU-powered libraries. BSD-3
  • GitHub (👨‍💻 44 · 🔀 150 · 📦 110 · 📋 200 - 18% open · ⏱️ 13.07.2020):

    git clone https://github.com/lebedov/scikit-cuda
  • PyPi (📥 660 / month · 📦 31 · ⏱️ 27.05.2019):

    pip install scikit-cuda
cuDF (🥉20 · ⭐ 3.6K) - cuDF - GPU DataFrame Library. Apache-2
  • GitHub (👨‍💻 200 · 🔀 470 · 📋 3.2K - 17% open · ⏱️ 23.12.2020):

    git clone https://github.com/rapidsai/cudf
  • PyPi (📥 830 / month · 📦 2 · ⏱️ 01.06.2020):

    pip install cudf
DALI (🥉20 · ⭐ 3K) - A library containing both highly optimized building blocks and an.. Apache-2
  • GitHub (👨‍💻 53 · 🔀 360 · 📋 760 - 23% open · ⏱️ 18.12.2020):

    git clone https://github.com/NVIDIA/DALI
cuML (🥉19 · ⭐ 1.9K) - cuML - RAPIDS Machine Learning Library. Apache-2
  • GitHub (👨‍💻 120 · 🔀 300 · 📋 1.5K - 42% open · ⏱️ 17.12.2020):

    git clone https://github.com/rapidsai/cuml
  • PyPi (📥 350 / month · ⏱️ 11.04.2019):

    pip install cuml
BlazingSQL (🥉17 · ⭐ 1.4K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for.. Apache-2
  • GitHub (👨‍💻 39 · 🔀 130 · 📋 560 - 42% open · ⏱️ 23.12.2020):

    git clone https://github.com/BlazingDB/blazingsql
  • Conda (📥 910 · ⏱️ 11.11.2019):

    conda install -c blazingsql blazingsql-protocol
cuGraph (🥉16 · ⭐ 600) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
  • GitHub (👨‍💻 52 · 🔀 130 · 📋 540 - 17% open · ⏱️ 17.12.2020):

    git clone https://github.com/rapidsai/cugraph
  • PyPi (📥 62 / month · ⏱️ 04.04.2019):

    pip install cugraph
SpeedTorch (🥉16 · ⭐ 590 · 💀) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
  • GitHub (👨‍💻 3 · 🔀 35 · 📦 3 · 📋 5 - 60% open · ⏱️ 21.02.2020):

    git clone https://github.com/Santosh-Gupta/SpeedTorch
  • PyPi (📥 140 / month · 📦 2 · ⏱️ 06.01.2020):

    pip install SpeedTorch
cuSignal (🥉15 · ⭐ 430) - GPU accelerated signal processing. Apache-2
  • GitHub (👨‍💻 27 · 🔀 56 · 📋 97 - 17% open · ⏱️ 15.12.2020):

    git clone https://github.com/rapidsai/cusignal
Show 2 hidden projects...
  • ipyexperiments (🥉17 · ⭐ 120) - jupyter/ipython experiment containers for GPU and general.. Apache-2
  • nvidia-ml-py3 (🥉17 · ⭐ 59 · 💀) - Python 3 Bindings for the NVIDIA Management Library. BSD-3

Tensorflow Utilities

Back to top

Libraries that extend TensorFlow with general-purpose capabilities.

tensor2tensor (🥇32 · ⭐ 11K) - Library of deep learning models and datasets designed to.. Apache-2
  • GitHub (👨‍💻 230 · 🔀 2.7K · 📦 870 · 📋 1.2K - 45% open · ⏱️ 16.11.2020):

    git clone https://github.com/tensorflow/tensor2tensor
  • PyPi (📥 62K / month · 📦 82 · ⏱️ 17.06.2020):

    pip install tensor2tensor
tensorflow-hub (🥇32 · ⭐ 2.7K) - A library for transfer learning by reusing parts of.. Apache-2
  • GitHub (👨‍💻 65 · 🔀 1.4K · 📦 4.8K · 📋 530 - 8% open · ⏱️ 16.12.2020):

    git clone https://github.com/tensorflow/hub
  • PyPi (📥 1.1M / month · 📦 310 · ⏱️ 29.10.2020):

    pip install tensorflow-hub
  • Conda (📥 48K · ⏱️ 24.08.2020):

    conda install -c conda-forge tensorflow-hub
TF Addons (🥈30 · ⭐ 1.1K) - Useful extra functionality for TensorFlow 2.x maintained by.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 390 · 📦 1.8K · 📋 760 - 20% open · ⏱️ 23.12.2020):

    git clone https://github.com/tensorflow/addons
  • PyPi (📥 270K / month · 📦 18 · ⏱️ 23.12.2020):

    pip install tensorflow-addons
TensorFlow Transform (🥈29 · ⭐ 840) - Input pipeline framework. Apache-2
  • GitHub (👨‍💻 27 · 🔀 160 · 📦 380 · 📋 140 - 22% open · ⏱️ 16.12.2020):

    git clone https://github.com/tensorflow/transform
  • PyPi (📥 5.9M / month · 📦 84 · ⏱️ 16.12.2020):

    pip install tensorflow-transform
TF Model Optimization (🥈26 · ⭐ 940) - A toolkit to optimize ML models for deployment for Keras.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 210 · 📦 490 · 📋 160 - 44% open · ⏱️ 23.11.2020):

    git clone https://github.com/tensorflow/model-optimization
  • PyPi (📥 58K / month · 📦 3 · ⏱️ 14.09.2020):

    pip install tensorflow-model-optimization
TensorFlow I/O (🥉25 · ⭐ 400) - Dataset, streaming, and file system extensions maintained.. Apache-2
  • GitHub (👨‍💻 70 · 🔀 140 · 📦 85 · 📋 380 - 31% open · ⏱️ 26.12.2020):

    git clone https://github.com/tensorflow/io
  • PyPi (📥 36K / month · ⏱️ 15.12.2020):

    pip install tensorflow-io
TensorFlow Cloud (🥉23 · ⭐ 210) - The TensorFlow Cloud repository provides APIs that will.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 41 · 📦 8 · 📋 66 - 62% open · ⏱️ 04.12.2020):

    git clone https://github.com/tensorflow/cloud
  • PyPi (📥 490K / month · ⏱️ 03.12.2020):

    pip install tensorflow-cloud
Neural Structured Learning (🥉22 · ⭐ 760) - Training neural models with structured signals. Apache-2
  • GitHub (👨‍💻 25 · 🔀 140 · 📦 15 · 📋 43 - 11% open · ⏱️ 14.12.2020):

    git clone https://github.com/tensorflow/neural-structured-learning
  • PyPi (📥 2.4K / month · ⏱️ 18.08.2020):

    pip install neural-structured-learning
TensorNets (🥉20 · ⭐ 960) - High level network definitions with pre-trained weights in.. MIT
  • GitHub (👨‍💻 6 · 🔀 180 · 📦 29 · 📋 56 - 25% open · ⏱️ 03.10.2020):

    git clone https://github.com/taehoonlee/tensornets
  • PyPi (📥 280 / month · 📦 2 · ⏱️ 31.03.2020):

    pip install tensornets
Saliency (🥉18 · ⭐ 610) - TensorFlow implementation for SmoothGrad, Grad-CAM, Guided.. Apache-2
  • GitHub (👨‍💻 12 · 🔀 130 · 📦 13 · 📋 21 - 52% open · ⏱️ 01.10.2020):

    git clone https://github.com/PAIR-code/saliency
  • PyPi (📥 160 / month · ⏱️ 26.09.2019):

    pip install saliency
TF Compression (🥉15 · ⭐ 400) - Data compression in TensorFlow. Apache-2
  • GitHub (👨‍💻 9 · 🔀 140 · 📋 54 - 5% open · ⏱️ 17.12.2020):

    git clone https://github.com/tensorflow/compression
  • PyPi (📥 560 / month · 📦 1 · ⏱️ 04.12.2020):

    pip install tensorflow-compression

Pytorch Utilities

Back to top

Libraries that extend Pytorch with general-purpose capabilities.

pretrainedmodels (🥇27 · ⭐ 7.6K · 💀) - Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet,.. BSD-3
  • GitHub (👨‍💻 22 · 🔀 1.6K · 📦 790 · 📋 170 - 49% open · ⏱️ 16.04.2020):

    git clone https://github.com/Cadene/pretrained-models.pytorch
  • PyPi (📥 23K / month · 📦 100 · ⏱️ 29.10.2018):

    pip install pretrainedmodels
EfficientNet-PyTorch (🥇24 · ⭐ 5.2K · ➕) - A PyTorch implementation of EfficientNet. Apache-2
  • GitHub (👨‍💻 21 · 🔀 1K · 📥 270K · 📋 220 - 46% open · ⏱️ 09.10.2020):

    git clone https://github.com/lukemelas/EfficientNet-PyTorch
  • PyPi (📥 32K / month · 📦 7 · ⏱️ 26.08.2020):

    pip install efficientnet-pytorch
torchdiffeq (🥇24 · ⭐ 3.3K) - Differentiable ODE solvers with full GPU support and.. MIT
  • GitHub (👨‍💻 20 · 🔀 550 · 📦 53 · 📋 120 - 18% open · ⏱️ 24.12.2020):

    git clone https://github.com/rtqichen/torchdiffeq
  • PyPi (📥 1.9K / month · ⏱️ 16.09.2020):

    pip install torchdiffeq
pytorch-summary (🥇24 · ⭐ 2.8K) - Model summary in PyTorch similar to `model.summary()` in.. MIT
  • GitHub (👨‍💻 11 · 🔀 330 · 📦 1.7K · 📋 140 - 73% open · ⏱️ 07.08.2020):

    git clone https://github.com/sksq96/pytorch-summary
  • PyPi (📥 29K / month · 📦 120 · ⏱️ 26.09.2018):

    pip install torchsummary
SRU (🥇24 · ⭐ 1.9K) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
  • GitHub (👨‍💻 21 · 🔀 280 · 📦 15 · 📋 100 - 47% open · ⏱️ 17.12.2020):

    git clone https://github.com/asappresearch/sru
  • PyPi (📥 2.9K / month · 📦 1 · ⏱️ 17.12.2020):

    pip install sru
PML (🥈23 · ⭐ 2.5K) - The easiest way to use deep metric learning in your application... MIT
  • GitHub (👨‍💻 12 · 🔀 340 · 📦 40 · 📋 200 - 15% open · ⏱️ 26.12.2020):

    git clone https://github.com/KevinMusgrave/pytorch-metric-learning
  • PyPi (📥 4.3K / month · ⏱️ 27.12.2019):

    pip install pytorch-metric-learning
  • Conda (📥 1K · ⏱️ 11.12.2020):

    conda install -c metric-learning pytorch-metric-learning
EfficientNets (🥈22 · ⭐ 1.2K) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 160 · 📦 27 · ⏱️ 30.11.2020):

    git clone https://github.com/rwightman/gen-efficientnet-pytorch
  • PyPi (📥 1.2K / month · ⏱️ 20.08.2020):

    pip install geffnet
Torchmeta (🥈21 · ⭐ 1.2K) - A collection of extensions and data-loaders for few-shot learning.. MIT
  • GitHub (👨‍💻 10 · 🔀 130 · 📦 23 · 📋 88 - 21% open · ⏱️ 25.11.2020):

    git clone https://github.com/tristandeleu/pytorch-meta
  • PyPi (📥 610 / month · ⏱️ 25.11.2020):

    pip install torchmeta
PyTorch Sparse (🥈21 · ⭐ 330) - PyTorch Extension Library of Optimized Autograd Sparse Matrix.. MIT
  • GitHub (👨‍💻 11 · 🔀 40 · 📋 76 - 42% open · ⏱️ 01.12.2020):

    git clone https://github.com/rusty1s/pytorch_sparse
  • PyPi (📥 4.2K / month · 📦 17 · ⏱️ 02.11.2020):

    pip install torch-sparse
reformer-pytorch (🥈20 · ⭐ 1.3K · ➕) - Reformer, the efficient Transformer, in Pytorch. MIT
  • GitHub (👨‍💻 9 · 🔀 160 · 📋 93 - 1% open · ⏱️ 11.12.2020):

    git clone https://github.com/lucidrains/reformer-pytorch
  • PyPi (📥 870 / month · ⏱️ 06.12.2020):

    pip install reformer-pytorch
torch-scatter (🥈20 · ⭐ 560) - PyTorch Extension Library of Optimized Scatter Operations. MIT
  • GitHub (👨‍💻 13 · 🔀 72 · 📋 160 - 35% open · ⏱️ 01.12.2020):

    git clone https://github.com/rusty1s/pytorch_scatter
  • PyPi (📥 5.2K / month · 📦 16 · ⏱️ 22.02.2020):

    pip install torch-scatter
Higher (🥉19 · ⭐ 1K) - higher is a pytorch library allowing users to obtain higher order.. Apache-2
  • GitHub (👨‍💻 6 · 🔀 71 · 📦 27 · 📋 69 - 33% open · ⏱️ 28.10.2020):

    git clone https://github.com/facebookresearch/higher
  • PyPi (📥 540 / month · ⏱️ 14.07.2020):

    pip install higher
Pytorch Toolbelt (🥉19 · ⭐ 870) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
  • GitHub (👨‍💻 4 · 🔀 71 · 📋 17 - 17% open · ⏱️ 24.12.2020):

    git clone https://github.com/BloodAxe/pytorch-toolbelt
  • PyPi (📥 3.8K / month · 📦 1 · ⏱️ 19.08.2020):

    pip install pytorch_toolbelt
TabNet (🥉19 · ⭐ 690) - PyTorch implementation of TabNet paper :.. MIT
  • GitHub (👨‍💻 17 · 🔀 130 · 📋 120 - 24% open · ⏱️ 24.12.2020):

    git clone https://github.com/dreamquark-ai/tabnet
  • PyPi (📥 6.6K / month · ⏱️ 15.12.2020):

    pip install pytorch-tabnet
Performer Pytorch (🥉17 · ⭐ 420 · 🐣) - An implementation of Performer, a linear attention-based.. MIT
  • GitHub (👨‍💻 4 · 🔀 58 · 📦 6 · 📋 44 - 34% open · ⏱️ 14.12.2020):

    git clone https://github.com/lucidrains/performer-pytorch
  • PyPi (📥 1.7K / month · ⏱️ 13.12.2020):

    pip install performer-pytorch
Lambda Networks (🥉16 · ⭐ 1.2K · 🐣) - Implementation of LambdaNetworks, a new approach to image.. MIT
  • GitHub (👨‍💻 3 · 🔀 130 · 📋 23 - 34% open · ⏱️ 18.11.2020):

    git clone https://github.com/lucidrains/lambda-networks
  • PyPi (📥 480 / month · ⏱️ 20.10.2020):

    pip install lambda-networks
Pywick (🥉16 · ⭐ 310) - High-level batteries-included neural network training library for.. MIT
  • GitHub (👨‍💻 4 · 🔀 35 · 📦 2 · 📋 11 - 18% open · ⏱️ 24.09.2020):

    git clone https://github.com/achaiah/pywick
  • PyPi (📥 210 / month · ⏱️ 20.01.2020):

    pip install pywick
tinygrad (🥉15 · ⭐ 3.7K · 🐣) - You like pytorch? You like micrograd? You love tinygrad!. MIT
  • GitHub (👨‍💻 41 · 🔀 400 · 📋 59 - 15% open · ⏱️ 22.12.2020):

    git clone https://github.com/geohot/tinygrad
Tensor Sensor (🥉15 · ⭐ 430 · 🐣) - The goal of this library is to generate more helpful.. MIT
  • GitHub (👨‍💻 2 · 🔀 22 · 📋 16 - 25% open · ⏱️ 03.12.2020):

    git clone https://github.com/parrt/tensor-sensor
  • PyPi (📥 390 / month · ⏱️ 03.12.2020):

    pip install tensor-sensor
micrograd (🥉14 · ⭐ 1.5K · 💀) - A tiny scalar-valued autograd engine and a neural net library.. MIT
  • GitHub (👨‍💻 2 · 🔀 100 · 📋 5 - 40% open · ⏱️ 18.04.2020):

    git clone https://github.com/karpathy/micrograd
  • PyPi (📥 44 / month · ⏱️ 18.04.2020):

    pip install micrograd
torchsde (🥉14 · ⭐ 620 · 🐣) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
  • GitHub (👨‍💻 4 · 🔀 52 · 📋 31 - 16% open · ⏱️ 30.10.2020):

    git clone https://github.com/google-research/torchsde
Tez (🥉14 · ⭐ 350 · 🐣) - Tez is a super-simple and lightweight Trainer for PyTorch. It.. Apache-2
  • GitHub (👨‍💻 1 · 🔀 38 · 📋 9 - 44% open · ⏱️ 26.12.2020):

    git clone https://github.com/abhishekkrthakur/tez
  • PyPi (📥 910 / month · ⏱️ 26.12.2020):

    pip install tez
Torch-Struct (🥉13 · ⭐ 860) - Fast, general, and tested differentiable structured prediction.. MIT
  • GitHub (👨‍💻 10 · 🔀 67 · 📋 34 - 41% open · ⏱️ 30.07.2020):

    git clone https://github.com/harvardnlp/pytorch-struct
Show 3 hidden projects...

Database Clients

Back to top

Libraries for connecting to, operating, and querying databases.

boto3 (🥇37 · ⭐ 6K) - AWS SDK for Python. Apache-2
  • GitHub (👨‍💻 110 · 🔀 1.3K · 📦 90K · 📋 2.2K - 13% open · ⏱️ 23.12.2020):

    git clone https://github.com/boto/boto3
  • PyPi (📥 49M / month · 📦 23K · ⏱️ 23.12.2020):

    pip install boto3
  • Conda (📥 4.1M · ⏱️ 24.12.2020):

    conda install -c conda-forge boto3
SQLAlchemy (🥇37 · ⭐ 3.1K) - The Database Toolkit for Python. MIT
  • GitHub (👨‍💻 450 · 🔀 490 · 📥 1.2K · 📦 220K · 📋 5.5K - 2% open · ⏱️ 26.12.2020):

    git clone https://github.com/sqlalchemy/sqlalchemy
  • PyPi (📥 13M / month · 📦 84K · ⏱️ 18.12.2020):

    pip install SQLAlchemy
  • Conda (📥 2.8M · ⏱️ 19.12.2020):

    conda install -c conda-forge sqlalchemy
redis (🥇36 · ⭐ 9K) - Redis Python Client. MIT
  • GitHub (👨‍💻 250 · 🔀 1.9K · 📋 860 - 7% open · ⏱️ 22.11.2020):

    git clone https://github.com/andymccurdy/redis-py
  • PyPi (📥 8.4M / month · 📦 49K · ⏱️ 01.06.2020):

    pip install redis
pymongo (🥇36 · ⭐ 3.2K) - PyMongo - the Python driver for MongoDB. Apache-2
  • GitHub (👨‍💻 180 · 🔀 970 · 📦 82K · ⏱️ 22.12.2020):

    git clone https://github.com/mongodb/mongo-python-driver
  • PyPi (📥 6.1M / month · 📦 36K · ⏱️ 02.12.2020):

    pip install pymongo
  • Conda (📥 320K · ⏱️ 12.10.2020):

    conda install -c conda-forge pymongo
PyMySQL (🥇35 · ⭐ 6.3K) - Pure Python MySQL Client. MIT
  • GitHub (👨‍💻 100 · 🔀 1.2K · 📦 71K · 📋 530 - 3% open · ⏱️ 10.12.2020):

    git clone https://github.com/PyMySQL/PyMySQL
  • PyPi (📥 5.3M / month · 📦 18K · ⏱️ 10.09.2020):

    pip install PyMySQL
  • Conda (📥 230K · ⏱️ 17.11.2020):

    conda install -c conda-forge pymysql
google-cloud-storage (🥇35 · ⭐ 3.4K) - Google Cloud Client Library for Python. Apache-2
  • GitHub (👨‍💻 310 · 🔀 1.3K · 📋 3.3K - 0% open · ⏱️ 29.10.2020):

    git clone https://github.com/googleapis/google-cloud-python
  • PyPi (📥 14M / month · 📦 2.1K · ⏱️ 14.12.2020):

    pip install google-cloud-storage
  • Conda (📥 560K · ⏱️ 16.12.2020):

    conda install -c conda-forge google-cloud-storage
elasticsearch (🥇35 · ⭐ 3K) - Official Python low-level client for Elasticsearch. Apache-2
  • GitHub (👨‍💻 170 · 🔀 960 · 📥 54 · 📦 19K · 📋 840 - 8% open · ⏱️ 22.12.2020):

    git clone https://github.com/elastic/elasticsearch-py
  • PyPi (📥 3.7M / month · 📦 11K · ⏱️ 09.12.2020):

    pip install elasticsearch
  • Conda (📥 340K · ⏱️ 11.12.2020):

    conda install -c conda-forge elasticsearch
kafka-python (🥈34 · ⭐ 4.1K) - Python client for Apache Kafka. Apache-2
  • GitHub (👨‍💻 200 · 🔀 1.1K · 📥 1.7K · 📦 5.6K · 📋 1.3K - 10% open · ⏱️ 02.12.2020):

    git clone https://github.com/dpkp/kafka-python
  • PyPi (📥 2M / month · 📦 2K · ⏱️ 30.09.2020):

    pip install kafka-python
  • Conda (📥 230K · ⏱️ 30.09.2020):

    conda install -c conda-forge kafka-python
peewee (🥈33 · ⭐ 8K) - a small, expressive orm -- supports postgresql, mysql and sqlite. MIT
  • GitHub (👨‍💻 140 · 🔀 1.3K · 📦 12K · 📋 1.9K - 0% open · ⏱️ 26.11.2020):

    git clone https://github.com/coleifer/peewee
  • PyPi (📥 180K / month · 📦 6.5K · ⏱️ 07.11.2020):

    pip install peewee
  • Conda (📥 270K · ⏱️ 11.11.2020):

    conda install -c conda-forge peewee
python-bigquery (🥈33 · ⭐ 3.4K · ➕) - Google BigQuery API client library. Apache-2
  • GitHub (👨‍💻 86 · 🔀 1.3K · 📦 6.1K · 📋 200 - 21% open · ⏱️ 22.12.2020):

    git clone https://github.com/googleapis/python-bigquery
  • PyPi (📥 9.4M / month · 📦 1.2K · ⏱️ 09.12.2020):

    pip install google-cloud-bigquery
Elasticsearch DSL (🥈31 · ⭐ 3.1K) - High level Python client for Elasticsearch. Apache-2
  • GitHub (👨‍💻 120 · 🔀 660 · 📥 21 · 📦 4.1K · 📋 1.1K - 4% open · ⏱️ 08.12.2020):

    git clone https://github.com/elastic/elasticsearch-dsl-py
  • PyPi (📥 910K / month · 📦 1.6K · ⏱️ 17.09.2020):

    pip install elasticsearch-dsl
Prometheus Client (🥈31 · ⭐ 1.9K) - Prometheus instrumentation library for Python applications. Apache-2
  • GitHub (👨‍💻 95 · 🔀 500 · 📋 310 - 8% open · ⏱️ 24.11.2020):

    git clone https://github.com/prometheus/client_python
  • PyPi (📥 12M / month · 📦 3.7K · ⏱️ 20.06.2019):

    pip install prometheus_client
  • Conda (📥 2.2M · ⏱️ 16.11.2020):

    conda install -c conda-forge prometheus_client
influxdb (🥈31 · ⭐ 1.4K) - Python client for InfluxDB. MIT
  • GitHub (👨‍💻 130 · 🔀 480 · 📦 3.8K · 📋 500 - 26% open · ⏱️ 11.11.2020):

    git clone https://github.com/influxdata/influxdb-python
  • PyPi (📥 1.1M / month · 📦 2.1K · ⏱️ 11.11.2020):

    pip install influxdb
  • Conda (📥 8K · ⏱️ 12.11.2020):

    conda install -c conda-forge influxdb
Cassandra Driver (🥈31 · ⭐ 1.2K) - DataStax Python Driver for Apache Cassandra. Apache-2
  • GitHub (👨‍💻 170 · 🔀 450 · 📦 2.3K · ⏱️ 11.11.2020):

    git clone https://github.com/datastax/python-driver
  • PyPi (📥 570K / month · 📦 1.2K · ⏱️ 18.06.2020):

    pip install cassandra-driver
  • Conda (📥 60K · ⏱️ 12.10.2020):

    conda install -c conda-forge cassandra-driver
confluent-kafka-python (🥈30 · ⭐ 2.1K) - Confluent's Kafka Python Client. Apache-2
  • GitHub (👨‍💻 66 · 🔀 550 · 📦 1.9K · 📋 700 - 32% open · ⏱️ 24.11.2020):

    git clone https://github.com/confluentinc/confluent-kafka-python
  • PyPi (📥 950K / month · 📦 300 · ⏱️ 22.07.2020):

    pip install confluent-kafka
pyodbc (🥈30 · ⭐ 2K) - Python ODBC bridge. MIT
  • GitHub (👨‍💻 36 · 🔀 410 · 📋 720 - 29% open · ⏱️ 26.10.2020):

    git clone https://github.com/mkleehammer/pyodbc
  • PyPi (📥 4.1M / month · 📦 3.3K · ⏱️ 10.02.2020):

    pip install pyodbc
  • Conda (📥 390K · ⏱️ 10.10.2020):

    conda install -c conda-forge pyodbc
Motor (🥈30 · ⭐ 1.6K) - Motor - the async Python driver for MongoDB and Tornado or asyncio. Apache-2
  • GitHub (👨‍💻 28 · 🔀 140 · 📦 3.4K · ⏱️ 24.09.2020):

    git clone https://github.com/mongodb/motor
  • PyPi (📥 130K / month · 📦 1.8K · ⏱️ 24.09.2020):

    pip install motor
  • Conda (📥 10K · ⏱️ 20.11.2020):

    conda install -c conda-forge motor
py2neo (🥉29 · ⭐ 950) - Py2neo is a comprehensive toolkit for working with Neo4j from within.. Apache-2
  • GitHub (👨‍💻 31 · 🔀 200 · 📋 710 - 3% open · ⏱️ 21.12.2020):

    git clone https://github.com/technige/py2neo
  • PyPi (📥 75K / month · 📦 1.1K · ⏱️ 21.11.2020):

    pip install py2neo
  • Conda (📥 32K · ⏱️ 17.03.2019):

    conda install -c conda-forge py2neo
pandas-gbq (🥉29 · ⭐ 230) - Pandas Google BigQuery. BSD-3
  • GitHub (👨‍💻 29 · 🔀 83 · 📥 190 · 📦 1.7K · 📋 190 - 17% open · ⏱️ 17.12.2020):

    git clone https://github.com/pydata/pandas-gbq
  • PyPi (📥 6.2M / month · 📦 430 · ⏱️ 10.11.2020):

    pip install pandas-gbq
  • Conda (📥 620K · ⏱️ 11.11.2020):

    conda install -c conda-forge pandas-gbq
Records (🥉28 · ⭐ 6.5K) - SQL for Humans. ISC
  • GitHub (👨‍💻 37 · 🔀 530 · 📦 700 · 📋 140 - 40% open · ⏱️ 22.12.2020):

    git clone https://github.com/kennethreitz42/records
  • PyPi (📥 180K / month · 📦 300 · ⏱️ 21.02.2019):

    pip install records
  • Conda (📥 16K · ⏱️ 15.01.2018):

    conda install -c conda-forge records
dataset (🥉28 · ⭐ 4K) - Easy-to-use data handling for SQL data stores with support for implicit.. MIT
  • GitHub (👨‍💻 71 · 🔀 260 · 📦 1.7K · 📋 240 - 4% open · ⏱️ 14.11.2020):

    git clone https://github.com/pudo/dataset
  • PyPi (📥 42K / month · 📦 1K · ⏱️ 14.11.2020):

    pip install dataset
Ibis (🥉28 · ⭐ 1.5K) - A pandas-like deferred expression system, with first-class SQL support. Apache-2
  • GitHub (👨‍💻 70 · 🔀 240 · 📦 130 · 📋 1.5K - 35% open · ⏱️ 23.12.2020):

    git clone https://github.com/ibis-project/ibis
  • PyPi (📥 60K / month · 📦 48 · ⏱️ 11.11.2020):

    pip install ibis-framework
  • Conda (📥 160K · ⏱️ 13.11.2020):

    conda install -c conda-forge ibis-framework
minio (🥉28 · ⭐ 380) - MinIO Client SDK for Python. Apache-2
  • GitHub (👨‍💻 88 · 🔀 170 · 📦 1.2K · 📋 370 - 2% open · ⏱️ 10.12.2020):

    git clone https://github.com/minio/minio-py
  • PyPi (📥 170K / month · 📦 340 · ⏱️ 08.12.2020):

    pip install minio
  • Conda (📥 15K · ⏱️ 08.12.2020):

    conda install -c conda-forge minio
Databases (🥉27 · ⭐ 1.9K) - Async database support for Python. BSD-3
  • GitHub (👨‍💻 28 · 🔀 130 · 📦 1.2K · 📋 180 - 50% open · ⏱️ 10.12.2020):

    git clone https://github.com/encode/databases
  • PyPi (📥 45K / month · 📦 60 · ⏱️ 16.11.2020):

    pip install databases
  • Conda (📥 77K · ⏱️ 21.11.2020):

    conda install -c conda-forge databases
libcloud (🥉27 · ⭐ 1.7K) - Apache Libcloud is a Python library which hides differences.. Apache-2
  • GitHub (👨‍💻 420 · 🔀 880 · 📦 1.6K · 📋 93 - 32% open · ⏱️ 19.12.2020):

    git clone https://github.com/apache/libcloud
  • PyPi (📥 55K / month · 📦 1.5K · ⏱️ 20.09.2020):

    pip install apache-libcloud
AWS Data Wrangler (🥉26 · ⭐ 1.3K) - Pandas on AWS. Apache-2
  • GitHub (👨‍💻 53 · 🔀 210 · 📥 19K · 📋 280 - 11% open · ⏱️ 22.12.2020):

    git clone https://github.com/awslabs/aws-data-wrangler
  • PyPi (📥 490K / month · ⏱️ 22.12.2020):

    pip install awswrangler
cx-Oracle (🥉26 · ⭐ 690) - Python interface to Oracle Database conforming to the Python DB API.. BSD-3
  • GitHub (👨‍💻 16 · 🔀 240 · 📋 490 - 6% open · ⏱️ 21.12.2020):

    git clone https://github.com/oracle/python-cx_Oracle
  • PyPi (📥 1M / month · 📦 140 · ⏱️ 08.12.2020):

    pip install cx-Oracle
cloudant (🥉26 · ⭐ 160) - A Python library for Cloudant and CouchDB. Apache-2
  • GitHub (👨‍💻 27 · 🔀 45 · 📦 1.2K · 📋 280 - 13% open · ⏱️ 25.11.2020):

    git clone https://github.com/cloudant/python-cloudant
  • PyPi (📥 42K / month · 📦 530 · ⏱️ 18.08.2020):

    pip install cloudant
  • Conda (📥 71K · ⏱️ 18.05.2019):

    conda install -c conda-forge cloudant
Pony (🥉25 · ⭐ 2.4K) - Pony Object Relational Mapper. Apache-2
  • GitHub (👨‍💻 20 · 🔀 190 · 📦 1.6K · 📋 530 - 44% open · ⏱️ 23.11.2020):

    git clone https://github.com/ponyorm/pony
  • PyPi (📥 21K / month · 📦 540 · ⏱️ 23.11.2020):

    pip install pony
  • Conda (📥 43K · ⏱️ 23.11.2020):

    conda install -c conda-forge pony
neo4j-driver (🥉24 · ⭐ 580) - Neo4j Bolt driver for Python. Apache-2
  • GitHub (👨‍💻 26 · 🔀 130 · 📋 170 - 2% open · ⏱️ 18.12.2020):

    git clone https://github.com/neo4j/neo4j-python-driver
  • PyPi (📥 72K / month · 📦 380 · ⏱️ 21.12.2020):

    pip install neo4j-driver
HappyBase (🥉24 · ⭐ 550 · 💀) - A developer-friendly Python library to interact with Apache HBase. MIT
  • GitHub (👨‍💻 19 · 🔀 150 · 📦 450 · 📋 210 - 13% open · ⏱️ 09.04.2020):

    git clone https://github.com/python-happybase/happybase
  • PyPi (📥 40K / month · 📦 420 · ⏱️ 03.04.2017):

    pip install happybase
  • Conda (📥 47K · ⏱️ 01.07.2019):

    conda install -c conda-forge happybase
Queries (🥉20 · ⭐ 230 · ➕) - PostgreSQL database access simplified. BSD-3
  • GitHub (👨‍💻 11 · 🔀 27 · 📦 83 · 📋 15 - 20% open · ⏱️ 07.08.2020):

    git clone https://github.com/gmr/queries
  • PyPi (📥 840 / month · 📦 25 · ⏱️ 09.03.2020):

    pip install queries
PugSQL (🥉18 · ⭐ 400) - A HugSQL-inspired database library for Python. Apache-2
  • GitHub (👨‍💻 10 · 🔀 13 · 📋 32 - 40% open · ⏱️ 13.09.2020):

    git clone https://github.com/mcfunley/pugsql
  • PyPi (📥 1.9K / month · 📦 1 · ⏱️ 13.09.2020):

    pip install pugsql
Show 8 hidden projects...
  • psycopg2 (🥇35 · ⭐ 2K) - PostgreSQL database adapter for the Python programming.. ❗️BSD-3-Clause-Attribution
  • mysqlclient (🥈30 · ⭐ 1.8K) - MySQL database connector for Python (with Python 3 support). ❗️GPL-2.0
  • mongo-connector (🥉24 · ⭐ 1.8K · 💀) - MongoDB data stream pipeline tools by YouGov (adopted from.. Apache-2
  • pyhdb (🥉23 · ⭐ 290 · 💀) - SAP HANA Connector in pure Python. Apache-2
  • filedepot (🥉21 · ⭐ 120) - Toolkit for storing files and attachments in web applications. MIT
  • db.py (🥉20 · ⭐ 1.2K · 💀) - db.py is an easier way to interact with your databases. BSD-2
  • lazydata (🥉16 · ⭐ 620 · 💀) - Lazydata: Scalable data dependencies for Python projects. Apache-2
  • SuperSQLite (🥉15 · ⭐ 660 · 💀) - A supercharged SQLite library for Python. MIT
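
Several SQL clients in this category (for example PyMySQL, pyodbc and cx-Oracle) expose the Python DB-API connect/cursor/execute pattern. The following is only a minimal sketch of that pattern, using the standard-library `sqlite3` module as a stand-in so it runs without a database server. The `top_projects` helper and the `projects` table are hypothetical names for illustration; the star counts are taken from this list. Placeholder syntax (`?` here) varies between drivers, so check the documentation of the client you use.

```python
import sqlite3


def top_projects(rows, limit=2):
    """Load (name, stars) rows into an in-memory table and return the top entries by stars."""
    # Hypothetical helper: sqlite3 stands in for any DB-API style driver.
    conn = sqlite3.connect(":memory:")
    try:
        cur = conn.cursor()
        cur.execute("CREATE TABLE projects (name TEXT, stars INTEGER)")
        # Parameterized statements keep values out of the SQL string.
        cur.executemany("INSERT INTO projects VALUES (?, ?)", rows)
        conn.commit()
        cur.execute(
            "SELECT name, stars FROM projects ORDER BY stars DESC LIMIT ?",
            (limit,),
        )
        return cur.fetchall()
    finally:
        conn.close()


rows = [("redis", 9000), ("peewee", 8000), ("Records", 6500)]
print(top_projects(rows))
```

With another DB-API driver, mainly the `connect(...)` call and the placeholder style would be expected to change.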

System Monitoring & Profiling

Back to top

Libraries to monitor, collect, and visualize system metrics (e.g., memory, CPU, GPU).

psutil (🥇36 · ⭐ 6.9K) - Cross-platform lib for process and system monitoring in Python. BSD-3
  • GitHub (👨‍💻 150 · 🔀 1.1K · 📦 68K · 📋 1.4K - 14% open · ⏱️ 24.12.2020):

    git clone https://github.com/giampaolo/psutil
  • PyPi (📥 12M / month · 📦 25K · ⏱️ 19.12.2020):

    pip install psutil
  • Conda (📥 5.2M · ⏱️ 19.12.2020):

    conda install -c conda-forge psutil
memory-profiler (🥈27 · ⭐ 2.7K) - Monitor Memory usage of Python code. BSD-3
  • GitHub (👨‍💻 83 · 🔀 290 · 📋 180 - 45% open · ⏱️ 03.12.2020):

    git clone https://github.com/pythonprofilers/memory_profiler
  • PyPi (📥 140K / month · 📦 530 · ⏱️ 16.08.2018):

    pip install memory_profiler
Yappi (🥈27 · ⭐ 540) - Yet Another Python Profiler, but this time thread&coroutine&greenlet aware. MIT
  • GitHub (👨‍💻 21 · 🔀 38 · 📦 350 · 📋 39 - 25% open · ⏱️ 09.12.2020):

    git clone https://github.com/sumerc/yappi
  • PyPi (📥 110K / month · 📦 640 · ⏱️ 27.11.2020):

    pip install yappi
  • Conda (📥 58K · ⏱️ 28.11.2020):

    conda install -c conda-forge yappi
py-spy (🥈26 · ⭐ 6.3K) - Sampling profiler for Python programs. MIT
  • GitHub (👨‍💻 19 · 🔀 220 · 📥 4K · 📦 490 · 📋 180 - 28% open · ⏱️ 20.11.2020):

    git clone https://github.com/benfred/py-spy
  • PyPi (📥 220K / month · 📦 25 · ⏱️ 22.06.2020):

    pip install py-spy
pyinstrument (🥈26 · ⭐ 2.4K) - Call stack profiler for Python. Shows you why your code is slow!. BSD-3
  • GitHub (👨‍💻 21 · 🔀 90 · 📦 200 · 📋 69 - 26% open · ⏱️ 17.12.2020):

    git clone https://github.com/joerick/pyinstrument
  • PyPi (📥 57K / month · 📦 140 · ⏱️ 09.09.2020):

    pip install pyinstrument
  • Conda (📥 60K · ⏱️ 17.12.2020):

    conda install -c conda-forge pyinstrument
gpustat (🥈26 · ⭐ 2.2K · 💤) - A simple command-line utility for querying and monitoring GPU status. MIT
  • GitHub (👨‍💻 10 · 🔀 180 · 📦 530 · 📋 62 - 29% open · ⏱️ 19.05.2020):

    git clone https://github.com/wookayin/gpustat
  • PyPi (📥 140K / month · 📦 58 · ⏱️ 22.07.2019):

    pip install gpustat
  • Conda (📥 5.6K · ⏱️ 24.11.2020):

    conda install -c conda-forge gpustat
Scalene (🥉22 · ⭐ 3K) - Scalene: a high-performance, high-precision CPU and memory profiler.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 110 · 📦 18 · 📋 69 - 24% open · ⏱️ 26.12.2020):

    git clone https://github.com/emeryberger/scalene
  • PyPi (📥 1.7K / month · ⏱️ 09.12.2020):

    pip install scalene
Profiling (🥉20 · ⭐ 3K) - Was an interactive continuous Python profiler. BSD-3
  • GitHub (👨‍💻 18 · 🔀 110 · 📦 29 · 📋 35 - 42% open · ⏱️ 24.08.2020):

    git clone https://github.com/what-studio/profiling
  • PyPi (📥 3K / month · 📦 5 · ⏱️ 28.06.2017):

    pip install profiling
Show 4 hidden projects...
  • Glances (🥇29 · ⭐ 17K) - Glances an Eye on your system. A top/htop alternative for.. ❗️LGPL-3.0
  • Diamond (🥈26 · ⭐ 1.6K · 💀) - Diamond is a python daemon that collects system metrics and.. MIT
  • line_profiler (🥉25 · ⭐ 3.5K · 💀) - (OLD REPO) Line-by-line profiling for Python - Current repo.. BSD-3
  • GPUtil (🥉23 · ⭐ 660 · 💀) - A Python module for getting the GPU status from NVIDIA GPUs using.. MIT

Others

Back to top

SymPy (🥇36 · ⭐ 7.7K) - A computer algebra system written in pure Python. BSD-3
  • GitHub (👨‍💻 1K · 🔀 3.2K · 📥 410K · 📦 27K · 📋 11K - 35% open · ⏱️ 26.12.2020):

    git clone https://github.com/sympy/sympy
  • PyPi (📥 600K / month · 📦 6.4K · ⏱️ 12.12.2020):

    pip install sympy
  • Conda (📥 1.2M · ⏱️ 13.12.2020):

    conda install -c conda-forge sympy
imbalanced-learn (🥇30 · ⭐ 4.9K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
  • GitHub (👨‍💻 51 · 🔀 1K · 📦 3.6K · 📋 450 - 10% open · ⏱️ 03.11.2020):

    git clone https://github.com/scikit-learn-contrib/imbalanced-learn
  • PyPi (📥 780K / month · 📦 280 · ⏱️ 09.06.2020):

    pip install imbalanced-learn
  • Conda (📥 100K · ⏱️ 14.06.2020):

    conda install -c conda-forge imbalanced-learn
MLxtend (🥇30 · ⭐ 3.3K) - A library of extension and helper modules for Python's data.. BSD-3
  • GitHub (👨‍💻 79 · 🔀 680 · 📦 2.3K · 📋 360 - 27% open · ⏱️ 26.12.2020):

    git clone https://github.com/rasbt/mlxtend
  • PyPi (📥 250K / month · 📦 95 · ⏱️ 26.11.2020):

    pip install mlxtend
  • Conda (📥 150K · ⏱️ 26.11.2020):

    conda install -c conda-forge mlxtend
PyOD (🥇28 · ⭐ 3.9K) - A Python Toolbox for Scalable Outlier Detection (Anomaly Detection). BSD-2
  • GitHub (👨‍💻 20 · 🔀 810 · 📦 430 · 📋 180 - 55% open · ⏱️ 23.12.2020):

    git clone https://github.com/yzhao062/pyod
  • PyPi (📥 96K / month · 📦 6 · ⏱️ 23.12.2020):

    pip install pyod
Cython BLIS (🥇28 · ⭐ 160) - Fast matrix-multiplication as a self-contained Python library no.. BSD-3
  • GitHub (👨‍💻 9 · 🔀 22 · 📦 7.4K · 📋 21 - 28% open · ⏱️ 07.12.2020):

    git clone https://github.com/explosion/cython-blis
  • PyPi (📥 810K / month · 📦 390 · ⏱️ 07.12.2020):

    pip install blis
  • Conda (📥 380K · ⏱️ 07.12.2020):

    conda install -c conda-forge cython-blis
hdbscan (🥈27 · ⭐ 1.8K · ➕) - A high performance implementation of HDBSCAN clustering. BSD-3
  • GitHub (👨‍💻 63 · 🔀 330 · 📦 690 · 📋 350 - 60% open · ⏱️ 17.12.2020):

    git clone https://github.com/scikit-learn-contrib/hdbscan
  • PyPi (📥 140K / month · 📦 120 · ⏱️ 19.03.2020):

    pip install hdbscan
  • Conda (📥 500K · ⏱️ 02.11.2020):

    conda install -c conda-forge hdbscan
pyopencl (🥈27 · ⭐ 760 · ➕) - OpenCL integration for Python, plus shiny features. MIT
  • GitHub (👨‍💻 82 · 🔀 200 · 📦 430 · 📋 260 - 19% open · ⏱️ 20.12.2020):

    git clone https://github.com/inducer/pyopencl
  • PyPi (📥 6.1K / month · 📦 240 · ⏱️ 20.11.2020):

    pip install pyopencl
  • Conda (📥 290K · ⏱️ 20.11.2020):

    conda install -c conda-forge pyopencl
Streamlit (🥈26 · ⭐ 12K) - Streamlit The fastest way to build data apps in Python. Apache-2
  • GitHub (👨‍💻 82 · 🔀 1K · 📦 41 · 📋 1.4K - 35% open · ⏱️ 24.12.2020):

    git clone https://github.com/streamlit/streamlit
  • PyPi (📥 120K / month · 📦 34 · ⏱️ 21.12.2020):

    pip install streamlit
carla (🥈26 · ⭐ 5.4K · ➕) - Open-source simulator for autonomous driving research.
