Stars
An Open Source Machine Learning Framework for Everyone
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Tesseract Open Source OCR Engine (main repository)
A library for efficient similarity search and clustering of dense vectors.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Automatic License Plate Recognition library
Unsupervised text tokenizer for Neural Network-based text generation.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
A flexible, high-performance serving system for machine learning models
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models
The repository contains Google's robots.txt parser and matcher as a C++ library (compliant to C++11).
Stanford Network Analysis Platform (SNAP) is a general purpose network analysis and graph mining library.
A high-quality speech analysis, manipulation and synthesis system
Trust & Safety tools for working together to fight digital harms.
NetworKit is a growing open-source toolkit for large-scale network analysis.
Experiments towards neural network theorem proving
Main codebase for TeXworks, a simple interface for working with TeX documents
Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018).