Stars
Open and efficient video watermarking
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Large-Scale Multimodal Dataset of Astronomical Data
A 15TB Collection of Physics Simulation Datasets
Fuzzy Logic and Fuzzy Inference for Python 3
Python implementation for some Interval Type 2 Fuzzy Set (IT2 FS) concepts.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Ongoing research training transformer models at scale
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Language-Agnostic SEntence Representations
Models, data loaders and abstractions for language processing, powered by PyTorch
A system for quickly generating training data with weak supervision
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Efficiently computes derivatives of NumPy code.
😈Awful AI is a curated list to track current scary usages of AI - hoping to raise awareness
Supervisely SDK for Python - convenient way to automate, customize and extend Supervisely Platform for your computer vision task