Starred repositories
The code, training pipeline, and models that power Firefox Translations
Run PyTorch LLMs locally on servers, desktop and mobile
A vector search SQLite extension that runs anywhere!
Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
Deezer source separation library including pretrained models.
Distribute and run LLMs with a single file.
OpusFilter - Parallel corpus processing toolkit
OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.
Generates task dependency graphs for Taskcluster CI
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A high-throughput and memory-efficient inference and serving engine for LLMs
Awesome-LLM: a curated list of Large Language Models
Exploration of processing multimedia content on social networks with AI
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A Python library for calculating a large variety of metrics from text
🦜🔗 Build context-aware reasoning applications
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Ongoing research training transformer models at scale
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
A library for efficient similarity search and clustering of dense vectors.
Python client for Qdrant vector search engine
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Best Practices on Recommendation Systems