Stars
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Faster Whisper transcription with CTranslate2
Running large language models on a single GPU for throughput-oriented scenarios.
A curated list of awesome papers on contextualizing E2E ASR outputs
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Fast inference engine for Transformer models
Tools for handling speech data in machine learning projects.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Interactive Neural Machine Translation tool
A tool for extracting plain text from Wikipedia dumps
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Fast and customizable text tokenization library with BPE and SentencePiece support
Reading list for Awesome Sentiment Analysis papers
A Code-First Introduction to NLP course
Minimal Docker images: a collection of Dockerfiles illustrating how to reduce container image size.
Deezer source separation library including pretrained models.