Stars
aider is AI pair programming in your terminal
The fastest way to develop full-stack web apps with React & Node.js.
A machine learning software for extracting information from scholarly documents
smolLM with Entropix sampler on pytorch
Entropy Based Sampling and Parallel CoT Decoding
Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
A container registry backed by Workers and R2.
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching of inference workloads.
Instant is a modern Firebase. We make you productive by giving your frontend a real-time database.
Maelstrom is a fast Rust, Go, and Python test runner that runs every test in its own container. Tests are either run locally or distributed to a clustered job runner.
Simplex Random Feature attention, in PyTorch
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
event broker with a focus on low operational cost
A novel human-interaction method for real-time speech extraction on headphones.
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
Tile primitives for speedy kernels
HazyResearch / train-tk
Forked from karpathy/nanoGPTtrain with kittens!
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
SGLang is a fast serving framework for large language models and vision language models.