Stars
Entropy Based Sampling and Parallel CoT Decoding
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Open Source Alternative to Vercel, Netlify and Heroku.
🦄 0-legacy, tiny & fast web framework as a replacement for Express
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
Supercharge your Nuxt app with managed job queues, workers, and customizable scheduling.
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.