Stars
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Optimized primitives for collective multi-GPU communication
FlashInfer: Kernel Library for LLM Serving
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
A brand-new multi-scenarios smart contract compiler framework
DLRover: An Automatic Distributed Deep Learning System
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Clone a voice in 5 seconds to generate arbitrary speech in real-time
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official GitHub page for the survey paper "A Survey of Large Language Models".
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A Rust wrapper around native shared memory for Linux and Windows
Rust crate for implementing FUSE backends
Build, Share and Run Both Your Kubernetes Cluster and Distributed Applications (Project under CNCF)
Rust crate for implementing FUSE backends