Starred repositories
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.
Fully open reproduction of DeepSeek-R1
Make websites accessible for AI agents
Rich is a Python library for rich text and beautiful formatting in the terminal.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Anthropic's educational courses
A react-based starter app for using the Multimodal Live API over websockets with Gemini
Recipes to scale inference-time compute of open models
A course on aligning smol models.
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ
Agentic components of the Llama Stack APIs
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
PyTorch extensions for high performance and large scale training.
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
An annotated implementation of the Transformer paper.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Unsupervised text tokenizer for Neural Network-based text generation.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production