Stars
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
⚡ TabPFN: Foundation Model for Tabular Data ⚡
Focused on fast experimentation and simplicity
A generative world for general-purpose robotics & embodied AI learning.
the AI-native open-source embedding database
Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable. https://docs.kidger.site/diffrax/
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Generate large synthetic data using an LLM
Scan for React performance issues and eliminate slow renders in your app
Fast and accurate automatic speech recognition (ASR) for edge devices
Quantitative Investment Strategies (QIS) package implements Python analytics for visualisation of financial data, performance reporting, analysis of quantitative strategies.
Tools for merging pretrained large language models.
Curated list of datasets and tools for post-training.
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
Machine Learning Engineering Open Book
Codebase for Aria - an Open Multimodal Native MoE
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
Companion code to the "How to Write a Google Maps React Component" Tutorial
Helpful tools and examples for working with flex-attention
SGLang is a fast serving framework for large language models and vision language models.