Stars
This repository contains the Hugging Face Agents Course.
Instruct-tune LLaMA on consumer hardware
A library for mechanistic interpretability of GPT-style language models
A course on aligning smol models.
Port of OpenAI's Whisper model in C/C++
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
List of papers related to neural network quantization in recent AI conferences and journals.
A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
This repository contains the experimental PyTorch native float8 training UX
A pytorch quantization backend for optimum
LLM Finetuning with peft
💯 Curated coding interview preparation materials for busy software engineers
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A minimal version of GPT-2 in 175 lines of PyTorch code.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022