Stars
Implementing DeepSeek R1's GRPO algorithm from scratch (the group-relative advantage step is sketched after this list)
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Official repository for our work on micro-budget training of large-scale diffusion models.
A PyTorch native library for large-scale model training
Efficient Triton Kernels for LLM Training
An MLX port of FLUX based on the Hugging Face Diffusers implementation.
Official inference repo for FLUX.1 models
The Scott CPU from "But How Do It Know?" by J. Clark Scott
Run PyTorch LLMs locally on servers, desktop and mobile
A lightweight library for portable low-level GPU computation using WebGPU.
A simple Byte Pair Encoding (BPE) tokenizer written purely in C (the core merge loop is sketched after this list)
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Implementation of Diffusion Transformer (DiT) in JAX
Schedule-Free Optimization in PyTorch (the update rule is sketched after this list)
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Perplexica is an AI-powered search engine; it is an open-source alternative to Perplexity AI.
Tile primitives for speedy kernels
A minimal GPU design in Verilog to learn how GPUs work from the ground up
A lightweight, standalone C++ inference engine for Google's Gemma models.
Distribute and run LLMs with a single file.
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
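For the two GRPO/R1-Zero entries above: a minimal Python sketch of the group-relative advantage computation that gives GRPO its name. Normalizing each reward against the other completions sampled for the same prompt removes the need for a learned value function; the linked repositories' actual implementations (policy-ratio clipping, KL penalty, batching) go well beyond this.

```python
import torch

def grpo_advantages(rewards: torch.Tensor) -> torch.Tensor:
    """Group-relative advantages: normalize each completion's reward
    against the group of completions sampled for the same prompt.

    rewards: (num_prompts, group_size) scalar reward per completion.
    """
    mean = rewards.mean(dim=-1, keepdim=True)
    std = rewards.std(dim=-1, keepdim=True)
    return (rewards - mean) / (std + 1e-8)  # eps guards degenerate groups

# Example: one prompt with a group of 4 sampled completions.
rewards = torch.tensor([[1.0, 0.0, 0.5, 1.0]])
print(grpo_advantages(rewards))
```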
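For the BPE-in-C entry: the linked repo is written in C, but the core merge loop is easiest to show compactly in Python. A minimal sketch only; real tokenizers add a regex pre-split, special tokens, and a decoder.

```python
from collections import Counter

def bpe_train(ids: list[int], num_merges: int) -> dict[tuple[int, int], int]:
    """Minimal byte-pair encoding: repeatedly replace the most frequent
    adjacent pair of token ids with a freshly allocated token id."""
    merges = {}
    next_id = 256  # byte values 0..255 form the base vocabulary
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = pairs.most_common(1)[0][0]
        merges[pair] = next_id
        # Rewrite the sequence, replacing every occurrence of the pair.
        out, i = [], 0
        while i < len(ids):
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
        next_id += 1
    return merges

print(bpe_train(list("aaabdaaabac".encode("utf-8")), num_merges=3))
```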
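For the schedule-free entry: a sketch of the schedule-free SGD update after Defazio et al. (2024), where gradients are taken at an interpolation y of a fast iterate z and a running average x, so no learning-rate schedule is needed. The `quadratic_grad` toy objective and the constants are illustrative assumptions; the library itself ships drop-in torch.optim-style optimizer classes rather than this hand-rolled loop.

```python
import torch

def quadratic_grad(w):             # toy objective: f(w) = 0.5 * ||w||^2
    return w

x = z = torch.ones(3)              # x: running average (deploy point), z: fast iterate
lr, beta = 0.1, 0.9
for t in range(1, 101):
    y = (1 - beta) * z + beta * x  # gradient evaluation point
    z = z - lr * quadratic_grad(y)
    c = 1.0 / (t + 1)
    x = (1 - c) * x + c * z        # running weighted average of the z iterates
print(x)                           # approaches the minimizer at 0
```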