Skip to content
View karpathy's full-sized avatar

Highlights

  • Pro

Block or report karpathy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,164 37 Updated Apr 18, 2025
Python 218 10 Updated Apr 22, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,642 1,470 Updated Apr 2, 2025

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,393 53 Updated Jan 12, 2025

NanoGPT (124M) in 3 minutes

Python 2,501 289 Updated Apr 23, 2025

A PyTorch native library for large-scale model training

Python 3,627 343 Updated Apr 24, 2025

Efficient Triton Kernels for LLM Training

Python 4,909 310 Updated Apr 24, 2025

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Python 1,335 81 Updated Apr 24, 2025

Official inference repo for FLUX.1 models

Python 21,460 1,518 Updated Feb 6, 2025

the scott CPU from "But How Do It Know?" by J. Clark Scott

Go 1,913 163 Updated Oct 21, 2020

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,572 249 Updated Apr 23, 2025

Animation engine for explanatory math videos

Python 76,979 6,657 Updated Mar 20, 2025

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,853 187 Updated Mar 11, 2025

Simple Byte pair Encoding mechanism used for tokenization process . written purely in C

C 129 5 Updated Nov 11, 2024

UNet diffusion model in pure CUDA

Cuda 602 28 Updated Jun 28, 2024

[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Python 865 47 Updated Feb 19, 2025

gpt-2 from scratch in mlx

Python 382 27 Updated Jun 12, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 91,116 11,512 Updated Apr 23, 2025

Implementation of Diffusion Transformer (DiT) in JAX

Python 272 6 Updated Jun 11, 2024

Implementation for MatMul-free LM.

Python 2,991 187 Updated Nov 5, 2024

Schedule-Free Optimization in PyTorch

Python 2,146 74 Updated Apr 11, 2025

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,711 616 Updated Apr 24, 2025

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 21,480 2,185 Updated Apr 23, 2025

My favorite C programming practices.

2,066 98 Updated Oct 1, 2020

Tile primitives for speedy kernels

Cuda 2,288 137 Updated Apr 24, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 8,237 623 Updated Aug 18, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,361 538 Updated Apr 22, 2025

LLM inference in C/C++

C++ 78,721 11,501 Updated Apr 24, 2025

Distribute and run LLMs with a single file.

C++ 22,257 1,169 Updated Apr 21, 2025

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 534 29 Updated Apr 23, 2025
Next