My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

Jupyter Notebook 1,024 176 Updated Dec 27, 2020

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 26,424 3,040 Updated Oct 2, 2024

jessevig / bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 7,344 815 Updated Aug 24, 2023

openai / spinningup-workshop

For educational materials related to the spinning up workshops.

TeX 200 48 Updated Feb 12, 2019

vmayoral / basic_reinforcement_learning

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

Jupyter Notebook 1,145 364 Updated Jul 14, 2023

prateekjoshi565 / Fine-Tuning-BERT

Jupyter Notebook 150 121 Updated Jan 17, 2023

imelnyk / ArxivPapers

Code behind Arxiv Papers

Python 513 60 Updated Apr 2, 2024

mshumer / gpt-llm-trainer

Jupyter Notebook 4,113 543 Updated Mar 28, 2024

MzeroMiko / VMamba

VMamba: Visual State Space Models，code is based on mamba

Python 2,556 175 Updated Mar 7, 2025

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,378 231 Updated Feb 13, 2025

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 43,600 6,565 Updated Apr 23, 2025

karpathy / ng-video-lecture

Python 3,931 1,050 Updated Jan 31, 2024

facebookresearch / jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,953 294 Updated Feb 27, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 40,850 6,763 Updated Dec 9, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,597 912 Updated Jul 1, 2024

kyegomez / EvoVLM-JP

Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI

Python 30 1 Updated Nov 11, 2024

fangyuan-ksgk / Evolutionary-Model-Merge

Unofficial Implementation of Evolutionary Model Merging

Python 38 2 Updated Mar 28, 2024

xai-org / grok-1

Grok open release

Python 50,235 8,347 Updated Aug 30, 2024

anthropics / anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 11,885 1,391 Updated Apr 17, 2025

bbycroft / llm-viz

3D Visualization of an GPT-style LLM

TypeScript 4,638 520 Updated Aug 24, 2024

hkproj / mistral-llm-notes

Notes on the Mistral AI model

Jupyter Notebook 19 5 Updated Dec 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fkhawar

Block or report fkhawar

Stars

reczoo / FuxiCTR

graviraja / MLOps-Basics

EurekaLabsAI / ngram

apple / corenet

autogluon / autogluon

deep-diver / llamaduo

mistralai / mistral-common

dvlab-research / LongLoRA

google-deepmind / recurrentgemma

gordicaleksa / pytorch-original-transformer