minLlama3

for a better guide than mine, click here

minLlama3

This repo is meant as a guide on how Llama3's architecture works in the same vein of Andrej Karpathy's minGPT (hint: it's basically the same as Llama2). Over in the colab notebook I'll hold your hand through every single operation performed in Llama 3, and in 'model.py' you can check out what that code looks like once it's turned into actual pytorch nn.Module objects. 'training.ipynb' and 'inference.ipynb' are what they sound like. there are 3 different models and 4 different tokenizers over in 'models/' and 'tokenizers/'. The only requirement you'll need to install in order for everything to run that doesn't come with python by default is pytorch. Check out the accompanying youtube video!

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
models		models
tokenizers		tokenizers
.gitignore		.gitignore
README.md		README.md
inference.ipynb		inference.ipynb
input.txt		input.txt
model.py		model.py
newmodel.py		newmodel.py
newtrain.py		newtrain.py
params.py		params.py
tiny_shakespeare_tokenizer.py		tiny_shakespeare_tokenizer.py
train.py		train.py
train_latency_gpu.py		train_latency_gpu.py
training.ipynb		training.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

minLlama3

About

Releases

Packages

Languages

ryyzn9/minLlama3

Folders and files

Latest commit

History

Repository files navigation

minLlama3

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages