AnnotatedGPT

This a simple version of GPT model with 300 lines of code.

The project is adapted from The Annotated Transformer.

The transformer is encoder-decoder architecture, but the GPT is decoder-only architecture. This is the main difference between these two projects. Other than that, most of the code is the same as that project.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.idea		.idea
annotated_GPT.egg-info		annotated_GPT.egg-info
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py
simple_gpt_with_training_and_visulization.ipynb		simple_gpt_with_training_and_visulization.ipynb
vocab.pt		vocab.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AnnotatedGPT

The Decoder-Only Architecture

The Encoder-Decoder Architecture

About

Releases

Packages

Languages

0324wy/annotated-GPT

Folders and files

Latest commit

History

Repository files navigation

AnnotatedGPT

The Decoder-Only Architecture

The Encoder-Decoder Architecture

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages