Skip to content

0324wy/annotated-GPT

Repository files navigation

AnnotatedGPT

This a simple version of GPT model with 300 lines of code.

The project is adapted from The Annotated Transformer.

The transformer is encoder-decoder architecture, but the GPT is decoder-only architecture. This is the main difference between these two projects. Other than that, most of the code is the same as that project.

The Decoder-Only Architecture

The Encoder-Decoder Architecture

About

A simple GPT model with 300 lines of code

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published