dreaming
Undergraduate from ZJU, interested in LLM & Trading.
-
UG student@ZJU
- 求是鼎
-
08:22
(UTC +08:00) - Frankgu3528.github.io
Highlights
- Pro
Stars
transformer
5 repositories
A pytorch &keras implementation and demo of Fastformer.
Transformer related optimization, including BERT, GPT
Fast and memory-efficient exact attention
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"