ejyuen/llm-foundry

benchmarks

Fast reference benchmarks for training ML models with recipes. Designed to be easily forked and modified.

ResNet-50

Figure 1: Comparison of MosaicML recipes against other results, all measured on 8x A100s on MosaicML Cloud.

Train the MosaicML ResNet, the fastest ResNet-50 implementation, which yields a ✨ 7x ✨ faster time-to-train than a strong baseline. See our blog for more details and recipes. Our recipes were also demonstrated at MLPerf, a cross-industry ML benchmark.

🚀 Get started with the code here.

GPT-3

A simple yet feature-complete implementation of GPT-3 that scales to 175B parameters while maintaining GPU utilization similar to other approaches. The code is flexible, written in vanilla PyTorch, and uses PyTorch FSDP along with some recent efficiency improvements.

🚀 Get started with the code here.
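
To give a feel for what "scales to 175B parameters" means in practice, here is a rough back-of-the-envelope sketch. It is not code from this repository. The function names are hypothetical, the hyperparameters (96 layers, model width 12288, vocabulary 50257, context length 2048) are taken from the published GPT-3 configuration rather than from this codebase, and the figure of 16 bytes of training state per parameter is a common rule of thumb for mixed-precision Adam, assumed here for illustration.

```python
def gpt_param_count(n_layers: int, d_model: int, vocab_size: int, max_seq_len: int) -> int:
    """Approximate parameter count of a GPT-style decoder-only transformer.

    Assumes a standard block: fused QKV + output projection, an MLP with
    4x expansion, two layer norms per block, learned position embeddings,
    and an output head tied to the token embedding.
    """
    attn = 4 * d_model * d_model + 4 * d_model   # QKV + output projection, with biases
    mlp = 8 * d_model * d_model + 5 * d_model    # two linear layers (d -> 4d -> d), with biases
    norms = 4 * d_model                          # two layer norms (scale + shift each)
    blocks = n_layers * (attn + mlp + norms)
    embeddings = vocab_size * d_model + max_seq_len * d_model
    final_norm = 2 * d_model
    return blocks + embeddings + final_norm


def sharded_state_gib(n_params: int, n_gpus: int, bytes_per_param: int = 16) -> float:
    """Per-GPU memory (GiB) for parameters, gradients, and optimizer state
    when all three are sharded evenly across n_gpus, as full sharding does.

    bytes_per_param=16 is an assumed rule of thumb for mixed-precision Adam;
    activations and temporary buffers are not included.
    """
    return n_params * bytes_per_param / n_gpus / 2**30


if __name__ == "__main__":
    n = gpt_param_count(n_layers=96, d_model=12288, vocab_size=50257, max_seq_len=2048)
    print(f"approx. parameters: {n / 1e9:.1f}B")
    for gpus in (8, 64, 512):
        print(f"{gpus:4d} GPUs -> {sharded_state_gib(n, gpus):8.1f} GiB of sharded state per GPU")
```

Under these assumptions the count comes out at roughly 175B, and the per-GPU figure shows why sharding parameters, gradients, and optimizer state across many devices is needed at this scale: the state halves every time the GPU count doubles.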

About

LLM training code for MosaicML foundation models


Languages

  • Python 96.7%
  • Shell 3.2%
  • Other 0.1%