Stars
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Tools for merging pretrained large language models.
๐ค ๐๐ฒ๐ฎ๐ฟ๐ป for ๐ณ๐ฟ๐ฒ๐ฒ how to ๐ฏ๐๐ถ๐น๐ฑ an end-to-end ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป-๐ฟ๐ฒ๐ฎ๐ฑ๐ ๐๐๐ & ๐ฅ๐๐ ๐๐๐๐๐ฒ๐บ using ๐๐๐ ๐ข๐ฝ๐ best practices: ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + 12 ๐ฉ๐ข๐ฏ๐ฅ๐ด-๐ฐ๐ฏ ๐ญ๐ฆ๐ด๐ด๐ฐ๐ฏ๐ด
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
machine learning system examples
My solution for TalkingData AdTracking Fraud Detection Challenge (https://www.kaggle.com/c/talkingdata-adtracking-fraud-detection/)
building machine learning system
Codebase for BirdClef 2023 solution
๐ฅ42nd place in CommonLit Readability Prize competition๐ฅ
abeja-inc / Megatron-LM
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
My training code which is used in 4th place solution in kaggleใCommonLit - Evaluate Student Summariesใ.