- Peking University
- China, Beijing
- UTC +08:00
- blog.fizzmy.club
Highlights
- Pro
Stars
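AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.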
PKU-DAIR / Hetu-Galvatron
Forked from AFDWang/Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you are interested, please visit/star/fork https://github.com/P…
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM
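A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.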
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you are interested, please visit/star/fork https://github.com/PKU-DAIR/Hetu
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 A list of projects by Chinese independent developers: sharing what everyone is working on
《Machine Learning Systems: Design and Implementation》- Chinese Version
Ongoing research training transformer models at scale
A curated reading list of research in Mixture-of-Experts(MoE).
Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning: the "螺旋桨" (PaddleHelix) bio-computing toolkit
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
LightSeq: A High Performance Library for Sequence Processing and Generation
🚀 AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Vi…
🌟 Wiki of OI / ICPC for everyone. (An online strategy guide to a certain large-scale game, featuring dazzling arithmetic magic)
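A roundup of 2022 notices and announcements on pre-admission for CS postgraduate recommendation (exam-exempt) programs. Everyone is welcome to share pre-admission information; how about showing some support for the spirit of the Internet?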
Pre-Built Vulnerable Environments Based on Docker-Compose