Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you have any interests, please visit/star/fork https://github.com/P…

Python 11 4 Updated Jul 11, 2024

pengsida / learning_research

本人的科研经验

5,601 335 Updated Sep 28, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 23,747 2,652 Updated Oct 2, 2024

GMyhf / 2024spring-cs410

5 Updated Apr 9, 2024

alibaba / Megatron-LLaMA

Forked from NVIDIA/Megatron-LM

Best practice for training LLaMA models in Megatron-LM

Python 613 51 Updated Jan 2, 2024

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

15,244 1,416 Updated Sep 19, 2024

microsoft / AI-System

System for AI Education Resource.

Python 3,461 431 Updated Jun 21, 2024

academicpages / academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 11,962 42,717 Updated Oct 3, 2024

hithesis / hithesis

嗨！thesis！哈尔滨工业大学毕业论文LaTeX模板

TeX 1,655 364 Updated Jul 29, 2024

meta-llama / llama

Inference code for Llama models

Python 55,856 9,513 Updated Aug 18, 2024

Hsword / Hetu

A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, please visit/star/fork https://github.com/PKU-DAIR/Hetu

Python 103 46 Updated Dec 18, 2023

PlexPt / awesome-chatgpt-prompts-zh

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

52,340 13,518 Updated Jul 30, 2024

linguishi / chinese_sentiment

中文情感分析，CNN，BI-LSTM，文本分类

Python 864 108 Updated Oct 22, 2022

1c7 / chinese-independent-developer

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻中国独立开发者项目列表 -- 分享大家都在做什么

37,263 3,109 Updated Oct 6, 2024

openmlsys / openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 3,995 431 Updated Apr 13, 2024

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 10,177 2,288 Updated Oct 5, 2024

codecaution / Awesome-Mixture-of-Experts-Papers

A curated reading list of research in Mixture-of-Experts(MoE).

528 40 Updated Sep 4, 2023

PaddlePaddle / PaddleHelix

Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning “螺旋桨”生物计算工具集

Python 985 222 Updated Sep 9, 2024

mlc-ai / mlc-zh

Python 586 63 Updated Jun 4, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,986 4,062 Updated Oct 6, 2024

bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,180 328 Updated May 16, 2023

HuaizhengZhang / AI-System-School

🚀 AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Vi…

2,667 305 Updated Aug 14, 2024

OI-wiki / OI-wiki

🌟 Wiki of OI / ICPC for everyone. （某大型游戏线上攻略，内含炫酷算术魔法）

TypeScript 20,770 3,900 Updated Oct 6, 2024

CS-BAOYAN / CSYuTuiMian2022

关于2022年CS保研预推免通知公告的汇总，欢迎大家积极分享预推免信息，资瓷一下互联网精神吼不吼啊？

630 61 Updated Sep 27, 2022

mli / paper-reading

深度学习经典、新论文逐段精读

26,522 2,408 Updated Aug 8, 2024

vulhub / vulhub

Pre-Built Vulnerable Environments Based on Docker-Compose

Dockerfile 17,579 4,445 Updated Sep 29, 2024

FudanNLP / nlp-beginner

NLP上手教程

5,842 1,314 Updated May 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xinyi Liu Fizzmy

Achievements