Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Enjoy the magic of Diffusion models!
No fortress, purely open ground. OpenManus is Coming.
Building DeepSeek R1 from Scratch
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
Muon optimizer: >30% sample-efficiency gain with <3% wall-clock overhead
Fully open reproduction of DeepSeek-R1
Solve Visual Understanding with Reinforced VLMs
PyTorch distributed training tutorials
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
This project shares the technical principles behind large language models along with hands-on experience (LLM engineering and real-world application deployment).
AISystem covers the full AI systems stack, including AI chips, AI compilers, and AI inference and training frameworks.
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM. Best practices for training LLaMA models in Megatron-LM.
Stable-Hair: Real-World Hair Transfer via Diffusion Model (AAAI 2025)
《代码随想录》LeetCode problem-solving guide: a recommended order for 200 classic problems, 600k words of detailed illustrated explanations, video walkthroughs of hard points, 50+ mind maps, with solutions in C++, Java, Python, Go, JavaScript, and more, so algorithm study is no longer confusing! 🔥🔥 Take a look; you'll wish you'd found it sooner! 🚀
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Transformer seq2seq model: a program that builds a language translator from a parallel corpus
Transformer: PyTorch Implementation of "Attention Is All You Need"
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Ongoing research training transformer models at scale