- The Hong Kong University of Science and Technology (Guangzhou)
- Guangzhou, Guangdong, China
- https://scholar.google.com.hk/citations?user=hmUOaNcAAAAJ&hl=zh-CN
Starred repositories
GenEval: An object-focused framework for evaluating text-to-image alignment
PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
The official implementation of Distribution Backtracking Distillation for One-step Diffusion Models
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
[CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
A generative world for general-purpose robotics & embodied AI learning.
FastVideo is a lightweight framework for accelerating large video diffusion models.
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
[NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Allegro is a powerful text-to-video model that generates high-quality videos of up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Official PyTorch implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
HunyuanVideo: A Systematic Framework For Large Video Generation Model
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
A collection of diffusion model papers categorized by subarea
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
A summary of related work on flow matching and stochastic interpolants
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
The code of our work "Golden Noise for Diffusion Models: A Learning Framework".