Starred repositories

GenEval: An object-focused framework for evaluating text-to-image alignment

HTML 160 8 Updated Jul 24, 2024

Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Python 421 35 Updated Sep 3, 2023

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,467 91 Updated Oct 31, 2024

The official implementation of Distribution Backtracking Distillation for One-step Diffusion Models

Python 27 1 Updated Jan 25, 2025

The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

Python 365 68 Updated May 12, 2024

A library for easily merging multiple LLM experts and efficiently training the merged LLM.

Python 433 28 Updated Aug 26, 2024

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Python 1,671 206 Updated Jan 15, 2024

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,496 396 Updated Dec 10, 2024

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 352 9 Updated Feb 3, 2025

[CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm

Python 61 4 Updated Apr 28, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,750 86 Updated Oct 31, 2024

DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)

Python 146 1 Updated Sep 7, 2023

A generative world for general-purpose robotics & embodied AI learning.

Python 23,533 1,993 Updated Feb 3, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 956 57 Updated Feb 3, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 629 34 Updated Jan 21, 2025

[NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".

Python 45 5 Updated Dec 6, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,513 4,224 Updated Feb 5, 2025

[NeurIPS 2024] Search for Efficient LLMs

Python 12 Updated Jan 16, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,029 59 Updated Jan 2, 2025

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 820 40 Updated Jan 28, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,065 652 Updated Jan 24, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,487 427 Updated Jan 12, 2025

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python 1,321 88 Updated Jan 23, 2024

A collection of diffusion model papers categorized by their subareas

1,474 73 Updated Feb 4, 2025

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Cuda 921 54 Updated Jan 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 36,193 5,478 Updated Feb 5, 2025

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

Python 689 50 Updated Jan 31, 2025

A summary of related works about flow matching, stochastic interpolants

376 14 Updated Jul 29, 2024

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Shell 107 4 Updated Nov 2, 2024

The code of our work "Golden Noise for Diffusion Models: A Learning Framework".

Python 81 7 Updated Dec 20, 2024