SHYuanBest

Organizations

@PKU-YuanGroup

Blending Custom Photos with Video Diffusion Transformers

Python 35 Updated Jan 7, 2025

Memory-optimized training scripts for video models based on Diffusers

Python 713 74 Updated Jan 13, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 257 9 Updated Jan 13, 2025

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Python 316 22 Updated Jul 17, 2024

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,361 274 Updated Dec 14, 2024

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Python 1,123 59 Updated Jul 17, 2024

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 845 48 Updated Jan 14, 2025

A pipeline parallel training script for diffusion models.

Python 400 38 Updated Jan 13, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,165 106 Updated Jan 11, 2025

A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation

43 3 Updated Dec 16, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,344 559 Updated Jan 13, 2025
Jupyter Notebook 6 Updated Nov 28, 2024

【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".

Python 30 Updated Dec 5, 2024

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,731 64 Updated Jan 8, 2025

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 560 28 Updated Jan 10, 2025

Fundamentals of Digital Media Technology (04713901) | Peking University ECE Course Materials

C 19 1 Updated Feb 4, 2022

Face analysis tools for modern research, equipped with state-of-the-art Face Parsing and Face Alignment

Python 354 39 Updated May 27, 2024

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 605 41 Updated Dec 16, 2024

Experiencing lightning fast (~1s) and accurate drag-based image editing

Python 63 2 Updated Oct 23, 2024

Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]

Python 8 Updated Sep 8, 2024

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 894 37 Updated Jan 11, 2025

Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Python 107 6 Updated Dec 3, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,596 1,339 Updated Dec 25, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,668 122 Updated Jan 13, 2025

CLIP+MLP Aesthetic Score Predictor

Python 958 90 Updated Jul 1, 2024

Bring portraits to life!

Python 13,612 1,458 Updated Jan 1, 2025
Python 414 89 Updated Nov 25, 2024