Skip to content
View Xuxue1's full-sized avatar
  • 小影科技
  • 浙江省杭州市

Block or report Xuxue1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 766 37 Updated Feb 24, 2025

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 802 53 Updated Sep 8, 2024

FlashMLA: Efficient MLA decoding kernels

C++ 11,017 748 Updated Mar 1, 2025

Solve Visual Understanding with Reinforced VLMs

Python 3,702 222 Updated Mar 3, 2025
Python 3,734 293 Updated Feb 27, 2025

DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.

Python 318 27 Updated Jan 22, 2025

Official implementation of SVFR.

Python 750 72 Updated Jan 19, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,150 792 Updated Mar 3, 2025

super fast propainter | 百倍速propainter

Python 65 4 Updated Jul 8, 2024

Official mirror of libplacebo

C 597 79 Updated Feb 27, 2025

Collect every awesome work about r1!

Python 236 6 Updated Mar 3, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,323 171 Updated Feb 14, 2025

🥧 Savoury implementation of the QUIC transport protocol and HTTP/3

Rust 9,915 768 Updated Mar 3, 2025

"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"

Python 411 44 Updated Mar 2, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 1,287 127 Updated Jul 15, 2024

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 405 29 Updated Feb 14, 2025

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 195 20 Updated Feb 24, 2025

A Training-free Iterative Framework for Long Story Visualization

Python 806 111 Updated Jan 18, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,185 447 Updated Mar 1, 2025

Yet another SIP003 plugin based on IETF-QUIC

Rust 124 15 Updated Dec 31, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 378 25 Updated Mar 3, 2025

A pipeline parallel training script for diffusion models.

Python 594 58 Updated Feb 27, 2025

A PyTorch native library for large model training

Python 3,383 299 Updated Mar 3, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,900 157 Updated Feb 10, 2025

Diffusion-based Portrait and Animal Animation

Python 684 60 Updated Jan 13, 2025

📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉

195 12 Updated Jan 16, 2025

[CVPR'25] Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 482 32 Updated Feb 27, 2025

Taming Stable Diffusion for Lip Sync!

Python 2,766 407 Updated Jan 19, 2025
Python 500 30 Updated Jan 20, 2025

Helpful tools and examples for working with flex-attention

Python 666 36 Updated Feb 18, 2025
Next