Skip to content
View kxgong's full-sized avatar

Highlights

  • Pro

Block or report kxgong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Wan: Open and Advanced Large-Scale Video Generative Models

Python 8,789 939 Updated Mar 20, 2025

Video-R1: Towards Super Reasoning Ability in Video Understanding MLLMs

Python 103 2 Updated Feb 23, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,746 501 Updated Mar 20, 2025

Understand Human Behavior to Align True Needs

Python 3,797 346 Updated Jul 20, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,584 365 Updated Mar 21, 2025

Bring portraits to life!

Python 14,379 1,551 Updated Feb 28, 2025

ComfyUI nodes for LivePortrait

Python 1,888 152 Updated Aug 5, 2024

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,925 430 Updated Feb 12, 2025

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

137 1 Updated Feb 19, 2025

Refine high-quality datasets and visual AI models

Python 9,298 607 Updated Mar 21, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 40,215 6,613 Updated Dec 9, 2024

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 16,790 3,500 Updated Oct 9, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,621 73 Updated Aug 15, 2024

LenslessFace : An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification

Python 21 1 Updated Jul 21, 2024

[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,519 157 Updated Dec 2, 2024

PyTorch native post-training library

Python 5,014 561 Updated Mar 20, 2025

A PyTorch native library for large model training

Python 3,476 318 Updated Mar 21, 2025

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,513 275 Updated Jan 12, 2025
Python 195 23 Updated Sep 4, 2023

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 25,605 3,727 Updated Feb 10, 2025

Grok open release

Python 50,251 8,363 Updated Aug 30, 2024

[CSUR] A Survey on Video Diffusion Models

2,026 105 Updated Mar 14, 2025

Official implementation of AnimateDiff.

Python 11,182 909 Updated Jul 31, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 623 36 Updated Oct 22, 2024

Collection of awesome test-time (domain/batch/instance) adaptation methods

887 64 Updated Mar 7, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,369 923 Updated Mar 21, 2025
17 Updated Oct 15, 2023

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,700 277 Updated Feb 15, 2023

Meta-Transformer for Unified Multimodal Learning

Python 1,578 118 Updated Dec 5, 2023

用 Express 和 Vue3 搭建的 ChatGPT 演示网页

Vue 31,914 11,224 Updated Aug 16, 2024
Next