Skip to content
View HSID's full-sized avatar

Block or report HSID

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation", Arxiv 2025.

41 Updated Jan 13, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 23,089 1,909 Updated Jan 20, 2025

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 586 15 Updated Dec 21, 2024

Feature splatting based on INRIA GS rasterizer

Python 52 6 Updated Nov 4, 2024
Python 134 10 Updated Dec 23, 2024

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 320 33 Updated Aug 12, 2024

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Jupyter Notebook 1,736 108 Updated Sep 18, 2024

A collection of resources on controllable generation with text-to-image diffusion models.

969 27 Updated Dec 31, 2024

Learning Continuous Image Representation with Local Implicit Image Function, in CVPR 2021 (Oral)

Python 1,294 146 Updated Aug 21, 2021

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 1,047 97 Updated Dec 26, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,362 964 Updated Jan 20, 2025

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,964 1,162 Updated Jul 30, 2024

[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation

Python 212 6 Updated Nov 4, 2024

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Python 1,090 56 Updated Sep 27, 2024

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Python 2,297 150 Updated Jul 12, 2024
Jupyter Notebook 2,945 284 Updated Feb 27, 2023

[CVPR 2024] SceneWiz3D: Towards Text-guided 3D Scene Composition

96 5 Updated May 4, 2024

[CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

Python 42 3 Updated May 7, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,178 991 Updated Nov 18, 2024

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,770 125 Updated Aug 20, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,156 2,327 Updated Aug 12, 2024

Kolors Team

Python 4,118 305 Updated Nov 13, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,556 353 Updated Jun 28, 2024

T2I-Adapter

Python 3,562 214 Updated Jun 21, 2024

Official implementation of AnimateDiff.

Python 10,881 880 Updated Jul 31, 2024

[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

C++ 418 28 Updated Oct 17, 2024

OpenXRLab Structure-from-Motion Toolbox and Benchmark

C++ 202 23 Updated Jul 31, 2024

OpenXRLab Visual Localization Toolbox and Server

Python 211 24 Updated Oct 24, 2023
Next