[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Jupyter Notebook 320 33 Updated Aug 12, 2024

instantX-research / InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Jupyter Notebook 1,736 108 Updated Sep 18, 2024

PRIV-Creation / Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

969 27 Updated Dec 31, 2024

yinboc / liif

Learning Continuous Image Representation with Local Implicit Image Function, in CVPR 2021 (Oral)

Python 1,294 146 Updated Aug 21, 2021

THUDM / SwissArmyTransformer

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 1,047 97 Updated Dec 26, 2024

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,368 965 Updated Jan 20, 2025

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,964 1,162 Updated Jul 30, 2024

YangLing0818 / VideoTetris

[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation

Python 212 6 Updated Nov 4, 2024

sczhou / Upscale-A-Video

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Python 1,090 56 Updated Sep 27, 2024

IceClear / StableSR

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Python 2,297 150 Updated Jul 12, 2024

rinongal / textual_inversion

Jupyter Notebook 2,945 284 Updated Feb 27, 2023

zqh0253 / SceneWiz3D

[CVPR 2024] SceneWiz3D: Towards Text-guided 3D Scene Composition

96 5 Updated May 4, 2024

zqh0253 / BerfScene

[CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

Python 42 3 Updated May 7, 2024

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,179 991 Updated Nov 18, 2024

3DTopia / LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,770 125 Updated Aug 20, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,157 2,327 Updated Aug 12, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 4,119 305 Updated Nov 13, 2024

VAST-AI-Research / TripoSR

Python 4,790 560 Updated Aug 16, 2024

weepiess / StyleFlow-Content-Fixed-I2I

Python 83 15 Updated Aug 26, 2023

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,555 353 Updated Jun 28, 2024

TencentARC / T2I-Adapter

T2I-Adapter

Python 3,563 214 Updated Jun 21, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,884 881 Updated Jul 31, 2024

ShijieZhou-UCLA / feature-3dgs

[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

C++ 418 28 Updated Oct 17, 2024

openxrlab / xrsfm

OpenXRLab Structure-from-Motion Toolbox and Benchmark

C++ 202 23 Updated Jul 31, 2024

openxrlab / xrlocalization

OpenXRLab Visual Localization Toolbox and Server

Python 211 24 Updated Oct 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HSID

Block or report HSID

Stars

zju3dv / StarGen

Genesis-Embodied-AI / Genesis

baaivision / See3D

vuer-ai / feature-splatting-inria

vuer-ai / feature-splatting

VQAssessment / DOVER