runjiali-rl
:shipit:
  • University of Oxford
  • Oxford, United Kingdom

Sponsoring

@opencv

Highlights

  • Pro

Organizations

@torrvision


[CVPR 2025] VGGT: Visual Geometry Grounded Transformer

Python 3,566 219 Updated Mar 25, 2025

[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Python 858 39 Updated Mar 26, 2025

[ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation

Python 31 1 Updated Mar 9, 2025

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,237 45 Updated Nov 6, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 6,075 644 Updated Sep 20, 2024
274 7 Updated Sep 29, 2024

🐍 Geometric Computer Vision Library for Spatial AI

Python 10,348 1,008 Updated Mar 27, 2025

Official PyTorch Implementation for "SceneScape: Text-Driven Consistent Scene Generation"

Python 153 10 Updated Jun 14, 2023

A tiny, didactical implementation of LLAMA 3

Python 35 2 Updated Dec 2, 2024

[ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman

Python 23 1 Updated Jan 28, 2025

The Arcade Learning Environment (ALE) -- a platform for AI research.

C++ 2,247 439 Updated Feb 15, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,767 124 Updated Dec 6, 2024

[ICLR2025] Kolmogorov-Arnold Transformer

Python 739 43 Updated Mar 23, 2025

MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)

Python 2,280 40 Updated Mar 9, 2025

Official Implementation of Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics

Python 88 3 Updated Jan 16, 2025

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 446 26 Updated Mar 18, 2025

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 2,530 217 Updated Oct 27, 2024

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors

Python 37 4 Updated Sep 13, 2024

[3DV 2025]🐱🐶🐲🐮🐷Official Implementation of DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer

Python 66 8 Updated Mar 20, 2025

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 1,027 81 Updated Mar 11, 2025

A pure PyTorch implementation of 3D Gaussian Splatting

Python 370 37 Updated Jan 8, 2025

Examples and exercises for B16 - Algorithms.

C++ 8 87 Updated Feb 4, 2025

[CVPR 2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo: https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC

Python 451 20 Updated Sep 27, 2024

News: the 10k dataset is ready for download.

HTML 403 6 Updated Mar 21, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,293 56 Updated Mar 24, 2025

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 734 43 Updated Aug 5, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,020 2,416 Updated Aug 12, 2024

Rembg is a tool to remove image backgrounds

Python 18,440 1,979 Updated Mar 26, 2025
Python 18 Updated Jul 1, 2024