Skip to content
View ztianlin's full-sized avatar
  • Ecole Centrale Lyon
  • Lyon, France

Block or report ztianlin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

Python 748 38 Updated Jan 13, 2025

Vision-and-Language Navigation in Continuous Environments using Habitat

Python 395 59 Updated Jan 7, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 2,761 197 Updated Mar 28, 2025

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 574 43 Updated Feb 26, 2025

[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

Python 477 33 Updated Mar 18, 2025

[ECCV 2024] GenAD: Generative End-to-End Autonomous Driving

Python 379 43 Updated Jan 11, 2025

OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

Python 168 9 Updated May 31, 2024

official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*

Python 40 1 Updated Jan 10, 2025

[ECCV 2024] 3D World Model for Autonomous Driving

Python 427 29 Updated Apr 12, 2024

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 3,189 396 Updated Mar 31, 2025

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Python 779 101 Updated Sep 15, 2024

[CVPR 2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Python 66 5 Updated Mar 25, 2025

ViPlanner: Visual Semantic Imperative Learning for Local Navigation

Python 444 46 Updated Feb 12, 2025
Python 382 40 Updated Nov 14, 2024

[CVPR'25] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 852 46 Updated Mar 23, 2025

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Python 565 20 Updated Apr 1, 2025

open-sourced video dataset with dynamic scenes and camera movements annotation

18 Updated Mar 27, 2025
JavaScript 22 1 Updated Mar 26, 2025

official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"

Python 117 6 Updated Mar 26, 2025
Python 511 23 Updated May 24, 2024

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,864 117 Updated Mar 28, 2025

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Python 914 47 Updated Dec 9, 2024

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 3,254 1,525 Updated Apr 2, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,009 2,221 Updated Feb 1, 2025
Python 466 30 Updated Nov 26, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

7,873 502 Updated Apr 2, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,929 190 Updated Mar 24, 2025

[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

Python 168 6 Updated Oct 11, 2024
Next