[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 894 50 Updated Mar 23, 2025

Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" (ICLR 2025 Oral)

Python 187 5 Updated Apr 8, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

505 15 Updated Apr 16, 2025

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,364 270 Updated Dec 27, 2024

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 605 43 Updated Mar 13, 2025

Official implementation of Continuous 3D Perception Model with Persistent State

Python 740 31 Updated Apr 11, 2025

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

Python 102 5 Updated Apr 17, 2025

[ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving

Python 26 2 Updated Feb 14, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,917 507 Updated Apr 2, 2025

[AAAI 2025] MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation

Python 23 Updated Apr 17, 2025

A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks

Python 105 8 Updated Apr 14, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 12,426 1,398 Updated Apr 17, 2025

This is the official implementation of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatial-Temporal Reasoning

Jupyter Notebook 18 Updated Nov 15, 2024

[CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation

Python 95 5 Updated Oct 8, 2024

Open Vocabulary Object Navigation

Python 70 7 Updated Feb 22, 2025
Python 37 2 Updated Sep 30, 2023

Code for training embodied agents using imitation learning at scale in Habitat-Lab

Python 39 7 Updated Apr 13, 2025

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Python 1,487 257 Updated Apr 12, 2025
Python 15 Updated Dec 30, 2024

QuadWBG: Generalizable Quadrupedal Whole-Body Grasping

21 Updated Nov 8, 2024

A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

Python 26 7 Updated Jan 19, 2025

Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".

Python 44 3 Updated Dec 27, 2024

[ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.

Python 92 5 Updated Jun 3, 2024

Clarity: A Minimalist Website Template for AI Research

CSS 110 14 Updated Jan 12, 2025

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 502 35 Updated Sep 23, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,748 11,730 Updated Apr 14, 2025

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

Python 128 6 Updated Oct 24, 2024
Python 59 6 Updated Jun 14, 2024

[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation

Python 311 11 Updated Apr 4, 2025