Stars
Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
A comprehensive list of papers about Robot Manipulation, including papers, codes, and related websites.
📄 🇨🇳 📃 Papers Notebook: paper reading notes (Distributed Systems, Virtualization, Machine Learning)
📄 A collection of résumé templates suited to Chinese (LaTeX, HTML/JS, and so on), maintained by @hoochanlon
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer
Open-source and strong foundation image recognition models.
Fully open reproduction of DeepSeek-R1
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
Ego4D dataset repository. Download the dataset, visualize it, extract features, and see example usage of the dataset
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
A collection of resources related to NVIDIA Isaac Sim
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
🔥 SpatialVLA: a spatial-enhanced vision-language-action model trained on 1.1 million real robot episodes.
[Lumina Embodied AI Community] Embodied AI Technical Guide (Embodied-AI-Guide)
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
Embodied Reasoning Question Answer (ERQA) Benchmark
[ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs
RLBench_ACT: Running ALOHA ACT and Diffusion Policy in the RLBench Framework
Sourcetrail - free and open-source interactive source explorer
📚 A collection of edge/contour/boundary detection papers and toolbox.