Lists (2)
Sort Name ascending (A-Z)
Stars
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Aether: Geometric-Aware Unified World Modeling
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)
JAX - A curated list of resources https://github.com/google/jax
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…
[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation
Collect and summarize point cloud sota methods.
Hackable and optimized Transformers building blocks, supporting a composable construction.
The earliest versions of the very first c compiler known to exist in the wild written by the late legend himself dmr.
[CVPR 25 (Highlight)] RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Solve Visual Understanding with Reinforced VLMs
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
Dot access dictionary with dynamic hierarchy creation and ordered iteration
Fully open reproduction of DeepSeek-R1