Stars
[ECCV2024] Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
The official Implementation of "VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection" [CVPR 2024]
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.
Calculates the extrinsic calibration between a Navtech radar and a 3D (Velodyne) lidar
[AAAI 2025] RCTrans: Radar-Camera Transformer via Radar Densiffer and Sequential Decoder for 3D Object Detection
About [CVPR 2024] The official implementation of paper " Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving"
[NeurIPS 2024] RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar
Implementation of "TrackFormer: Multi-Object Tracking with Transformers”. [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method …
[SCIS] SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model
HE-Drive: Human-Like End-to-End Driving with Vision Language Models
V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion
Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.
[ECAI 2024] TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
[ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
[IEEE T-IV] This is the official implementation of Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection
CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.