Highlights
- Pro
Stars
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Fast and memory-efficient exact attention
We propose a model to analyze sentiment of online stock forum and use the information to predict stock volatility in the Chinese market. By generating a sentimental dictionary, we analyze the senti…
Recently, realistic image generation using deep neural networks has become a hot topic in machine learning and computer vision. Such an image can be generated at pixel level by learning from a larg…
Enforcing temporal consistency in real-time per-frame semantic video segmentation
The official code for the paper 'Structured Knowledge Distillation for Semantic Segmentation'. (CVPR 2019 ORAL) and extension to other tasks.
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding b…
Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, oral).
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
ncnn is a high-performance neural network inference framework optimized for the mobile platform
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
This repo provides the source code for "Cross-Domain Adaptive Teacher for Object Detection".
PyTorch code for CVPR 2022 paper Unbiased Teacher v2 Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
aim-uofa / model-quantization
Forked from ziplab/QToolCollections of model quantization algorithms. Any issues, please contact Peng Chen ([email protected])
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
The PyTorch Implementation of F-ConvNet for 3D Object Detection