-
Qualcomm AI Research
- rajeevyasarla.github.io/rajeev.github.io/
Stars
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
Latent Point Diffusion Models for 3D Shape Generation
This is an official implementation of our CVPR 2023 paper "Revealing the Dark Secrets of Masked Image Modeling" on Depth Estimation.
[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
Monocular Depth Estimation Toolbox based on MMSegmentation.
[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions
A curated list of publication for depth estimation
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
The official code for our ECCV22 oral paper: tracking objects as pixel-wise distributions.
End-to-End Object Detection with Transformers
Image augmentation for machine learning experiments.
[IGARSS'22]: A Transformer-Based Siamese Network for Change Detection
[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
Bringing Old Photo Back to Life (CVPR 2020 oral)
Densely Connected Pyramid Dehazing Network (CVPR'2018)