Stars
Code for "Zero-shot Improvement of Object Counting with CLIP"
The official implementation of the crowd counting model CLIP-EBC.
[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model
[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting
This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.
Official implementation of "Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting"
[AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting
URL phishing detection using Generative Adversarial Network (GAN)
A curated list of neural network pruning resources.
(RA-L & IROS 2020) Cross-view Semantic Segmentation for Sensing Surroundings.
Files for a tutorial to train SegNet for road scenes using the CamVid dataset
[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation
SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images. http://panoptic-bev.cs.uni-freiburg.de
Drone-based Joint Density Map Estimation, Localization and Tracking with Space-Time Multi-Scale Attention Network
PSGCNet: A Pyramidal Scale and Global Context Guided Network for Dense Object Counting in Remote-Sensing Images
Counting from Sky: A Large-scale Dataset for Remote Sensing Object Counting and A Benchmark Method
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
LaRa: Latents and Rays for Multi-Camera Bird’s-Eye-View Semantic Segmentation
PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
[CVPR 2023] Practical Network Acceleration with Tiny Sets
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
Repository for the paper "I See You: A Vehicle-Pedestrian Interaction Dataset from Traffic Surveillance Cameras", presented at the LXAI workshop at NeurIPS 2022. This dataset captures critical vehicl…