-
Carnegie Mellon University
- https://www.linkedin.com/in/elvishelvisshi/
Highlights
- Pro
Stars
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
Papers and Datasets about Point Cloud.
Safe Local Motion Planning with Self-Supervised Freespace Forecasting, CVPR 2021
Implementation of the ECCV '22 paper, "Differentiable Raycasting for Self-supervised Occupancy Forecasting"
Paper reading notes on Deep Learning and Machine Learning
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
Official code for VisProg (CVPR 2023 Best Paper!)
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
Robust Speech Recognition via Large-Scale Weak Supervision
Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Source code for "Nonlinear 3D Face Morphable Model"
Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
PyTorch code and models for the DINOv2 self-supervised learning method.
[CVPR 2022] Pytorch implementation for “Debiased Learning from Naturally Imbalanced Pseudo-Labels”
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Painter & SegGPT Series: Vision Foundation Models from BAAI
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Hierarchy-based Image Embeddings for Semantic Image Retrieval
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.