Stars
PyTorch implementation of the Levenberg-Marquardt training algorithm
Official implementation of the OneDiffusion paper
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Universal Monocular Metric Depth Estimation
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
From Coarse to Fine: Robust Hierarchical Localization at Large Scale with HF-Net (https://arxiv.org/abs/1812.03506)
A List of Recommender Systems and Resources
A curated list of awesome Recommender System resources (Books, Conferences, Researchers, Papers, Github Repositories, Useful Sites, Youtube Videos)
aider is AI pair programming in your terminal
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
An easy to use PyTorch to TensorRT converter
Real-time and accurate open-vocabulary end-to-end object detection
A curated publication list on open-vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).
✨✨Latest Advances on Multimodal Large Language Models
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
[3DV 2024 Oral] DeDoDe 🎶 Detect, Don't Describe --- Describe, Don't Detect, for Local Feature Matching
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
Code for "Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed", CVPR 2024