Stars
Official implementation of "DepthLab: From Partial to Complete"
This is a list of awesome paper about optical flow and related work.
Real-time dense scene reconstruction with SLAM3R
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
[NeurIPS 2024] AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild. ECCV 2022.
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
[ICRA'2024] MonoOcc: Digging into Monocular Semantic Occupancy Prediction
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
A Survey on Vision-Language Geo-Foundation Models (VLGFMs)
[ECCV 2024] DoubleTake: Geometry Guided Depth Estimation
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
Code to easily try 30 (and growing) different image matching methods
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
Learning Text-Enhanced Urban Region Profiling with Contrastive Language-Image Pre-Training
Official repository for TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization
「TCSVT」A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization