Skip to content
View hmf21's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Tsinghua University
  • Beijing

Block or report hmf21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,145 43 Updated Nov 6, 2024

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

740 21 Updated Jan 14, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,437 464 Updated Feb 11, 2025

Statewide Visual Geolocalization in the Wild (ECCV 2024)

Python 61 4 Updated Dec 2, 2024

ROS packages for DVS

C++ 303 156 Updated May 15, 2024

[ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification

Python 18 2 Updated Oct 17, 2024

Graph-Based Image Matching System

33 Updated Dec 26, 2024

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Python 36 2 Updated Jan 10, 2025

M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots(RA-L2021 & ICRA2022)

940 122 Updated Nov 23, 2024
Python 203 12 Updated Oct 25, 2024

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 529 32 Updated Dec 20, 2024

🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.

478 13 Updated Jan 24, 2025

Real-time dense scene reconstruction with SLAM3R

Python 408 12 Updated Jan 6, 2025

Code for Safe-Net

Python 5 Updated May 27, 2023

[ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.

Python 61 2 Updated Feb 11, 2025
Python 470 54 Updated Aug 22, 2024

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

Python 47 1 Updated Nov 25, 2024

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 573 68 Updated Oct 8, 2024

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 47 6 Updated May 26, 2023

Grounding Image Matching in 3D with MASt3R

Python 1,683 129 Updated Jan 2, 2025

Video Depth without Video Models

Python 441 16 Updated Dec 9, 2024

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 111 12 Updated Feb 11, 2025

[🎉IEEE TGRS'24] The official code for paper "CAMP: A Cross-View Geo-Localization Method using Contrastive Attributes Mining and Position-aware Partitioning"

Python 9 Updated Nov 1, 2024

UAV Visual Localization

Python 15 2 Updated Jan 11, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 5,782 623 Updated Sep 20, 2024

[Arxiv'24] OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Seman- tic Guidances

47 Updated Nov 20, 2024

[IROS 2024] BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis

Python 52 3 Updated Oct 18, 2024

Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training" (IEEE T-PAMI2025)

Python 28 3 Updated Feb 10, 2025

[IEEE JSTARS 2024] CV-Cities: Advancing Cross-view Geo-localization in Global Cities

Python 24 Updated Jan 9, 2025

Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"

Python 475 50 Updated Sep 7, 2024
Next