Skip to content
View hmf21's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Tsinghua University
  • Beijing

Block or report hmf21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,133 42 Updated Nov 6, 2024

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

715 19 Updated Jan 14, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,383 460 Updated Jan 28, 2025

Statewide Visual Geolocalization in the Wild (ECCV 2024)

Python 61 4 Updated Dec 2, 2024

ROS packages for DVS

C++ 302 155 Updated May 15, 2024

[ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification

Python 18 2 Updated Oct 17, 2024

Graph-Based Image Matching System

33 Updated Dec 26, 2024

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Python 36 2 Updated Jan 10, 2025

M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots(RA-L2021 & ICRA2022)

939 122 Updated Nov 23, 2024
Python 201 12 Updated Oct 25, 2024

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 524 32 Updated Dec 20, 2024

🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.

454 13 Updated Jan 24, 2025

Real-time dense scene reconstruction with SLAM3R

Python 403 12 Updated Jan 6, 2025

Code for Safe-Net

Python 5 Updated May 27, 2023

[ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.

Python 59 2 Updated Jan 16, 2025
Python 467 54 Updated Aug 22, 2024

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

Python 47 1 Updated Nov 25, 2024

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 570 68 Updated Oct 8, 2024

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 47 6 Updated May 26, 2023

Grounding Image Matching in 3D with MASt3R

Python 1,658 126 Updated Jan 2, 2025

Video Depth without Video Models

Python 437 16 Updated Dec 9, 2024

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 106 12 Updated Feb 5, 2025

[🎉IEEE TGRS'24] The official code for paper "CAMP: A Cross-View Geo-Localization Method using Contrastive Attributes Mining and Position-aware Partitioning"

Python 9 Updated Nov 1, 2024

UAV Visual Localization

Python 14 2 Updated Jan 11, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 5,745 622 Updated Sep 20, 2024

[Arxiv'24] OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Seman- tic Guidances

47 Updated Nov 20, 2024

[IROS 2024] BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis

Python 52 3 Updated Oct 18, 2024

Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training" (IEEE T-PAMI2025)

Python 26 3 Updated Jan 6, 2025

[IEEE JSTARS 2024] CV-Cities: Advancing Cross-view Geo-localization in Global Cities

Python 24 Updated Jan 9, 2025

Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"

Python 475 50 Updated Sep 7, 2024
Next