hmf21

🎯

Focusing

Mengfan He hmf21

🎯

Focusing

38 followers · 199 following

Tsinghua University
Beijing

Achievements

Lists (26)

Sort

Starred repositories

Drexubery / ViewCrafter

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,145 43 Updated Nov 6, 2024

zju3dv / MatchAnything

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

740 21 Updated Jan 14, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,437 464 Updated Feb 11, 2025

fferflo / statewide-visual-geolocalization

Statewide Visual Geolocalization in the Wild (ECCV 2024)

Python 61 4 Updated Dec 2, 2024

uzh-rpg / rpg_dvs_ros

ROS packages for DVS

C++ 303 156 Updated May 15, 2024

jessemelpolio / AnytimeCL

[ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification

Python 18 2 Updated Oct 17, 2024

songxf1024 / GIMS

Graph-Based Image Matching System

33 Updated Dec 26, 2024

ai4ce / CityWalker

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Python 36 2 Updated Jan 10, 2025

SJTU-ViSYS / M2DGR

M2DGR： a Multi-modal and Multi-scenario Dataset for Ground Robots(RA-L2021 & ICRA2022)

940 122 Updated Nov 23, 2024

pablovela5620 / mini-dust3r

Python 203 12 Updated Oct 25, 2024

cvg / GeoCalib

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 529 32 Updated Dec 20, 2024

ruili3 / awesome-dust3r

🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.

478 13 Updated Jan 24, 2025

PKU-VCL-3DV / SLAM3R

Real-time dense scene reconstruction with SLAM3R

Python 408 12 Updated Jan 6, 2025

AggMan96 / Safe-Net

Code for Safe-Net

Python 5 Updated May 27, 2023

yejy53 / EP-BEV

[ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.

Python 61 2 Updated Feb 11, 2025

NVlabs / CF-3DGS

Python 470 54 Updated Aug 22, 2024

lucidrains / LVMAE-pytorch

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

Python 47 1 Updated Nov 25, 2024

OpenGVLab / VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 573 68 Updated Oct 8, 2024

alinlab / s-clip

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 47 6 Updated May 26, 2023

naver / mast3r

Grounding Image Matching in 3D with MASt3R

Python 1,683 129 Updated Jan 2, 2025

prs-eth / RollingDepth

Video Depth without Video Models

Python 441 16 Updated Dec 9, 2024

EdoardoBotta / RQ-VAE-Recommender

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 111 12 Updated Feb 11, 2025

Mabel0403 / CAMP

[🎉IEEE TGRS'24] The official code for paper "CAMP: A Cross-View Geo-Localization Method using Contrastive Attributes Mining and Position-aware Partitioning"

Python 9 Updated Nov 1, 2024

UAV-AVL / Benchmark

UAV Visual Localization

Python 15 2 Updated Jan 11, 2025

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 5,782 623 Updated Sep 20, 2024

WHU-USI3DV / OSMLoc

[Arxiv'24] OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Seman- tic Guidances

47 Updated Nov 20, 2024

rpl-cmu / bevloc

[IROS 2024] BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis

Python 52 3 Updated Oct 18, 2024

BICLab / Spike-Driven-Transformer-V3

Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training" (IEEE T-PAMI2025)

Python 28 3 Updated Feb 10, 2025

GaoShuang98 / CVCities

[IEEE JSTARS 2024] CV-Cities: Advancing Cross-view Geo-localization in Global Cities

Python 24 Updated Jan 9, 2025

facebookresearch / OrienterNet

Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"

Python 475 50 Updated Sep 7, 2024

Mengfan He hmf21

Lists (26)

Cross View Match

Multi Modal Sensors

3D Reconstruction

Image Matching

Foundation Model

Visual Place Recognition

Event Camera

Continue Learning

Navigation

Dataset

Satellite Imagery Comprehension

Neural Map

Generation Model

UAV Localization

SNN

Competition

Lesson

Daily Life

Hardware Setting

Deep Learning Tips

Scene Coordinates Regression

Useful Utility

Uncertanty

VIO

Equivariant Networks

Deep Dense Image Alignment

Starred repositories

visual-place-recognition

Linux