Mengfan He hmf21

🎯

Focusing

37 followers · 198 following

Tsinghua University
Beijing


Starred repositories

Drexubery / ViewCrafter

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,133 42 Updated Nov 6, 2024

zju3dv / MatchAnything

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

715 19 Updated Jan 14, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,383 460 Updated Jan 28, 2025

fferflo / statewide-visual-geolocalization

Statewide Visual Geolocalization in the Wild (ECCV 2024)

Python 61 4 Updated Dec 2, 2024

uzh-rpg / rpg_dvs_ros

ROS packages for DVS

C++ 302 155 Updated May 15, 2024

jessemelpolio / AnytimeCL

[ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification

Python 18 2 Updated Oct 17, 2024

songxf1024 / GIMS

Graph-Based Image Matching System

33 Updated Dec 26, 2024

ai4ce / CityWalker

CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Python 36 2 Updated Jan 10, 2025

SJTU-ViSYS / M2DGR

M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots (RA-L2021 & ICRA2022)

939 122 Updated Nov 23, 2024

pablovela5620 / mini-dust3r

Python 201 12 Updated Oct 25, 2024

cvg / GeoCalib

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 524 32 Updated Dec 20, 2024

ruili3 / awesome-dust3r

🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.

454 13 Updated Jan 24, 2025

PKU-VCL-3DV / SLAM3R

Real-time dense scene reconstruction with SLAM3R

Python 403 12 Updated Jan 6, 2025

AggMan96 / Safe-Net

Code for Safe-Net

Python 5 Updated May 27, 2023

yejy53 / EP-BEV

[ECCV 2024] The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network".

Python 59 2 Updated Jan 16, 2025

NVlabs / CF-3DGS

Python 467 54 Updated Aug 22, 2024

lucidrains / LVMAE-pytorch

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

Python 47 1 Updated Nov 25, 2024

OpenGVLab / VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 570 68 Updated Oct 8, 2024

alinlab / s-clip

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 47 6 Updated May 26, 2023

naver / mast3r

Grounding Image Matching in 3D with MASt3R

Python 1,658 126 Updated Jan 2, 2025

prs-eth / RollingDepth

Video Depth without Video Models

Python 437 16 Updated Dec 9, 2024

EdoardoBotta / RQ-VAE-Recommender

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 106 12 Updated Feb 5, 2025

Mabel0403 / CAMP

[🎉IEEE TGRS'24] The official code for paper "CAMP: A Cross-View Geo-Localization Method using Contrastive Attributes Mining and Position-aware Partitioning"

Python 9 Updated Nov 1, 2024

UAV-AVL / Benchmark

UAV Visual Localization

Python 14 2 Updated Jan 11, 2025

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 5,745 622 Updated Sep 20, 2024

WHU-USI3DV / OSMLoc

[Arxiv'24] OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances

47 Updated Nov 20, 2024

rpl-cmu / bevloc

[IROS 2024] BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis

Python 52 3 Updated Oct 18, 2024

BICLab / Spike-Driven-Transformer-V3

Official implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training" (IEEE T-PAMI 2025)

Python 26 3 Updated Jan 6, 2025

GaoShuang98 / CVCities

[IEEE JSTARS 2024] CV-Cities: Advancing Cross-view Geo-localization in Global Cities

Python 24 Updated Jan 9, 2025

facebookresearch / OrienterNet

Source code for the paper "OrienterNet: Visual Localization in 2D Public Maps with Neural Matching"

Python 475 50 Updated Sep 7, 2024

Lists (26)

3D Reconstruction

Competition

Continue Learning

Cross View Match

Daily Life

Dataset

Deep Dense Image Alignment

Deep Learning Tips

Equivariant Networks

Event Camera

Foundation Model

Generation Model

Hardware Setting

Image Matching

Lesson

Multi Modal Sensors

Navigation

Neural Map

Satellite Imagery Comprehension

Scene Coordinates Regression

SNN

UAV Localization

Uncertainty

Useful Utility

VIO

Visual Place Recognition
