Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 67 2 Updated Nov 8, 2024

Pang-Yatian / Pang-Yatian.github.io

HTML 1 Updated Dec 18, 2024

PKU-YuanGroup / Next-Patch-Prediction

Python 81 2 Updated Dec 23, 2024

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,555 483 Updated May 31, 2024

z-x-yang / Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 2,886 343 Updated Apr 25, 2024

uzh-rpg / rpg_vid2e

Open source implementation of CVPR 2020 "Video to Events: Recycling Video Dataset for Event Cameras"

Python 332 82 Updated Mar 4, 2024

clash-verge-rev / clash-verge-rev

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

TypeScript 43,056 3,327 Updated Dec 25, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,563 60 Updated Dec 23, 2024

TimoStoff / event_utils

Collection of event based vision utility functions

Python 180 31 Updated Jan 29, 2021

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 516 27 Updated Dec 18, 2024

ZcsrenlongZ / Deblur4DGS

[arXiv 2024] Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video

20 1 Updated Dec 10, 2024

KwaiVGI / SynCamMaster

[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 298 7 Updated Dec 11, 2024

sony / genwarp

Python 239 20 Updated Sep 26, 2024

ldkong1205 / OpenESS

[CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies

47 1 Updated Apr 27, 2024

ldkong1205 / Calib3D

[WACV 2025] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding

Python 46 3 Updated Mar 26, 2024

dnvtmf / SK_GS

The official implementation of "Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis"

Python 10 Updated Dec 21, 2024

924973292 / DeMo

【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification

Python 17 1 Updated Dec 26, 2024

924973292 / MambaPro

【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

Python 20 Updated Dec 19, 2024

baaivision / See3D

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 563 13 Updated Dec 21, 2024

AsuradaYuci / CLIMB-ReID

CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification（AAAI2025）

12 Updated Dec 10, 2024

Chaoran Feng SuperFCR

Highlights

Organizations

Lists (5)

🔮 Future ideas

🔥MyTools

🍭 Paper Reading

Tools

🧸 Toturial

Stars