:electron:
Focusing

Organizations

@ZJUEAI


📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 669 47 Updated Mar 12, 2025

Official implementation of TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 309 6 Updated Mar 11, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 30,148 4,627 Updated Mar 12, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 993 73 Updated Mar 11, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,487 1,661 Updated Feb 26, 2025

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,677 102 Updated Mar 12, 2025

3D Gaussian Splatting (3DGS) extension for Omniverse

Python 10 Updated Mar 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 8,067 837 Updated Mar 7, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,263 788 Updated Mar 1, 2025

[CVPR 2024] MemFlow: Optical Flow Estimation and Prediction with Memory

Python 151 11 Updated Jan 11, 2025

[CVPR 2025] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 720 47 Updated Mar 11, 2025

Official implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

Python 282 31 Updated Sep 20, 2023

An ML research template with good documentation by Boyuan Chen, an MIT PhD student

Python 62 4 Updated Mar 4, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,018 248 Updated Mar 9, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,642 94 Updated Mar 7, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 6,154 423 Updated Mar 12, 2025
Python 471 29 Updated Mar 6, 2025

https://huyenchip.com/ml-interviews-book/

HTML 3,639 556 Updated Jun 12, 2024

[CVPR 2025] Official repository for “MagicArticulate: Make Your 3D Models Articulation-Ready”

Python 221 1 Updated Feb 27, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,680 275 Updated Feb 19, 2025

Stereo4D data processing pipeline

Jupyter Notebook 63 1 Updated Mar 11, 2025

[CVPR 2024] Memory-based Adapters for Online 3D Scene Perception

Python 111 4 Updated Sep 22, 2024

News: the 10k dataset is ready for download.

HTML 396 6 Updated Jan 19, 2025

Fillerbuster: Multi-View Scene Completion for Casual Captures

Jupyter Notebook 92 4 Updated Feb 13, 2025

Seeing World Dynamics in a Nutshell

94 1 Updated Feb 6, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,999 242 Updated Mar 7, 2025

DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)

Python 71 1 Updated Mar 4, 2025