Skip to content
View Yewandou7's full-sized avatar

Highlights

  • Pro

Block or report Yewandou7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

450 18 Updated Mar 14, 2025

A web client for ScreenAgent: Let Large Models Control Your Desktop

Vue 38 5 Updated Aug 16, 2024

ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)

Python 421 43 Updated Nov 25, 2024

No fortress, purely open ground. OpenManus is Coming.

Python 40,963 6,892 Updated Mar 30, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,216 247 Updated Mar 30, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,393 812 Updated Mar 1, 2025

PyTorch implementation of Pointnet2/Pointnet++

Python 1,607 359 Updated Mar 21, 2025

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,077 455 Updated Jan 22, 2025

-游戏文字交流AI嘴强王者工具

JavaScript 1,385 86 Updated Feb 3, 2025

[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

Python 56 1 Updated Sep 4, 2024

3D Occupancy Prediction Benchmark in Autonomous Driving

Python 349 21 Updated May 27, 2024

Spatial Sparse Convolution Library

Python 2,001 374 Updated Dec 15, 2024

Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"

Python 330 33 Updated Feb 3, 2025

Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)

Python 1,148 228 Updated Oct 15, 2024

CVPR 2023: Official code for `Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting'

Python 226 23 Updated Apr 11, 2024

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Python 2,528 457 Updated Jul 31, 2024

script for downloading nuscenes

Python 84 13 Updated Nov 4, 2024

An open-source overseas graduate application information-sharing platform for ShanghaiTech University

CSS 88 2 Updated Mar 12, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,080 246 Updated Mar 30, 2025

Collect some World Models for Autonomous Driving (and Robotic) papers.

869 30 Updated Mar 28, 2025

Prioritize Alignment in Dataset Distillation

Python 20 2 Updated Dec 3, 2024

macOS system monitor in your menu bar

Swift 30,266 955 Updated Mar 30, 2025

[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"

Python 192 18 Updated Dec 29, 2024

⏰ AI conference deadline countdowns

JavaScript 5,797 1,006 Updated Sep 15, 2024

A list of papers and datasets about point cloud analysis (processing) since 2017. Update every day!

1,551 191 Updated Apr 10, 2024

awesome-autonomous-driving

885 87 Updated Aug 19, 2024

[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective

351 27 Updated Mar 28, 2025

An easy calibration toolbox for VECtor Benchmark

C++ 27 3 Updated Jan 17, 2024
Next