Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLAs, embodied agents, and VLMs.

Jupyter Notebook 86 Updated Jan 10, 2025

Papers and Datasets about Point Cloud.

Python 2,576 306 Updated Aug 30, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,235 343 Updated Jan 14, 2025

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Python 582 26 Updated Dec 16, 2023

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Python 914 50 Updated Dec 2, 2024

[ACL’24 Findings] Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

35 Updated Aug 26, 2024

Refine high-quality datasets and visual AI models

Python 9,079 591 Updated Jan 16, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

14,074 1,414 Updated Feb 13, 2023

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 6,633 396 Updated Jan 14, 2025

A curated list of awesome AIGC 3D papers

608 22 Updated Dec 18, 2024

A collection of papers on diffusion models for 3D generation.

941 39 Updated Dec 16, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,545 861 Updated Jan 13, 2025

Famous Vision Language Models and Their Architectures

Markdown 565 31 Updated Sep 8, 2024

A curated list of awesome 3D generation papers

1,111 54 Updated Mar 9, 2023

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,820 89 Updated Jan 15, 2025

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 612 33 Updated Oct 22, 2024

Awesome-LLM: a curated list of Large Language Models

20,625 1,686 Updated Jan 13, 2025

Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world

1,385 85 Updated Dec 16, 2024

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,651 357 Updated Jan 11, 2025

This is a PyTorch implementation of PointMetaBase, proposed in our paper "Meta Architecture for Point Cloud Analysis"

Python 90 9 Updated Mar 14, 2023

Point Transformers

Python 662 102 Updated Mar 13, 2024

[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization

Python 54 2 Updated Nov 10, 2023

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,583 485 Updated May 31, 2024

Open3D: A Modern Library for 3D Data Processing

C++ 11,791 2,358 Updated Jan 13, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,738 4,056 Updated Jul 17, 2024

[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models

284 17 Updated Jan 16, 2025

Video Graph Transformer for Video Question Answering (ECCV'22)

Python 46 12 Updated Jun 8, 2023

A collection of CVPR 2024 papers and open-source projects

18,720 2,621 Updated Jul 4, 2024

ChatReviewer: uses ChatGPT to analyze a paper's strengths and weaknesses and suggest improvements

Python 1,319 116 Updated Nov 22, 2024

AI education materials for Chinese students, teachers and IT professionals.

HTML 13,710 2,939 Updated May 16, 2024