- National University of Singapore
- Singapore
- https://yl3800.github.io
Stars
Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLAs, embodied agents, and VLMs.
Papers and Datasets about Point Cloud.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
[ACL’24 Findings] Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
Refine high-quality datasets and visual AI models
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
A collection of papers on diffusion models for 3D generation.
✨✨Latest Advances on Multimodal Large Language Models
Famous Vision Language Models and Their Architectures
A curated list of awesome 3D generation papers
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
Awesome-LLM: a curated list of Large Language Models
Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world
Flexible and powerful tensor operations for readable and reliable code (for PyTorch, JAX, TF and others)
This is a PyTorch implementation of PointMetaBase, proposed in our paper "Meta Architecture for Point Cloud Analysis"
[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Open3D: A Modern Library for 3D Data Processing
Code and documentation to train Stanford's Alpaca models, and generate the data.
[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
Video Graph Transformer for Video Question Answering (ECCV'22)
ChatReviewer: uses ChatGPT to analyze the strengths and weaknesses of papers and suggest improvements
AI education materials for Chinese students, teachers and IT professionals.