Skip to content
View Qidian213's full-sized avatar
  • Beijing University of Posts and Telecommunications
  • Beijing, China

Block or report Qidian213

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

具身智能入门指南 Embodied-AI-Guide

1,815 91 Updated Feb 10, 2025

🧑‍🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

3,269 371 Updated Feb 10, 2025

[ICCV'23] Hidden Biases of End-to-End Driving Models

Python 308 25 Updated Feb 10, 2025

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,067 70 Updated Feb 5, 2025

[NeurIPS 2024] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

Python 362 26 Updated Jan 17, 2025

[ICRA 2025] Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving (expert-level performance on Waymax)

Python 36 3 Updated Jan 28, 2025

Fully open reproduction of DeepSeek-R1

Python 18,654 1,567 Updated Feb 11, 2025

VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning

Python 24 2 Updated Dec 16, 2024

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Python 591 26 Updated Dec 16, 2023

🚀 「大模型」3小时从0训练27M参数的视觉多模态VLM!🌏 Train a 27M-parameter VLM from scratch in just 3 hours!

Python 945 95 Updated Feb 10, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 8,473 858 Updated Feb 10, 2025

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Python 72 4 Updated Dec 10, 2024

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 276 7 Updated Jul 9, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,986 498 Updated Feb 10, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,712 2,592 Updated Feb 6, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

340 9 Updated Jan 17, 2025

[NeurIPS 2024] Behavioral Topology (BeTop), a multi-agent behavior formulation for interactive motion prediction and planning

Python 96 5 Updated Nov 12, 2024

Code for "Heterogeneous Graph Transformer" (WWW'20), which is based on Deep Graph Library (DGL)

Python 72 13 Updated Aug 17, 2022

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Python 57 2 Updated Jan 23, 2025

Official Github Repo for GEM

10 Updated Dec 10, 2024

VQ-Map[NeurIPS 2024]

Python 24 Updated Jan 6, 2025

A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.

C++ 28 6 Updated Jan 17, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,429 464 Updated Feb 11, 2025

[CVPR 2024] A world model for autonomous driving.

Python 334 9 Updated Dec 7, 2023

A library for advanced large language model reasoning

Python 1,820 160 Updated Feb 6, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,709 467 Updated Sep 25, 2024

The official Meta Llama 3 GitHub site

Python 28,256 3,267 Updated Jan 26, 2025

Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"

Python 151 13 Updated Jan 15, 2025
Next