Depth2World
  • University of Science and Technology of China

This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,836 211 Updated Feb 19, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,062 485 Updated Nov 5, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,081 2,114 Updated Feb 1, 2025

CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving

Python 53 1 Updated Oct 30, 2024
69 2 Updated Sep 14, 2024

DriveBench: A Comprehensive Benchmark for Evaluating Large Vision-Language Models on Autonomous Driving

2 Updated Nov 26, 2024

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Python 591 26 Updated Dec 16, 2023

Waymo Open Dataset

Python 2,828 627 Updated Dec 2, 2024

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 495 67 Updated Feb 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 41,176 5,055 Updated Feb 20, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,682 486 Updated Feb 20, 2025

[ECCV 2024] Embodied Understanding of Driving Scenarios

Python 175 12 Updated Jan 2, 2025
Python 94 2 Updated Dec 22, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,874 4,249 Updated Feb 21, 2025
Python 87 7 Updated Feb 20, 2025

A recipe for online RLHF and online iterative DPO.

Python 484 47 Updated Dec 28, 2024

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Python 131 4 Updated Dec 17, 2024

LLM inference in C/C++

C++ 74,855 10,818 Updated Feb 20, 2025

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 294 12 Updated Dec 7, 2024

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python 51 Updated Jan 16, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,660 128 Updated Jan 17, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,386 199 Updated Aug 11, 2024

Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Python 301 15 Updated Dec 26, 2024

Composable building blocks to build Llama Apps

Python 7,287 876 Updated Feb 21, 2025

O1 Replication Journey

1,950 62 Updated Jan 14, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 348 13 Updated Jan 19, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,090 488 Updated Jan 16, 2025
8 1 Updated Sep 24, 2024