  • University of Science and Technology of China

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,779 205 Updated Feb 19, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,053 485 Updated Nov 5, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,010 2,106 Updated Feb 1, 2025

CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving

Python 53 1 Updated Oct 30, 2024
69 2 Updated Sep 14, 2024

DriveBench: A Comprehensive Benchmark for Evaluating Large Vision-Language Models on Autonomous Driving

2 Updated Nov 26, 2024

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Python 591 26 Updated Dec 16, 2023

Waymo Open Dataset

Python 2,827 627 Updated Dec 2, 2024

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 492 65 Updated Feb 14, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,909 5,022 Updated Feb 18, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,622 484 Updated Feb 19, 2025

[ECCV 2024] Embodied Understanding of Driving Scenarios

Python 175 12 Updated Jan 2, 2025
Python 94 2 Updated Dec 22, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,816 4,244 Updated Feb 19, 2025
Python 87 7 Updated Feb 19, 2025

A recipe for online RLHF and online iterative DPO.

Python 482 47 Updated Dec 28, 2024

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Python 131 4 Updated Dec 17, 2024

LLM inference in C/C++

C++ 74,671 10,795 Updated Feb 18, 2025

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 291 12 Updated Dec 7, 2024

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Python 51 Updated Jan 16, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,639 128 Updated Jan 17, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,378 199 Updated Aug 11, 2024

Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Python 300 15 Updated Dec 26, 2024

Composable building blocks to build Llama Apps

Python 7,253 872 Updated Feb 19, 2025

O1 Replication Journey

1,949 62 Updated Jan 14, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 347 13 Updated Jan 19, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,087 488 Updated Jan 16, 2025
8 1 Updated Sep 24, 2024