Skip to content
View wangxingjun778's full-sized avatar
  • Tongyi Lab, Alibaba Group
  • Hangzhou, China

Organizations

@embodied-agent

Block or report wangxingjun778

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A fork to add multimodal model training to open-r1

Python 920 49 Updated Feb 8, 2025

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 638 64 Updated Sep 19, 2024

This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…

275 24 Updated Jan 10, 2022

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,012 252 Updated Mar 3, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 6,651 558 Updated Mar 3, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,391 217 Updated Feb 28, 2025

Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 296 22 Updated Mar 1, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,645 427 Updated Mar 3, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 961 48 Updated Feb 28, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 10,994 746 Updated Mar 1, 2025

Solve Visual Understanding with Reinforced VLMs

Python 3,693 221 Updated Mar 3, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,271 580 Updated Feb 26, 2025

This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]

Python 59 5 Updated Feb 28, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,100 373 Updated Mar 3, 2025
Python 5 1 Updated Feb 18, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,039 311 Updated Mar 2, 2025

LIMO: Less is More for Reasoning

Python 803 35 Updated Feb 24, 2025

s1: Simple test-time scaling

Python 5,803 659 Updated Feb 23, 2025
Jupyter Notebook 400 32 Updated Jul 22, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,872 1,390 Updated Feb 1, 2025

Witness the aha moment of VLM with less than $3.

Python 2,980 238 Updated Mar 1, 2025

DeepSeek Coder: Let the Code Write Itself

Python 20,753 2,312 Updated May 21, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,526 2,169 Updated Feb 1, 2025

Collect every awesome work about r1!

Python 235 6 Updated Mar 2, 2025

Fully open data curation for reasoning models

Python 1,419 121 Updated Feb 23, 2025

Fully open reproduction of DeepSeek-R1

Python 21,927 1,954 Updated Mar 2, 2025

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 128 9 Updated Jul 17, 2024

ModelScope

Python 1 Updated Mar 3, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,803 513 Updated Mar 3, 2025
Next