Skip to content
View BinWang28's full-sized avatar

Highlights

  • Pro

Organizations

@USC-MCL @emnlp-2023 @SeaEval @NLGPerson @AudioLLMs

Block or report BinWang28

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Open-source RL System from ByteDance Seed and Tsinghua AIR

792 27 Updated Mar 20, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 602 24 Updated Mar 25, 2025

A collection of recent open-source math datasets for training and evaluating Math LLMs

5 Updated Mar 24, 2025

A Survey on Efficient Reasoning for LLMs

133 2 Updated Mar 24, 2025

Fully open data curation for reasoning models

Python 1,579 135 Updated Mar 16, 2025

Fully open reproduction of DeepSeek-R1

Python 23,273 2,116 Updated Mar 24, 2025
Python 561 18 Updated Mar 14, 2025

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

Python 1,689 193 Updated Jan 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,060 976 Updated Mar 24, 2025
Python 5 Updated Dec 16, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 12,330 1,241 Updated Mar 25, 2025

Neural Code Intelligence Survey 2024; Reading lists and resources

257 13 Updated Mar 19, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,523 2,393 Updated Mar 17, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,234 277 Updated Nov 5, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,635 549 Updated Mar 25, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,461 2,747 Updated Mar 25, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 9,118 927 Updated Mar 20, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,863 160 Updated Mar 19, 2025

Latest Advances on System-2 Reasoning

Python 843 33 Updated Mar 25, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

Python 7,890 1,328 Updated Mar 25, 2025

The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.

Python 201 19 Updated Mar 17, 2025

llama-omni训练代码复现

Python 57 7 Updated Jan 23, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,864 196 Updated Nov 14, 2024

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 11,092 1,143 Updated Mar 25, 2025

[ArXiv 2024] Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models

Python 4 Updated Dec 17, 2024

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,328 199 Updated Mar 8, 2025

A series of technical report on Slow Thinking with LLM

Python 589 31 Updated Mar 21, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 667 44 Updated Mar 21, 2025

[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Python 31 4 Updated Feb 27, 2025
Next