Skip to content
View WhiteFu's full-sized avatar

Block or report WhiteFu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[Lumina Embodied AI Community] 具身智能技术指南 Embodied-AI-Guide

4,442 275 Updated Apr 11, 2025

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

187 4 Updated Apr 17, 2025
Python 380 21 Updated Apr 18, 2025

Witness the aha moment of VLM with less than $3.

Python 3,549 277 Updated Mar 1, 2025

A fork to add multimodal model training to open-r1

Python 1,209 60 Updated Feb 8, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,303 153 Updated Mar 20, 2025

s1: Simple test-time scaling

Python 6,205 726 Updated Apr 4, 2025

System 2 Reasoning Link Collection

826 73 Updated Mar 16, 2025
Python 518 48 Updated Apr 15, 2025
Python 81 4 Updated Nov 22, 2024

Simple RL training for reasoning

Python 3,469 259 Updated Apr 10, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,383 123 Updated Jan 2, 2025

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 900 59 Updated Oct 28, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,698 833 Updated Apr 18, 2025

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

136 5 Updated Apr 18, 2025

This is the repository for the Tool Learning survey.

356 14 Updated Mar 4, 2025

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 613 48 Updated Jan 20, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 741 50 Updated Apr 17, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,315 166 Updated Apr 14, 2025

O1 Replication Journey

1,985 66 Updated Jan 14, 2025

Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥

249 20 Updated Jan 24, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,522 191 Updated Apr 10, 2025

我们致力于量化知识的开源与汉化,打破国内外量化金融行业信息差。

1,286 93 Updated Apr 19, 2025

100+套大数据可视化炫酷大屏Html5模板;包含行业:社区、物业、政务、交通、金融银行等,全网最新、最多,最全、最酷、最炫大数据可视化模板。陆续更新中

JavaScript 3,917 1,148 Updated Jul 24, 2024

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,290 112 Updated Apr 13, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

418 9 Updated Jan 17, 2025

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 253 29 Updated Mar 12, 2025

微信视频号下载工具,支持视频、直播回放、直播下载

2,535 253 Updated Mar 13, 2025

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 93 16 Updated Dec 20, 2024
Next