WhiteFu

speech synthesis & voice conversion & speech enhancement

44 followers · 442 following

Lists (1)

🔮 Future ideas

Starred repositories

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI Community] Embodied AI technical guide (Embodied-AI-Guide)

4,442 275 Updated Apr 11, 2025

Hongcheng-Gao / Awesome-Long2short-on-LRMs

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

187 4 Updated Apr 17, 2025

0russwest0 / Agent-R1

Python 380 21 Updated Apr 18, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,549 277 Updated Mar 1, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,209 60 Updated Feb 8, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,303 153 Updated Mar 20, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,205 726 Updated Apr 4, 2025

Dereck0602 / Awesome_Test_Time_LLMs

92 5 Updated Mar 12, 2025

open-thought / system-2-research

System 2 Reasoning Link Collection

826 73 Updated Mar 16, 2025

microsoft / rStar

Python 518 48 Updated Apr 15, 2025

tencent-ailab / MuCodec

Python 81 4 Updated Nov 22, 2024

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,469 259 Updated Apr 10, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,383 123 Updated Jan 2, 2025

facebookresearch / spiritlm

Inference code for the paper "Spirit-LM: Interleaved Spoken and Written Language Model".

Python 900 59 Updated Oct 28, 2024

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,698 833 Updated Apr 18, 2025

imxtx / awesome-controllable-speech-synthesis

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

136 5 Updated Apr 18, 2025

quchangle1 / LLM-Tool-Survey

This is the repository for the Tool Learning survey.

356 14 Updated Mar 4, 2025

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 613 48 Updated Jan 20, 2025

DAMO-NLP-SG / VideoLLaMA3

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 741 50 Updated Apr 17, 2025

hkchengrex / MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,315 166 Updated Apr 14, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,985 66 Updated Jan 14, 2025

pengr / LLM-Synthetic-Data

Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥

249 20 Updated Jan 24, 2025

MiniMax-AI / MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,522 191 Updated Apr 10, 2025

LLMQuant / quant-wiki

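We are committed to open-sourcing quantitative finance knowledge and translating it into Chinese, closing the information gap between the domestic and international quantitative finance industries.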

1,286 93 Updated Apr 19, 2025

iGaoWei / BigDataView

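100+ eye-catching HTML5 big-screen templates for big-data visualization, covering industries such as community services, property management, government affairs, transportation, and finance and banking. Billed as the newest, largest, most complete, and flashiest collection of big-data visualization templates online. Updated continuously.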

JavaScript 3,917 1,148 Updated Jul 24, 2024

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch

Python 1,290 112 Updated Apr 13, 2025

LMM101 / Awesome-Multimodal-Next-Token-Prediction

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

418 9 Updated Jan 17, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 253 29 Updated Mar 12, 2025

qiye45 / wechatVideoDownload

Download tool for WeChat Channels, supporting videos, livestream replays, and live streams.

2,535 253 Updated Mar 13, 2025

ScottishFold007 / TTSAudioNormalizer

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 93 16 Updated Dec 20, 2024
