jfsantos

João Felipe Santos jfsantos

270 followers · 120 following

Achievements

x2 x2

Achievements

x2 x2

Organizations

Stars

declare-lab / TangoFlux

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 539 46 Updated Jan 12, 2025

yuhanghe01 / RiTTA

Event Relation in Text-to-Audio (TTA) Generation

Python 16 Updated Jan 2, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 6,715 408 Updated Jan 9, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,533 925 Updated Jan 14, 2025

naver-ai / usdm

Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)

Python 69 1 Updated Dec 3, 2024

apple / ml-tarflow

Python 91 2 Updated Dec 17, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,748 71 Updated Jan 2, 2025

FunAudioLLM / InspireMusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 296 26 Updated Dec 27, 2024

imxtx / awesome-controllabe-speech-synthesis

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

108 3 Updated Jan 14, 2025

shinjiwlab / versa

Versatile Evaluation of Speech and Audio

Python 145 11 Updated Dec 31, 2024

NVIDIA / NeMo-speech-data-processor

A toolkit for processing speech data and creating speech datasets

Python 103 22 Updated Jan 10, 2025

LTH14 / mage

A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

Python 546 26 Updated Mar 10, 2023

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,233 66 Updated Sep 27, 2024

pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 31,366 2,039 Updated Jan 14, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,390 227 Updated Jan 14, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,907 1,173 Updated Jan 14, 2025

jacobgil / vit-explain

Explainability for Vision Transformers

Python 886 103 Updated Mar 12, 2022

microsoft / P.808

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…

HTML 212 58 Updated May 23, 2024