Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,825 2,306 Updated Mar 13, 2025

llvm / torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,505 543 Updated Apr 23, 2025

shiguredo / sora-unity-sdk

WebRTC SFU Sora Unity SDK

C++ 76 13 Updated Apr 16, 2025

astral-sh / rye

a Hassle-Free Python Experience

Rust 14,150 474 Updated Apr 23, 2025

xiph / opus

Modern audio compression for the internet.

C 2,544 663 Updated Apr 22, 2025

PlayVoice / lora-svc

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

Python 637 77 Updated Nov 3, 2023

WangHelin1997 / MaskSpec

PyTorch implementation of the paper "Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training"

Python 42 8 Updated Dec 17, 2024

MWM-io / SpecTNT-pytorch

Unofficial implementation of SpecTNT in PyTorch

Python 45 4 Updated Oct 14, 2022

rkmt / summarize_arxv

Python 174 20 Updated May 22, 2023

facebookresearch / ImageBind

ImageBind: One Embedding Space to Bind Them All

Python 8,616 807 Updated Jul 31, 2024

Vaibhavs10 / fast-whisper-finetuning

Jupyter Notebook 510 42 Updated Jul 10, 2024

solidiquis / erdtree

A modern, cross-platform, multi-threaded, and general purpose filesystem and disk-usage utility that is aware of .gitignore and hidden file rules.

Rust 2,461 66 Updated May 19, 2024

AndreyGuzhov / AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 812 98 Updated Sep 30, 2021

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,546 4,445 Updated Aug 19, 2024

google / clasp

🔗 Command Line Apps Script Projects

TypeScript 4,889 444 Updated Mar 26, 2025

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,899 71 Updated Jan 4, 2024

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 28,887 4,065 Updated Nov 24, 2024

kamepong / ConvS2S-VC

Python 29 7 Updated Dec 14, 2021

ggml-org / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 39,465 4,144 Updated Apr 23, 2025

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,036 173 Updated Jun 12, 2023

nnsvs / nnsvs

Neural network-based singing voice synthesis library for research

Python 715 83 Updated Oct 9, 2023

ryosuke sonobe oryosu

Stars

menloresearch / ichigo

ufal / whisper_streaming

yxlllc / DDSP-SVC

bshall / knn-vc

mmorise / World

google / oboe

RustAudio / cpal

gemelo-ai / vocos

sarulab-speech / UTMOS22

facebookresearch / audiocraft