Skip to content
View oryosu's full-sized avatar

Block or report oryosu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Local realtime voice AI

Python 2,280 129 Updated Mar 3, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,767 336 Updated Jan 7, 2025

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 2,125 257 Updated Apr 12, 2025

Voice Conversion With Just Nearest Neighbors

Python 484 67 Updated Mar 18, 2024

A high-quality speech analysis, manipulation and synthesis system

C++ 1,225 257 Updated Feb 21, 2025

Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.

C++ 3,819 585 Updated Apr 17, 2025

Cross-platform audio I/O library in pure Rust

Rust 3,020 413 Updated Apr 17, 2025

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 918 108 Updated Aug 7, 2024

UT-Sarulab MOS prediction system using SSL models

Python 225 14 Updated Apr 11, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,825 2,306 Updated Mar 13, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,505 543 Updated Apr 23, 2025

WebRTC SFU Sora Unity SDK

C++ 76 13 Updated Apr 16, 2025

a Hassle-Free Python Experience

Rust 14,150 474 Updated Apr 23, 2025

Modern audio compression for the internet.

C 2,544 663 Updated Apr 22, 2025

singing voice change based on whisper, and lora for singing voice clone

Python 637 77 Updated Nov 3, 2023

The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training

Python 42 8 Updated Dec 17, 2024

Unofficial implementation of SpecTNT in pytorch

Python 45 4 Updated Oct 14, 2022
Python 174 20 Updated May 22, 2023

ImageBind One Embedding Space to Bind Them All

Python 8,616 807 Updated Jul 31, 2024
Jupyter Notebook 510 42 Updated Jul 10, 2024

A modern, cross-platform, multi-threaded, and general purpose filesystem and disk-usage utility that is aware of .gitignore and hidden file rules.

Rust 2,461 66 Updated May 19, 2024

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 812 98 Updated Sep 30, 2021

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,546 4,445 Updated Aug 19, 2024

🔗 Command Line Apps Script Projects

TypeScript 4,889 444 Updated Mar 26, 2025

A timeline of the latest AI models for audio generation, starting in 2023!

1,899 71 Updated Jan 4, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 28,887 4,065 Updated Nov 24, 2024
Python 29 7 Updated Dec 14, 2021

Port of OpenAI's Whisper model in C/C++

C++ 39,465 4,144 Updated Apr 23, 2025

Audio generation using diffusion models, in PyTorch.

Python 2,036 173 Updated Jun 12, 2023

Neural network-based singing voice synthesis library for research

Python 715 83 Updated Oct 9, 2023
Next