Stars

Audio AI

85 repositories

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,496 4,293 Updated Aug 19, 2024

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 730 70 Updated Sep 25, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,216 2,185 Updated Nov 11, 2024

Suno AI's Bark model in C/C++ for fast text-to-speech generation

C++ 751 62 Updated Nov 16, 2024

🔊 Text-Prompted Generative Audio Model with Gradio

Python 682 64 Updated Nov 23, 2023

🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model

Jupyter Notebook 999 91 Updated Oct 21, 2023

Audio waveform player

TypeScript 8,912 1,643 Updated Dec 25, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,340 8,758 Updated Dec 1, 2024

A list of demo websites for automatic music generation research

645 43 Updated Dec 25, 2024

A timeline of the latest AI models for audio generation, starting in 2023!

1,896 70 Updated Jan 4, 2024

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)

TypeScript 1,907 207 Updated Dec 17, 2024

Site for sharing Bark voices

TypeScript 49 Updated Jul 2, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,550 310 Updated Jan 4, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,070 868 Updated Jul 6, 2024

Real-time transcription of audio, integrated with ChatGPT for interactive use. Save, load, and append transcripts for effective context management in conversations.

Python 439 41 Updated Jun 6, 2023

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,564 799 Updated Dec 13, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,346 4,466 Updated Aug 16, 2024

A PyTorch-based Speech Toolkit

Python 9,106 1,413 Updated Dec 20, 2024

Record voice notes & transcribe, summarize, and get tasks

TypeScript 1,796 291 Updated Jul 3, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,135 163 Updated Dec 6, 2024

🚀 BARK INFINITY 🎶 Power Up The Bark Text-prompted Generative Audio Model

Python 14 2 Updated May 1, 2023

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,136 2,253 Updated Jun 26, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,779 758 Updated Jun 24, 2024

Instant voice cloning by MIT and MyShell.

Python 30,191 2,986 Updated Dec 24, 2024

A nearly-live implementation of OpenAI's Whisper.

Python 2,234 297 Updated Dec 18, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 37,337 4,241 Updated Dec 19, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 25,453 3,708 Updated Nov 24, 2024

Speak (speech-to-text) to Ollama LLMs in any language - Streamlit app

Python 39 8 Updated Feb 27, 2024

Voice emotion detector that recognizes emotion in speech audio using one-dimensional CNNs (convolutional neural networks), built with Keras and TensorFlow in a Jupyter Notebook.

Jupyter Notebook 104 35 Updated Mar 21, 2018

Fine-tune ChatGPT to write lyrics of your favorite artist

Python 8 2 Updated Aug 25, 2023