otey247

otey247

17 followers · 12 following

Achievements

Stars

Audio AI

85 repositories

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,496 4,293 Updated Aug 19, 2024

teticio / audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 730 70 Updated Sep 25, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,216 2,185 Updated Nov 11, 2024

PABannier / bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech generation

C++ 751 62 Updated Nov 16, 2024

C0untFloyd / bark-gui

🔊 Text-Prompted Generative Audio Model with Gradio

Python 682 64 Updated Nov 23, 2023

JonathanFly / bark

🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model

Jupyter Notebook 999 91 Updated Oct 21, 2023

katspaugh / wavesurfer.js

Audio waveform player

TypeScript 8,912 1,643 Updated Dec 25, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,340 8,758 Updated Dec 1, 2024

affige / genmusic_demo_list

a list of demo websites for automatic music generation research

645 43 Updated Dec 25, 2024

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,896 70 Updated Jan 4, 2024

rsxdalv / tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)

TypeScript 1,907 207 Updated Dec 17, 2024

rsxdalv / bark-speaker-directory

Site for sharing Bark voices

TypeScript 49 Updated Jul 2, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,550 310 Updated Jan 4, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,070 868 Updated Jul 6, 2024

e-johnstonn / wingmanAI

Real-time transcription of audio, integrated with ChatGPT for interactive use. Save, load, and append transcripts for effective context management in conversations.

Python 439 41 Updated Jun 6, 2023

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,564 799 Updated Dec 13, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,346 4,466 Updated Aug 16, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,106 1,413 Updated Dec 20, 2024

Nutlope / notesGPT

Record voice notes & transcribe, summarize, and get tasks

TypeScript 1,796 291 Updated Jul 3, 2024

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,135 163 Updated Dec 6, 2024

ArchitectIndustries / bark.infinity

🚀 BARK INFINITY 🎶 Power Up The Bark Text-prompted Generative Audio Model

Python 14 2 Updated May 1, 2023

OpenTalker / SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,136 2,253 Updated Jun 26, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,779 758 Updated Jun 24, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python 30,191 2,986 Updated Dec 24, 2024

collabora / WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Python 2,234 297 Updated Dec 18, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 37,337 4,241 Updated Dec 19, 2024

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 25,453 3,708 Updated Nov 24, 2024

iamaziz / llm-voice-bot

Speak (speech-to-text) to Ollama LLMs in any lanaguage - Streamlit app

Python 39 8 Updated Feb 27, 2024

crhung / Voice-Emotion-Detector

Voice Emotion Detector that detects emotion from audio speech using one dimensional CNNs (convolutional neural networks) using keras and tensorflow on Jupyter Notebook.

Jupyter Notebook 104 35 Updated Mar 21, 2018

unconv / plagiarist-gpt

Fine-tune ChatGPT to write lyrics of your favorite artist

Python 8 2 Updated Aug 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

otey247

Achievements

Achievements

Block or report otey247

Audio AI

suno-ai / bark

teticio / audio-diffusion

facebookresearch / audiocraft

PABannier / bark.cpp

C0untFloyd / bark-gui

JonathanFly / bark

katspaugh / wavesurfer.js

openai / whisper

affige / genmusic_demo_list

archinetai / audio-ai-timeline

rsxdalv / tts-generation-webui

rsxdalv / bark-speaker-directory

facebookresearch / encodec

AIGC-Audio / AudioGPT

e-johnstonn / wingmanAI

pyannote / pyannote-audio

coqui-ai / TTS

speechbrain / speechbrain

Nutlope / notesGPT

linto-ai / whisper-timestamped

ArchitectIndustries / bark.infinity

OpenTalker / SadTalker

jasonppy / VoiceCraft

myshell-ai / OpenVoice

collabora / WhisperLive

RVC-Boss / GPT-SoVITS

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

iamaziz / llm-voice-bot

crhung / Voice-Emotion-Detector

unconv / plagiarist-gpt