Highlights
- Pro
Stars
This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".
Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Post-processing for CREPE to turn f0 pitch estimates into discrete notes e.g. MIDI
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
A command line tool to fetch lyrics from spotify and save it to lrc file. It can fetch both synced and unsynced lyrics from spotify.
AI Audio Datasets (AI-ADS) π΅, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio aβ¦
Keep track of big models in audio domain, including speech, singing, music etc.
Community-maintained collection of scripts for REAPER
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
A multi-task learning example for the paper https://arxiv.org/abs/1705.07115
DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.
PyTorch Implementation of Mean-Variance Loss for age estimation.
Code for the ALiBi method for transformer language models (ICLR 2022)
Official PyTorch implementation of Contrastive Learning of Musical Representations
Emotional conditioned music generation using transformer-based model.
MIDI, WAV domain music emotion recognition [ISMIR 2021]
A straightforward collection of Music Generation research resources.
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
MIDI / symbolic music tokenizers for Deep Learning models πΆ
Utility functions for handling MIDI data in a nice/intuitive way.
Python MIDI track classifier and tonal tension calculation based on spiral array theory
a free python grammar checker πβ