Starred repositories
Generating Chords from Melody with Flexible Harmonic Rhythm and Controllable Harmonic Density [EURASIP JASMP]
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
Implementation of Band Split Roformer, a SOTA attention network for music source separation from ByteDance AI Labs
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Core Engine of Singing Voice Conversion & Singing Voice Clone
Official inference repo for FLUX.1 models
Neural network-based singing voice synthesis library for research
Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation
Encode and decode audio samples to/from compressed latent representations!
Multi-task and multi-track music transcription for everyone
[ICLR 2023] "Dilated convolution with learnable spacings" Ismail Khalfaoui Hassani, Thomas Pellegrini and Timothée Masquelier
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
This is the accompanying repository for the paper "Automatic Estimation of Singing Voice Musical Dynamics"
The MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metrics to assess exact data replication of the training set.
Spotify scraper to extract all the information from Spotify and download MP3s with each song's cover
A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)
Python audio and music signal processing library
Hum2Song: Multi-track Polyphonic Music Generation from Voice Melody Transcription with Neural Networks
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Materials for the Hugging Face Diffusion Models Course
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…