118 results for source starred repositories

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

111 3 Updated Jan 14, 2025

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Python 486 24 Updated Jan 8, 2025

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 1,903 143 Updated Jan 15, 2025

All-In-One Music Structure Analyzer

Python 490 65 Updated May 9, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,195 166 Updated Dec 6, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,501 1,464 Updated Jan 25, 2025

Post-processing for CREPE to turn f0 pitch estimates into discrete notes e.g. MIDI

Python 24 3 Updated Aug 14, 2023

Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

Python 18,495 1,676 Updated Jan 18, 2025

A command-line tool to fetch lyrics from Spotify and save them to an LRC file. It can fetch both synced and unsynced lyrics.

Python 178 15 Updated Nov 28, 2023

AI Audio Datasets (AI-ADS) 🎡, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

603 46 Updated Jan 15, 2025

Keep track of big models in audio domain, including speech, singing, music etc.

469 28 Updated Sep 26, 2024

Community-maintained collection of scripts for REAPER

Lua 295 151 Updated Jan 18, 2025

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

C++ 964 68 Updated Sep 9, 2024

A multi-task learning example for the paper https://arxiv.org/abs/1705.07115

Jupyter Notebook 852 208 Updated Jul 6, 2020

DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.

Python 354 34 Updated Jun 11, 2020

PyTorch Implementation of Mean-Variance Loss for age estimation.

Python 67 14 Updated May 12, 2020

Code for the ALiBi method for transformer language models (ICLR 2022)

Python 512 38 Updated Oct 30, 2023

Official PyTorch implementation of Contrastive Learning of Musical Representations

Python 313 48 Updated Jul 25, 2024

Emotion-conditioned music generation using a transformer-based model.

Jupyter Notebook 146 18 Updated Oct 21, 2022

MIDI, WAV domain music emotion recognition [ISMIR 2021]

Python 75 11 Updated Oct 29, 2021

A straightforward collection of Music Generation research resources.

591 36 Updated Jan 20, 2025

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Python 188 35 Updated Apr 2, 2024

🎛 🔊 A Python library for audio.

C++ 5,345 273 Updated Nov 26, 2024

MIDI / symbolic music tokenizers for Deep Learning models 🎢

Python 727 86 Updated Dec 23, 2024

Utility functions for handling MIDI data in a nice/intuitive way.

Jupyter Notebook 894 155 Updated Dec 11, 2024

Python MIDI track classifier and tonal tension calculation based on spiral array theory

Python 104 22 Updated Jun 18, 2024

a free python grammar checker 📝✅

Python 441 65 Updated Jan 21, 2025