118 results for source starred repositories

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

108 3 Updated Dec 10, 2024

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Python 485 24 Updated Jan 8, 2025

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 1,897 141 Updated Aug 25, 2024

All-In-One Music Structure Analyzer

Python 484 63 Updated May 9, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,166 162 Updated Dec 6, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,279 1,440 Updated Jan 8, 2025

Post-processing for CREPE to turn f0 pitch estimates into discrete notes e.g. MIDI

Python 24 3 Updated Aug 14, 2023

Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

Python 18,324 1,651 Updated Dec 24, 2024

A command-line tool that fetches lyrics from Spotify and saves them to an LRC file. It can fetch both synced and unsynced lyrics.

Python 177 15 Updated Nov 28, 2023

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

580 41 Updated Nov 24, 2024

Keep track of big models in audio domain, including speech, singing, music etc.

466 28 Updated Sep 26, 2024

Community-maintained collection of scripts for REAPER

Lua 290 148 Updated Jan 10, 2025

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

C++ 957 68 Updated Sep 9, 2024

A multi-task learning example for the paper https://arxiv.org/abs/1705.07115

Jupyter Notebook 851 207 Updated Jul 6, 2020

DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.

Python 351 34 Updated Jun 11, 2020

PyTorch Implementation of Mean-Variance Loss for age estimation.

Python 67 14 Updated May 12, 2020

Code for the ALiBi method for transformer language models (ICLR 2022)

Python 508 38 Updated Oct 30, 2023

Official PyTorch implementation of Contrastive Learning of Musical Representations

Python 312 48 Updated Jul 25, 2024

Emotion-conditioned music generation using a transformer-based model.

Jupyter Notebook 146 18 Updated Oct 21, 2022

MIDI, WAV domain music emotion recognition [ISMIR 2021]

Python 75 11 Updated Oct 29, 2021

A straightforward collection of Music Generation research resources.

585 36 Updated Jan 7, 2025

Official PyTorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Python 188 35 Updated Apr 2, 2024

🎛 🔊 A Python library for audio.

C++ 5,313 270 Updated Nov 26, 2024

MIDI / symbolic music tokenizers for Deep Learning models 🎶

Python 715 86 Updated Dec 23, 2024

Utility functions for handling MIDI data in a nice/intuitive way.

Jupyter Notebook 889 155 Updated Dec 11, 2024

Python MIDI track classifier and tonal tension calculation based on spiral array theory

Python 104 22 Updated Jun 18, 2024

A free Python grammar checker 📝✅

Python 438 65 Updated Dec 26, 2024