Skip to content
View ivcylc's full-sized avatar
🎾
Coding
🎾
Coding

Block or report ivcylc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Python 4 2 Updated Nov 15, 2024
Python 102 4 Updated Apr 18, 2025

Generative models for conditional audio generation

Python 3,032 305 Updated Mar 21, 2025

Audiogen Codec

Python 135 12 Updated Jul 9, 2024

Perceptual Quality Estimator for speech and audio

C++ 766 131 Updated Aug 2, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,408 146 Updated Feb 10, 2025
5 Updated Feb 4, 2025
Python 2 1 Updated Nov 5, 2024

Shortcut flow matching Pytorch implementation

Python 33 3 Updated Dec 31, 2024

The official Implementation of PeriodWave and PeriodWave-Turbo

Python 186 12 Updated Apr 14, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 465 29 Updated Apr 7, 2025
Python 460 13 Updated Dec 5, 2024

easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Python 49 10 Updated Nov 19, 2019

🎛 🔊 A Python library for audio.

C++ 5,477 286 Updated Nov 26, 2024

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 701 18 Updated Apr 8, 2025

编译原理习题精选 陈意云 PDF

2 Updated Jan 4, 2025
HTML 6 2 Updated Nov 13, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,155 2,234 Updated Feb 1, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

515 24 Updated Apr 9, 2025

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 227 21 Updated Mar 7, 2025

[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"

Python 61 6 Updated Jan 17, 2025
Python 128 9 Updated Jan 20, 2025

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 3,309 337 Updated Apr 15, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,005 128 Updated Sep 5, 2024

Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models", presented in LAMIR 2024 Workshop

Python 40 3 Updated Nov 20, 2024

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 649 53 Updated Oct 1, 2024
Python 25 5 Updated Mar 28, 2024

🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

140 5 Updated Dec 26, 2024
Next