Skip to content
View ANLGBOY's full-sized avatar
💥
Good vibes only
💥
Good vibes only

Highlights

  • Pro

Block or report ANLGBOY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 692 39 Updated Mar 5, 2025

🎛 🔊 A Python library for audio.

C++ 5,407 278 Updated Nov 26, 2024

Viterbi decoding in PyTorch

Python 27 4 Updated Feb 26, 2025

[ICLR 2024] Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation.

Python 4 2 Updated Apr 2, 2024

Code and materials for ICML2024 submission

Python 4 1 Updated Mar 28, 2024

An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)

Python 119 7 Updated Jul 30, 2024

Generative models for conditional audio generation

Python 2,942 294 Updated Feb 28, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 15,485 1,458 Updated Jan 19, 2025

Full models and training code for PESTO

Python 60 14 Updated Jun 12, 2024

Pitch Estimating Neural Networks (PENN)

Python 242 23 Updated Jul 31, 2024

Inference and training library for high-quality TTS models.

Python 5,104 538 Updated Dec 10, 2024

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,972 195 Updated Mar 5, 2025

This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.

32 Updated Jan 26, 2024

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Python 176 15 Updated Jul 14, 2023

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,650 672 Updated Mar 3, 2025

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,775 267 Updated Sep 15, 2024

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 178 13 Updated Mar 25, 2024

Micro Feature Architecture Best Practice

Swift 92 4 Updated Jul 28, 2023

pdfrx is yet another PDF viewer implementation that built on the top of PDFium. The plugin currently supports Android, iOS, Windows, macOS, Linux, and Web.

Dart 146 72 Updated Feb 20, 2025

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,357 142 Updated Feb 10, 2025

An Open-source Streaming High-fidelity Neural Audio Codec

Python 458 23 Updated Mar 4, 2025

Text-to-Audio/Music Generation

Python 2,382 187 Updated Sep 29, 2024

Convolutions for Sequence Modeling

Assembly 877 70 Updated Jun 13, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,620 2,265 Updated Jan 15, 2025
Jupyter Notebook 41 8 Updated Dec 7, 2023

BigVGAN with Neural Source-Filter

Python 51 7 Updated Sep 21, 2023

Pronounce English in Japanese way

Python 4 1 Updated Apr 1, 2023

speech self-supervised representations

Python 481 39 Updated Apr 27, 2023

INSTA - Instant Volumetric Head Avatars [CVPR2023]

C++ 475 41 Updated Mar 2, 2025
Next