Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
Official repo for consistency models.
Awesome-LLM: a curated list of Large Language Model
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Conditional Diffusion Probabilistic Model for Speech Enhancement
Differentiable SDE solvers with GPU support and efficient sensitivity analysis.
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
A collection of Beamer themes from the community
✨✨Latest Advances on Multimodal Large Language Models
Official PyTorch implementation of BigVGAN (ICLR 2023)
Generative models for conditional audio generation
Release for Improved Denoising Diffusion Probabilistic Models
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
A collection of resources and papers on Diffusion Models
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
A Collection of Variational Autoencoders (VAE) in PyTorch.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
Effective Data Augmentation With Diffusion Models
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.