Skip to content
View echo-hmwang's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Sha Tin, Hong Kong SAR

Highlights

  • Pro

Block or report echo-hmwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
158 results for source starred repositories
Clear filter

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Python 271 61 Updated May 23, 2023

Official repo for consistency models.

Python 6,217 425 Updated Mar 22, 2024

Awesome-LLM: a curated list of Large Language Model

19,631 1,623 Updated Dec 26, 2024

TODO

Python 37 8 Updated Nov 1, 2023

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 195 26 Updated Sep 13, 2024

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 220 34 Updated Dec 20, 2022

Differentiable SDE solvers with GPU support and efficient sensitivity analysis.

Python 1,594 202 Updated May 25, 2024

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

Python 233 30 Updated Feb 3, 2022

A collection of Beamer themes from the community

1,379 118 Updated Nov 13, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,256 837 Updated Dec 26, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 923 111 Updated Sep 5, 2024

Generative models for conditional audio generation

Python 2,792 265 Updated Nov 5, 2024

Release for Improved Denoising Diffusion Probabilistic Models

Python 3,356 494 Updated Jul 18, 2024

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Python 760 112 Updated May 22, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 26,838 5,520 Updated Dec 26, 2024

Variational auto-encoders for audio

Python 114 20 Updated May 20, 2020

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,608 1,061 Updated Oct 9, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,791 324 Updated Jul 14, 2024

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,025 61 Updated Jul 20, 2024

A collection of resources and papers on Diffusion Models

HTML 11,268 952 Updated Aug 1, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,648 130 Updated Sep 19, 2023

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Python 224 30 Updated Jul 13, 2022

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 6,813 1,080 Updated Jun 13, 2024

Intro to Reinforcement Learning (强化学习纲要)

3,264 489 Updated Jul 25, 2020

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,616 591 Updated May 31, 2024

Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)

Python 96 7 Updated Jul 8, 2024

Effective Data Augmentation With Diffusion Models

Python 226 18 Updated Jun 18, 2024

[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Python 744 92 Updated Mar 1, 2024

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 544 76 Updated Dec 21, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,682 3,409 Updated Dec 21, 2024
Next