🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,768 4,325 Updated Aug 19, 2024
Python 2 1 Updated Oct 22, 2019

CS229 Final Project

Jupyter Notebook 1 Updated Dec 10, 2022

Defending against Adversarial Audio via Diffusion Model (ICLR 2023)

Python 27 1 Updated Mar 2, 2023

Fine-tune Whisper model for Taiwanese speech recognition

Python 28 8 Updated Mar 23, 2023

Replace arXiv links by their corresponding bibliography in markdowns / Notion database

Python 22 2 Updated Aug 6, 2023

Faster Whisper transcription with CTranslate2

Python 1 Updated Mar 23, 2023

Transcribe a Collection of Waveform Audio Files using whisper_timestamped

Python 1 Updated Apr 20, 2023

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,200 166 Updated Dec 6, 2024

A not very efficient attempt to create a real time openai/whisper (Audio to Text Transcriber)

Python 3 1 Updated Mar 27, 2023

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021…

Python 217 18 Updated May 9, 2022

Physical Symbolic Optimization

Python 1,858 258 Updated Dec 6, 2024

📖 Paper reading list in conversational AI (constantly updating 🤗).

994 162 Updated Dec 31, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,181 706 Updated Dec 17, 2024

Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising diffusion, in Pytorch

Python 341 17 Updated Oct 14, 2023
Jupyter Notebook 180 32 Updated Feb 22, 2022
Python 1 1 Updated Mar 16, 2023

chinese speech pretrained models

Shell 1,065 90 Updated Aug 23, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,640 2,589 Updated Jan 7, 2025

Transcribing Speech with Multinomial Diffusion, training code and models.

Python 76 5 Updated Sep 27, 2023

Revisiting Denoising Diffusion Probabilistic Models for Speech Enhancement: Condition Collapse, Efficiency and Refinement, Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2023.

Python 35 3 Updated Dec 5, 2023

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Python 163 15 Updated Jul 14, 2023

Trainer for audio-diffusion-pytorch

Python 128 22 Updated Jan 13, 2023
Python 4 Updated Nov 8, 2022

Code for a paper exploring using diffusion models to defend neural networks against adversarial attacks

Jupyter Notebook 8 1 Updated Jan 12, 2024

Domain adaptation made easy. Fully featured, modular, and customizable.

Python 361 16 Updated Jan 30, 2023