Skip to content
View Bing0cv's full-sized avatar

Block or report Bing0cv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
29 stars written in Python
Clear filter

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 66,088 11,275 Updated Jul 30, 2024

ALL IN ONE Hacking Tool For Hackers

Python 51,696 5,581 Updated Jul 31, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,676 5,688 Updated Feb 25, 2025

Mamba SSM architecture

Python 14,055 1,225 Updated Jan 18, 2025

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,868 1,390 Updated Jan 13, 2025

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Python 8,042 838 Updated Dec 7, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,804 774 Updated Feb 11, 2024

A frida tool to dump dex in memory to support security engineers analyzing malware.

Python 4,124 913 Updated Mar 4, 2023

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,004 427 Updated Feb 20, 2025

Stable diffusion for real-time music generation

Python 3,543 411 Updated Jul 22, 2024

A python package to analyze and compare voices with deep learning

Python 2,861 440 Updated Oct 12, 2023

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,091 323 Updated Nov 14, 2023

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,951 557 Updated Oct 27, 2023

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,606 115 Updated Jul 5, 2024

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,479 91 Updated Oct 31, 2024

Command line utility for forced alignment using Kaldi

Python 1,405 252 Updated Dec 2, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,311 104 Updated Sep 24, 2023

zero-shot voice conversion & singing voice conversion, with real-time support

Python 1,108 133 Updated Feb 18, 2025

The Implementation of FastSpeech based on pytorch.

Python 866 213 Updated Jul 6, 2023

Speech, Language, Audio, Music Processing with Large Language Model

Python 729 68 Updated Feb 9, 2025

Audio Dataset for training CLAP and other models

Python 666 56 Updated Feb 5, 2024

All-In-One Music Structure Analyzer

Python 502 69 Updated May 9, 2024

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Python 305 37 Updated Oct 20, 2024

西北工业大学ASLP实验室OSUM项目官方库

Python 283 14 Updated Feb 24, 2025

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

Python 210 22 Updated Jul 31, 2024

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 135 6 Updated Jan 9, 2025

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python 86 7 Updated Jul 4, 2024

Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textgrid files.

Python 15 1 Updated Feb 9, 2024

It is some useful script for personal daily work.

Python 1 1 Updated Dec 13, 2024