Bing0cv

Follow

Bing0 Bing0cv

Follow

1 follower · 1 following

Stars

29 stars written in Python

d2l-ai / d2l-zh

《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 66,088 11,275 Updated Jul 30, 2024

Z4nzu / hackingtool

ALL IN ONE Hacking Tool For Hackers

Python 51,696 5,581 Updated Jul 31, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,676 5,688 Updated Feb 25, 2025

state-spaces / mamba

Mamba SSM architecture

Python 14,055 1,225 Updated Jan 18, 2025

yzhao062 / pyod

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,868 1,390 Updated Jan 13, 2025

jianchang512 / clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

Python 8,042 838 Updated Dec 7, 2024

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,804 774 Updated Feb 11, 2024

hluwa / frida-dexdump

A frida tool to dump dex in memory to support security engineers analyzing malware.

Python 4,124 913 Updated Mar 4, 2023

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,004 427 Updated Feb 20, 2025

riffusion / riffusion-hobby

Stable diffusion for real-time music generation

Python 3,543 411 Updated Jul 22, 2024

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Python 2,861 440 Updated Oct 12, 2023

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,091 323 Updated Nov 14, 2023

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,951 557 Updated Oct 27, 2023

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,606 115 Updated Jul 5, 2024

lucidrains / soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,479 91 Updated Oct 31, 2024

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,405 252 Updated Dec 2, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,311 104 Updated Sep 24, 2023

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 1,108 133 Updated Feb 18, 2025

xcmyz / FastSpeech

The Implementation of FastSpeech based on pytorch.

Python 866 213 Updated Jul 6, 2023

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 729 68 Updated Feb 9, 2025

LAION-AI / audio-dataset

Audio Dataset for training CLAP and other models

Python 666 56 Updated Feb 5, 2024

mir-aidj / all-in-one

All-In-One Music Structure Analyzer

Python 502 69 Updated May 9, 2024

GitYCC / g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Python 305 37 Updated Oct 20, 2024

ASLP-lab / OSUM

西北工业大学ASLP实验室OSUM项目官方库

Python 283 14 Updated Feb 24, 2025

hayeong0 / DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

Python 210 22 Updated Jul 31, 2024

tencent-ailab / MuQ

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 135 6 Updated Jan 9, 2025

BakerBunker / FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python 86 7 Updated Jul 4, 2024

yuboona / some-script-to-help-using-Montreal-Forced-Aligner

Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textgrid files.

Python 15 1 Updated Feb 9, 2024

kkksuper / kkcode

It is some useful script for personal daily work.

Python 1 1 Updated Dec 13, 2024