HTML 1 Updated Dec 15, 2024

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 135 6 Updated Jan 9, 2025

Official repository of the OSUM project from the ASLP Lab at Northwestern Polytechnical University

Python 282 14 Updated Feb 24, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,004 427 Updated Feb 20, 2025
60 4 Updated Sep 13, 2024

All-In-One Music Structure Analyzer

Python 502 69 Updated May 9, 2024

Some useful scripts for personal daily work.

Python 1 1 Updated Dec 13, 2024

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

Python 210 22 Updated Jul 31, 2024

zero-shot voice conversion & singing voice conversion, with real-time support

Python 1,108 133 Updated Feb 18, 2025

Mamba SSM architecture

Python 14,055 1,225 Updated Jan 18, 2025

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python 86 7 Updated Jul 4, 2024

A frida tool to dump dex in memory to support security engineers analyzing malware.

Python 4,124 913 Updated Mar 4, 2023

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,606 115 Updated Jul 5, 2024

Make copying easier!

C# 918 62 Updated Feb 27, 2023

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations welcome

544 68 Updated Nov 13, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/

Python 7,804 774 Updated Feb 11, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 729 68 Updated Feb 9, 2025

Notices for 2024 computer science summer and winter camps for recommended postgraduate admission (baoyan)

1,575 113 Updated Jul 18, 2024

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,091 323 Updated Nov 14, 2023

Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)

Python 305 37 Updated Oct 20, 2024

Command line utility for forced alignment using Kaldi

Python 1,405 252 Updated Dec 2, 2024

Forced Alignment-MFA

37 1 Updated Jun 13, 2022

Some scripts to help with using Montreal Forced Aligner, mainly for transforming Hanzi characters to pinyin and extracting pause times from .textgrid files.

Python 15 1 Updated Feb 9, 2024

A python package to analyze and compare voices with deep learning

Python 2,861 440 Updated Oct 12, 2023

Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrat…

Perl 15 8 Updated Feb 9, 2024

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,868 1,390 Updated Jan 13, 2025

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 944 82 Updated Nov 4, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,675 5,687 Updated Feb 25, 2025

Stable diffusion for real-time music generation

Python 3,542 411 Updated Jul 22, 2024

Audio Dataset for training CLAP and other models

Python 666 55 Updated Feb 5, 2024