Skip to content
View crjc's full-sized avatar

Block or report crjc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
22 stars written in Python
Clear filter

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,854 4,472 Updated Jan 18, 2025

๐Ÿธ๐Ÿ’ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,381 4,664 Updated Aug 16, 2024

A generative speech model for daily dialogue.

Python 34,113 3,691 Updated Jan 25, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,723 3,075 Updated Jan 7, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 26,670 3,821 Updated Nov 24, 2024

๐Ÿฆ” PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.

Python 24,019 1,474 Updated Feb 6, 2025

Industry leading face manipulation platform

Python 21,292 3,225 Updated Feb 5, 2025

High-Resolution 3D Human Digitization from A Single Image.

Python 9,598 1,465 Updated Aug 19, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,373 1,251 Updated Feb 5, 2025

EmotiVoice ๐Ÿ˜Š: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,625 650 Updated Aug 13, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,852 1,016 Updated Aug 5, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 5,510 739 Updated Dec 24, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,398 479 Updated Aug 10, 2024

Foundational model for human-like, expressive TTS

Python 4,013 675 Updated Jul 30, 2024

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 2,064 328 Updated Feb 5, 2025

Using neural networks to build an automatic number plate recognition system

Python 1,849 697 Updated Nov 14, 2019

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,274 122 Updated Apr 24, 2024

[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"

Python 687 101 Updated Dec 12, 2024

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Python 457 43 Updated Jul 15, 2024

function calling-based LLM agents

Python 282 22 Updated Sep 16, 2024

Open Source and Free License Plate Recognition Software

Python 176 26 Updated Mar 17, 2023

Real time background replacement using DeepLabv3 MobileNetv2 model for person segmentation and OpenCV for image processing.

Python 70 17 Updated Sep 2, 2020