Skip to content
View R3gm's full-sized avatar

Block or report R3gm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 435 93 Updated Nov 25, 2024

workflow orchestration UI and nodes editor for your own python codebase

TypeScript 38 2 Updated Oct 30, 2024
C# 280 27 Updated Sep 9, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,223 863 Updated Apr 8, 2025

C++ library for converting text to phonemes for Piper

C++ 114 90 Updated Mar 13, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 14,876 1,620 Updated Mar 25, 2025

I've been trying quite hard to use the IVONA Amy voice on Linux natively, from trying to reverse engineer the APK's and dll files, to hacking Waydroid to be compatible, and port forwarding/ssh via …

Shell 3 Updated Jun 30, 2023

All Algorithms implemented in Python

Python 199,280 46,557 Updated Apr 9, 2025

A Jupyter widgets-based interactive notebook for Google Colab to generate images using Stable Diffusion.

Jupyter Notebook 17 10 Updated Dec 13, 2023

fMRI-to-image reconstruction on the NSD dataset.

Jupyter Notebook 321 47 Updated May 22, 2024
Python 55 15 Updated Mar 13, 2024

a colab notebook repo for using Diffusers library (not a webui)

Jupyter Notebook 20 Updated Oct 2, 2023

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,392 144 Updated Feb 10, 2025

Godot Engine – Multi-platform 2D and 3D game engine

C++ 95,950 22,100 Updated Apr 9, 2025

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Python 245 32 Updated Jul 25, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,690 2,465 Updated Feb 10, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 8,778 2,693 Updated Aug 13, 2024

A curated list of open source projects used in nuclear science and engineering

374 72 Updated Aug 20, 2024

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

Python 987 102 Updated Aug 29, 2023

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 8,308 812 Updated Feb 13, 2025

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,568 2,352 Updated Jun 26, 2024

A Chess Bot powered by OpenAI's ChatGPT

Python 21 7 Updated Mar 7, 2024

A multi document reader and chatbot using LangChain and ChatGPT

Python 141 55 Updated Feb 7, 2024

template for duplicating and executing Hugging Face Spaces either on SM Studio Lab, Google Colab, or locally.

Jupyter Notebook 11 2 Updated Jan 9, 2023

📚 A collection of sketch based application papers.

620 62 Updated Mar 23, 2025

Panel: The powerful data exploration & web app framework for Python

Python 5,145 540 Updated Apr 8, 2025

A list of awesome beginners-friendly projects.

73,012 7,219 Updated Mar 21, 2025

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)

TypeScript 2,094 220 Updated Apr 2, 2025

A timeline of the latest AI models for audio generation, starting in 2023!

1,896 71 Updated Jan 4, 2024

Finetuning VITS Efficiently

Python 32 6 Updated Nov 6, 2023
Next