Stars
Highly performant and modular controls for node-based editors designed for data-binding and MVVM.
Voice receive extension package for discord.py
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Very simple Nintendo Game Boy emulator written in C# + Blazor, running in the web browser.
QR Code Scanner Blazor component
A powerful and extensible cross-platform .NET audio engine. Provides comprehensive audio processing capabilities including playback, recording, effects, analysis, and visualization, built with a mo…
PNGTuber app built on Avalonia.UI with Twitch integration and a ttspet
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
NarForum is a simple and flexible forum software built with .NET 8 and Blazor.
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
Self-supervised learning for fast pitch estimation
Easy-to-use stem (e.g. instrumental/vocals) separation from the CLI or as a Python package, using a variety of amazing pre-trained models (primarily from UVR)
.NET diagramming library for interactive flowcharts, org charts, design tools, planning tools, visual languages.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Web-based Process Visualization (SCADA/HMI/Dashboard) software
All generative models in one for a better TTS model
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Turn any webpage into structured data using LLMs
API for a Vocal Remover that uses Deep Neural Networks.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Large World Model -- Modeling Text and Video with Millions Context
⭐️ Companies that don't have a broken hiring process
Pitch Estimating Neural Networks (PENN)
Foundational model for human-like, expressive TTS
A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singing.