Stars
Auto-scrubbing of articles and auto-likes in discourse
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
GUI for a Vocal Remover that uses Deep Neural Networks.
一款专注于Ai翻译的工具,一键自动翻译RPG SLG游戏,Epub TXT小说,Srt Vtt Lrc字幕,Word MD文档等等复杂长文本。
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
A generative speech model for daily dialogue.
Windows desktop front end for Spleeter - AI source separation
Deezer source separation library including pretrained models.
an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网
DeepSeek LLM: Let there be answers
一个用于重新生成 Cu*sor 设备 ID 和设置 Access Token 的 Cu*sor 扩展
「海外工具网站」已经是我人生主要事业了,很庆幸还来得及,感谢这个伟大的 AI 时代。
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
Easily train a good VC model with voice data <= 10 mins!
An open-source Chinese font derived from Fontworks' Klee One. 一款开源中文字体,基于 FONTWORKS 出品字体 Klee One 衍生。
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。