Lists (14)
Sort Name ascending (A-Z)
Stars
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
リアルタイムボイスチェンジャー Realtime Voice Changer
Industry leading face manipulation platform
😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装)
FongMi影视和tvbox配置文件,如果喜欢,请Fork自用。使用前请仔细阅读仓库说明,一旦使用将被视为你已了解。
🌩「自选优选 IP」测试 Cloudflare CDN 延迟和速度,获取最快 IP !当然也支持其他 CDN / 网站 IP ~
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Real time interactive streaming digital human
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
The fastest digital human algorithm, now on your desktop.
A feature-rich command-line audio/video downloader
SD.Next: All-in-one for AI generative image
Official inference repo for FLUX.1 models
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Multilingual Voice Understanding Model
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Instant voice cloning by MIT and MyShell.
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
阅读3服务器版,桌面端,iOS可用。后端 Kotlin + Spring Boot + Vert.x + Coroutine ;前端 Vue.js + Element。麻烦点点star,关注一下公众号【假装大佬】❗️
Port of OpenAI's Whisper model in C/C++
Robust Speech Recognition via Large-Scale Weak Supervision