Lists (1)
Sort Name ascending (A-Z)
Stars
AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。
English pronunciation correction teacher built with gemini
Profile-Based Long-Term Memory for AI Applications
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A framework for prompt tuning using Intent-based Prompt Calibration
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks
Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt
获取bilibili直播弹幕,使用WebSocket协议,支持web端和B站直播开放平台两种接口
A generative world for general-purpose robotics & embodied AI learning.
Task-Aware Agent-driven Prompt Optimization Framework
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
[LCLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
Port of OpenAI's Whisper model in C/C++
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Easegen is an open-source digital human course creation platform offering comprehensive solutions from course production and video management to intelligent quiz generation.Easegen 是一个开源的数字人课程制作平台,…