Lists (26)
Sort Name ascending (A-Z)
AI-Agent
AI模型相关
AI训练数据集
AI部署
DevOps开发
IOT开发
live2d模型
my-ideas
VIDEO开发
云桌面
云游戏
前端开发
后端开发
图像处理
大模型
学习集合
嵌入式开发
工具集合
数字人
数字人-framework
数字人-引擎
数字人-模型生成
数字人-语音
机器人开发
视频会议
计算机图形学
Stars
A community-maintained Python framework for creating mathematical animations.
Multilingual Voice Understanding Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
The easiest way to translate your NextJs apps.
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Speech To Speech: an effort for an open-sourced and modular GPT4-o
A high-performance runtime framework for modern robotics.
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信,三十行即可自定义个人号机器人。
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Python Wechaty is a Conversational RPA SDK for Chatbot Makers written in Python
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…
E2M API, converting everything to markdown (LLM-friendly Format).