Lists (26)
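Live2D models
Digital human - framework
Digital human - model generation
Digital human - engine
Digital human - speech
Backend development
Video conferencing
Cloud gaming
IoT development
Digital human
Robotics development
Cloud desktop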
AI-Agent
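Computer graphics
Large models
Embedded development
Frontend development
AI training datasets
Image processing
DevOps development
Learning collection
Tools collection
AI deployment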
my-ideas
Video development
AI model related
Stars
A community-maintained Python framework for creating mathematical animations.
Multilingual Voice Understanding Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
The easiest way to translate your NextJs apps.
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Speech To Speech: an effort for an open-sourced and modular GPT4-o
A high-performance runtime framework for modern robotics.
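A complete and graceful API for Wechat. A WeChat personal account interface, WeChat bot, and command-line WeChat; a custom personal-account bot takes only thirty lines of code.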
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Python Wechaty is a Conversational RPA SDK for Chatbot Makers written in Python
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…
E2M API, converting everything to markdown (LLM-friendly Format).