- Hangzhou, China
- http://qinxuye.me
- @qinxuye
Stars
Build GUI for your Python program with JavaScript, HTML, and CSS
Dynamic batching for Speech Enhancement, Speech Tokenizer and TTS.
A generative world for general-purpose robotics & embodied AI learning.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
On-device Diffusion Models for Apple Silicon
🔊 Text-Prompted Generative Audio Model
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
stackblitz-labs / bolt.diy
Forked from stackblitz/bolt.newPrompt, run, edit, and deploy full-stack web applications using any LLM you want!
📷 EasyPhoto | Your Smart AI Photo Generator.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Prompt, run, edit, and deploy full-stack web applications
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Efficient and easy multi-instance LLM serving
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Fast parallel LLM inference for MLX
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。