Stars
RooVetGit / Roo-Code
Forked from cline/clineRoo Code (prev. Roo Cline) is a VS Code plugin that enhances coding with AI-powered automation, multi-model support, and experimental features
Everything that makes working with databases easier
👻 A creative editor based on fabricjs. 一款基于 fabricjs 的开源创意图片编辑器,旨在让开发者快速构建属于自己的图片编辑器。可应用于海报设计、小红书公众号封面设计、banner 设计等场景。
Create visualizations and csv files to view your dynamic creative performance in Facebook Ads
Simple and rapid application development framework, built on top of Flask. includes detailed security, auto CRUD generation for your models, google charts and much more. Demo (login with guest/welc…
Layer-Divider, an extension for stable-diffusion-webui using the segment-anything model (SAM)
Auto Facebook post system with Gemini by fetching RSS as source
This bot can comment to the Facebook posts automatically!
Secure and fast microVMs for serverless computing.
Question and Answer based on Anything.
Instant voice cloning by MIT and MyShell. Audio foundation model.
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
🎙️ - Clone Voice 🎵 - TTS 🔮 - Voice Conversion,其中TTS参数可调
🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.
Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | 和喜欢的角色沉浸式对话吧:ChatGLM2+声音克隆+视频对话
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
A live speech recognition using Facebooks wav2vec 2.0 model.
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,标注平台,自动化标注,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持R…
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…
Mocap Dataset of “Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation”
code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021
用 FastAPI 和 Vue3 搭建的 ChatGLM 网页 (前端样式仿照chatgpt-web, 支持chatglm流式输出、前端调整参数、上下文选择、保存图片、知识库问答等功能)
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation