AI
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures. It outputs the cleaned image/video files at the original resolution and runs locally, with no third-party API required.
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
A natural language interface for computers
An open-source Windows GUI tool that recognizes speech in videos and automatically generates SRT subtitle files.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
faster_whisper GUI with PySide6
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
A ComfyUI workflow and model management extension to organize and manage all your workflows and models in one place. Seamlessly switch between workflows, as well as import, export workflows, reuse s…
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
Workflow-to-APP, ScreenShare & FloatingVideo, GPT & 3D, SpeechRecognition & TTS
Translate a video from one language to another and add dubbing. Also supports speech-recognition transcription, speech synthesis, and subtitle translation.
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
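OpenAI API management & distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (文心一言), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, deploys in one click, and works out of the box. OpenAI key management & redistributi…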
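智能微秘书 (Smart WeChat Secretary): an all-round WeChat bot management platform and the simplest way to connect to ChatGPT, FastGPT, Dify, Coze, and 扣子. Supports image generation, speech recognition, voice messages, and scheduled tasks, and works with WeCom, WeChat Official Accounts, 5G Messaging, and WhatsApp.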
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Automate Creation of YouTube Shorts using MoviePy.
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
The official gpt4free repository | a collection of various powerful language models
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
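A Dify-based WeCom (企业微信) knowledge-base bot that automatically replies to messages received in WeCom. The bot handles both private and group chats and remembers its conversations with each user, so its replies fit the context. You can also set a whitelist to control which users or groups the bot interacts with. For your own Dify web-version bot, contact WX: aiwis99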
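An LLM-based intelligent customer-service chat tool. It supports integration with WeChat, Pinduoduo, Qianniu, Bilibili, Douyin enterprise accounts, Douyin, Doudian, Weibo chat, Xiaohongshu professional-account operations, Xiaohongshu, Zhihu, and other platforms. It offers a choice of GPT3.5/GPT4.0/懒人百宝箱 (more platforms will be supported later), handles text, voice, and images, accesses external resources such as the operating system and the internet through plugins, and supports building custom enterprise AI applications on your own knowledge base.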