Stars
PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves the layout; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/Docker
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
A webui for propainter. Easily pick up objects from the video and eliminate them.
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Fork of https://github.com/Sanster/lama-cleaner
Build resilient language agents as graphs.
🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes.
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
One million ways to have fun with neural networks (title is machine-translated)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
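A chatbot built on large language models, with integrations for WeChat Official Accounts, WeCom apps, Feishu, DingTalk, and more. Selectable models include GPT3.5/GPT-4o/GPT-o1/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI. It can handle text, voice, and images, access the operating system and the internet, and supports building customized enterprise customer-service bots on your own knowledge base.
A simple proxy for the OpenAI/GPT API, deployed with a one-line Docker command; supports SSE streaming responses and Tencent Cloud Functions.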
A community-maintained Python framework for creating mathematical animations.
Self-hosted, high-quality voice recognition for de-Googled Android using Whisper. Like Siri or OK Google.
The Apple® Siri wave-form replicated in a JS library.
A proxy for Azure OpenAI API that can convert an OpenAI request into an Azure OpenAI request.
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Simple, unified interface to multiple Generative AI providers
Amphion-MaskGCT (zero-shot voice synthesis) and OpenAI-whisper-large-v3 (speech-to-text), packaged as ComfyUI nodes
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
[Chinese legal LLM] DISC-LawLLM: an intelligent legal system powered by large language models (LLMs) to provide a wide range of legal services.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)