Stars
🍒 Cherry Studio is a desktop client that supports multiple LLM providers. Supports deepseek-r1
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI
Use your locally running AI models to assist you in your web browsing
PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services, and provides CLI/GUI/Docker/Zotero
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
Gradio WebUI for AdvancedLivePortrait
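🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 A next-generation, one-stop AI solution for business-facing and consumer-facing (B/C) use, supporting OpenAI, Midjourney, Claude, iFlytek Spark, Stable Diffusion, DALL·E, ChatGLM, Tongyi Qianwen, Tencent Hunyuan, 360 Zhinao, Baichuan AI, Volcano Ark, New Bing, Gemini, Moonshot …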
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
A GUI version of GOT-OCR that provides OCR, PDF export, batch processing, and other features, but does not provide training
The fastest digital human algorithm, now on your desktop.
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer (RVC), zero-shot Voice Cloning (E2, F5-TTS), YouTub…
The offline version of CapsWriter, a handy voice input tool for PC
An open-source RAG-based tool for chatting with your documents.
StoryMaker: Towards consistent characters in text-to-image generation
Industry-leading face manipulation platform
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
Drag & drop UI to build your customized LLM flow
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
A lightweight, local-first, graphic-centric productivity tool for building your second brain. Supports Excalidraw/Tldraw whiteboards and Notion-like notes.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Make bilingual EPUB books using AI translation