🤖 AIAssisiant
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tesseract Open Source OCR Engine (main repository)
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Speech recognition module for Python, supporting several engines and APIs, online and offline.
WebUI extension for ControlNet
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A Gradio web UI for Large Language Models with support for multiple inference backends.
基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答
stable-diffusion-webui 的汉化扩展
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
使用electron和live2D开发的类似桌面精灵的应用(A desktop application developed using electron and live2D)
An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
A ChatGPT C# client for MacOS, Windows, Linux, Android, iOS and Browser. Powered by Avalonia UI framework.
HiLoop是一个简约的桌面悬浮球工具,支持拖动及配置,提供了待办事项、快速笔记等功能。HiLoop is a minimalist desktop tool that supports drag and configuration, providing functions such as to-do lists and quick notes.
一个完整electron桌面记账程序,技术栈主要使用electron-vue+vuetify。开机自动启动,自动更新,托盘最小化,闪烁等常用功能,Nsis制作漂亮的安装包。
Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.
Get your documents ready for gen AI
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.