Stars
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
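The first open-source, commercially usable dialogue model to support bilingual (Chinese and English) speech-text multimodal conversation. Convenient speech input greatly improves the experience of using large models that take text as input, while avoiding the cumbersome pipeline of ASR-based solutions and the errors it can introduce.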
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
An open-source RAG-based tool for chatting with your documents.
This is the Mov2mov plugin for Automatic1111/stable-diffusion-webui.
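Advanced Multi-Turn QA System with LLM and Intent Recognition. Implements multi-turn question answering and NL2API by combining LLM-based intent recognition and parameter extraction with slot filling. Aims to be a best practice for multi-turn QA with Function Call.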
Make images smaller using best-in-class codecs, right in the browser.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
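Regularly shares high-quality, interesting, and practical open-source technical tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting projects on GitHub.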
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A collection of AI-related utilities. Issues and pull requests are welcome.
Arduino Audio Tools (a powerful Audio library not only for Arduino)
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.
#1 Locally hosted web application that allows you to perform various operations on PDF files
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Netease Youdao's open-source embedding and reranker models for RAG products.
A ChatGPT web client that supports multiple users, multiple languages, and multiple database connections for persistent data storage. Provides Docker images and quick deployment scripts.
Question and Answer based on Anything.
Virtual whiteboard for sketching hand-drawn like diagrams
c121914yu / FastGPT
Forked from labring/FastGPT
A platform that uses the OpenAI API to quickly build an AI knowledge base, supporting many-to-many relationships.
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
A universal local knowledge base solution based on a vector database and GPT3.5.
Chatbot for documentation that allows you to chat with your data. Privately deployable, provides AI knowledge sharing, and integrates knowledge into your AI workflow.
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
A latent text-to-image diffusion model
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.