LLM
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI and more.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A high-throughput and memory-efficient inference and serving engine for LLMs
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Open deep learning compiler stack for cpu, gpu and specialized accelerators
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
A modular graph-based Retrieval-Augmented Generation (RAG) system
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.