huanjunjun

🌴

On vacation

huan jun huanjunjun

🌴

On vacation

5 followers · 18 following

Stars

LLM

16 repositories

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 34,056 5,786 Updated Nov 29, 2024

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 82,573 9,944 Updated Mar 11, 2025

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 44,088 3,956 Updated Mar 12, 2025

1Panel-dev / MaxKB

💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI and more.

Python 14,347 1,911 Updated Mar 12, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 132,344 10,917 Updated Mar 12, 2025

exo-explore / exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 26,331 1,598 Updated Mar 12, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,150 6,202 Updated Mar 12, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 76,294 11,042 Updated Mar 11, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,094 4,213 Updated Mar 10, 2025

apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,090 3,527 Updated Mar 12, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 5,994 694 Updated Oct 22, 2024

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,674 1,799 Updated Mar 12, 2025

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,312 2,336 Updated Mar 12, 2025

cline / cline

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 34,096 3,390 Updated Mar 12, 2025

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 13,276 1,366 Updated Mar 5, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,852 705 Updated Mar 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly