huanjunjun
Stars

LLM

16 repositories

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 34,056 5,786 Updated Nov 29, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 82,573 9,944 Updated Mar 11, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 44,088 3,956 Updated Mar 12, 2025

💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI and more.

Python 14,347 1,911 Updated Mar 12, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 132,344 10,917 Updated Mar 12, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 26,331 1,598 Updated Mar 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,150 6,202 Updated Mar 12, 2025

LLM inference in C/C++

C++ 76,294 11,042 Updated Mar 11, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,094 4,213 Updated Mar 10, 2025

Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators

Python 12,090 3,527 Updated Mar 12, 2025

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm and application engineers

HTML 5,994 694 Updated Oct 22, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,674 1,799 Updated Mar 12, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,312 2,336 Updated Mar 12, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 34,096 3,390 Updated Mar 12, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 13,276 1,366 Updated Mar 5, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,852 705 Updated Mar 12, 2025