Stars
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
Question and Answer based on Anything.
DSPy: The framework for programming—not prompting—language models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
🦜🔗 Build context-aware reasoning applications
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
A high-throughput and memory-efficient inference and serving engine for LLMs
Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🔥Highlighting the top ML papers every week.
Drag & drop UI to build your customized LLM flow
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
☁️ Build multimodal AI applications with cloud-native stack
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behi…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.