299 results for source starred repositories

Accessible large language models via k-bit quantization for PyTorch.

Python 6,761 670 Updated Mar 4, 2025

Python packaging and dependency management made easy

Python 32,732 2,332 Updated Mar 6, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,488 1,159 Updated Mar 7, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 79,038 11,530 Updated Mar 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,423 815 Updated Mar 6, 2025

LLM inference in C/C++

C++ 75,991 10,989 Updated Mar 7, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,687 2,360 Updated Mar 6, 2025

All things prompt engineering

Python 5,557 307 Updated Jun 4, 2024

Official inference framework for 1-bit LLMs

C++ 12,785 899 Updated Feb 18, 2025

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 42,771 5,522 Updated Mar 3, 2025

[ICLR 2025] Agent S: an open agentic framework that uses computers like a human

Python 828 108 Updated Feb 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,258 5,294 Updated Mar 6, 2025

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…

Jupyter Notebook 3,122 328 Updated Mar 5, 2025

Machine Learning Engineering Open Book

Python 13,079 796 Updated Mar 1, 2025

Knowledge Agents and Management in the Cloud

Python 3,753 369 Updated Mar 7, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,159 2,306 Updated Mar 6, 2025

A playbook for systematically maximizing the performance of deep learning models.

28,092 2,314 Updated Jun 18, 2024

Fast and memory-efficient exact attention

Python 16,127 1,527 Updated Mar 7, 2025

Practical GPU Sharing Without Memory Size Constraints

C 254 27 Updated Sep 23, 2024

A language model programming library.

Python 5,657 337 Updated Feb 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,516 6,097 Updated Mar 7, 2025

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 10,042 944 Updated Feb 24, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 5,951 507 Updated Feb 21, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,929 1,109 Updated Feb 28, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 6,042 542 Updated Jan 24, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 131,456 10,795 Updated Mar 7, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,028 5,237 Updated Jun 27, 2024

X-Ray Vision for your infrastructure!

C 73,696 6,021 Updated Mar 7, 2025