Skip to content
View kingyzf's full-sized avatar

Block or report kingyzf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

All things prompt engineering

Python 5,505 302 Updated Jun 4, 2024

Official inference framework for 1-bit LLMs

C++ 12,632 882 Updated Dec 20, 2024

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,668 5,424 Updated Jan 17, 2025

Agent S: an open agentic framework that uses computers like a human

Python 753 102 Updated Jan 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,422 4,722 Updated Jan 18, 2025

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…

Jupyter Notebook 2,559 258 Updated Jan 16, 2025

Machine Learning Engineering Open Book

Python 12,408 759 Updated Jan 19, 2025

Parse files for optimal RAG

Python 3,535 344 Updated Jan 10, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,749 2,146 Updated Jan 18, 2025

A playbook for systematically maximizing the performance of deep learning models.

27,858 2,299 Updated Jun 18, 2024

Fast and memory-efficient exact attention

Python 15,118 1,428 Updated Jan 18, 2025

Practical GPU Sharing Without Memory Size Constraints

C 236 25 Updated Sep 23, 2024

A language model programming library.

Python 5,567 328 Updated Dec 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,974 5,219 Updated Jan 19, 2025

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 9,786 910 Updated Jan 9, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,752 482 Updated Jan 17, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,834 708 Updated Jan 11, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 5,333 450 Updated Jan 11, 2025

Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.

Go 108,336 8,684 Updated Jan 18, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,972 5,236 Updated Jun 27, 2024

Architected for speed. Automated for easy. Monitoring and troubleshooting, transformed!

C 72,962 5,995 Updated Jan 19, 2025

EPPlus-Excel spreadsheets for .NET

C# 1,857 284 Updated Jan 17, 2025

Read and extract text and other content from PDFs in C# (port of PDFBox)

C# 1,830 250 Updated Jan 19, 2025

File upload vulnerability scanner and exploitation tool.

Python 3,158 510 Updated Apr 16, 2023

the cross platform webshell tool in .NET

C# 537 224 Updated May 19, 2016

This is a webshell open source project

PHP 10,218 5,580 Updated Dec 24, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 22,713 3,421 Updated Jan 18, 2025

国密算法js版

JavaScript 986 262 Updated Aug 14, 2024

A webrtc interface wrapped in dart language.

Dart 31 37 Updated Dec 16, 2024
Next