Skip to content
View HUSTHY's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report HUSTHY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,123 997 Updated May 23, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,790 660 Updated May 23, 2025

Example models using DeepSpeed

Python 6,496 1,090 Updated May 15, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,391 548 Updated May 21, 2025

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,499 352 Updated May 13, 2025

很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 10,102 1,194 Updated May 22, 2025

Fast-Chat-X-Client-Python

Vue 12 2 Updated Aug 27, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,577 758 Updated May 15, 2025

Python bindings for llama.cpp

Python 9,132 1,139 Updated May 8, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,428 467 Updated May 17, 2025

learning how CUDA works

Cuda 262 35 Updated Mar 3, 2025

LLM training in simple, raw C/CUDA

Cuda 26,651 3,063 Updated May 10, 2025

通过浏览器渲染生成表格图像

Python 217 42 Updated Apr 10, 2024

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

584 43 Updated Apr 22, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,771 12,364 Updated May 22, 2025

Summarize existing representative LLMs text datasets.

1,273 129 Updated Mar 25, 2025
Python 41 1 Updated May 7, 2025

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

JavaScript 49,430 9,969 Updated Apr 2, 2025

This repository contains datasets and baselines for benchmarking Chinese text recognition.

Python 475 52 Updated Dec 2, 2022
Jupyter Notebook 107 5 Updated Dec 16, 2024

Generate text images for training deep learning ocr model

Python 1,436 387 Updated Jan 17, 2022

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,916 443 Updated Aug 7, 2024

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 448 23 Updated Mar 10, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,448 6,021 Updated May 21, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,166 270 Updated May 11, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,537 1,444 Updated May 23, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,886 7,558 Updated May 23, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.

Python 1,257 145 Updated May 18, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,248 75 Updated Mar 6, 2025

Fast and low-memory attention layer written in CUDA

Cuda 17 4 Updated Jul 14, 2023
Next