LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Many container images are hosted overseas (e.g. gcr), so downloads from within China are slow and need acceleration. This project aims to provide a stable, reliable, and secure container image mirroring service connecting to the rest of the world.
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
📚 150+ Tensor/CUDA core kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA, and CuTe (98%~100% of cuBLAS/FA2 TFLOPS 🎉🎉).
This project collects open-source datasets for table-intelligence tasks (e.g. table question answering and table-to-text generation), converts the raw data into instruction-tuning format, and fine-tunes LLMs on it to strengthen their understanding of tabular data, ultimately building a large language model dedicated to table-intelligence tasks.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Summarizes existing representative text datasets for LLMs.
Mini HoK: a novel MARL benchmark based on the popular mobile game Honor of Kings, addressing limitations of existing environments such as complexity and accessibility.
The most comprehensive database of classical Chinese poetry 🧶: nearly 14,000 poets from the Tang and Song dynasties, with about 55,000 Tang poems and 260,000 Song poems, plus 1,564 ci poets of the Song era and 21,050 ci poems.
This repository contains datasets and baselines for benchmarking Chinese text recognition.
Generates text images for training deep-learning OCR models.
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Tr…
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
A high-throughput and memory-efficient inference and serving engine for LLMs
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Fast and low-memory attention layer written in CUDA
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Fast and memory-efficient exact attention
A repository for pretraining from scratch and SFT-ing a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to train a chat-llama2 with basic Chinese question-answering ability.