Skip to content
View HUSTHY's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report HUSTHY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,100 455 Updated Jan 6, 2025

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,376 344 Updated Nov 3, 2024

很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 7,666 991 Updated Jan 6, 2025

Fast-Chat-X-Client-Python

Vue 11 2 Updated Aug 27, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,075 248 Updated Dec 4, 2024

Python bindings for llama.cpp

Python 8,381 1,001 Updated Dec 30, 2024

📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,883 198 Updated Jan 6, 2025

learning how CUDA works

Cuda 184 24 Updated Aug 16, 2024

LLM training in simple, raw C/CUDA

Cuda 24,953 2,839 Updated Oct 2, 2024

通过浏览器渲染生成表格图像

Python 210 42 Updated Apr 10, 2024

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

511 38 Updated Apr 22, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 54,867 6,766 Updated Jan 6, 2025

Summarize existing representative LLMs text datasets.

1,109 115 Updated Dec 23, 2024

Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such as complexity and accessibility.

Python 34 Updated Aug 28, 2024

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

JavaScript 48,502 9,744 Updated Aug 10, 2024

This repository contains datasets and baselines for benchmarking Chinese text recognition.

Python 445 52 Updated Dec 2, 2022
Jupyter Notebook 99 6 Updated Dec 16, 2024

Generate text images for training deep learning ocr model

Python 1,409 386 Updated Jan 17, 2022

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,274 403 Updated Aug 7, 2024

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 396 21 Updated Dec 9, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,377 4,607 Updated Jan 4, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,865 228 Updated Dec 30, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,080 1,045 Updated Jan 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,239 5,055 Updated Jan 6, 2025

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python 901 93 Updated Jan 2, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,171 70 Updated Oct 14, 2024

Fast and low-memory attention layer written in CUDA

Cuda 15 4 Updated Jul 14, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,590 4,519 Updated Aug 16, 2024

Fast and memory-efficient exact attention

Python 14,941 1,410 Updated Jan 6, 2025

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,606 321 Updated May 21, 2024
Next