Skip to content
View 1050705324's full-sized avatar

Block or report 1050705324

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TinyChatEngine: On-Device LLM Inference Library

C++ 797 79 Updated Jul 4, 2024

高性能并行编程与优化 - 课件

C++ 3,863 549 Updated Oct 18, 2024

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

61,444 7,884 Updated Jan 19, 2025

【A common used C++ DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流

C++ 1,871 338 Updated Jan 27, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 6,817 1,909 Updated Jul 26, 2024

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 268 5 Updated Oct 2, 2024

VisionLLM Series

Python 984 35 Updated Jan 26, 2025

LLM training in simple, raw C/CUDA

Cuda 25,170 2,887 Updated Oct 2, 2024

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,158 229 Updated Jan 27, 2025

Universal LLM Deployment Engine with ML Compilation

Python 19,774 1,636 Updated Jan 24, 2025

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 2,679 303 Updated Oct 26, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 13,415 1,511 Updated Jan 15, 2025

LLM Tokenizer with BPE algorithm

Python 27 8 Updated May 7, 2024