Skip to content
View wan-h's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 成都

Block or report wan-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI部署

8 repositories

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,236 1,583 Updated May 22, 2025

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Python 825 126 Updated Mar 12, 2025

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 712 94 Updated Jan 26, 2023

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,514 4,253 Updated May 22, 2025

Caffe: a fast open framework for deep learning.

C++ 4,787 1,669 Updated Apr 21, 2023

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…

C++ 3,188 477 Updated Feb 24, 2025

LLM inference in C/C++

C++ 80,718 11,872 Updated May 23, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,509 3,494 Updated May 20, 2025