Skip to content
View wan-h's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 成都

Block or report wan-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI部署

8 repositories

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,851 1,529 Updated Mar 7, 2025

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Python 788 121 Updated Mar 4, 2025

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 702 92 Updated Jan 26, 2023

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,063 4,211 Updated Mar 5, 2025

Caffe: a fast open framework for deep learning.

C++ 4,779 1,670 Updated Apr 21, 2023

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…

C++ 3,121 475 Updated Feb 24, 2025

LLM inference in C/C++

C++ 75,994 10,990 Updated Mar 7, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,085 3,447 Updated Mar 6, 2025