Skip to content
View wan-h's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 成都

Block or report wan-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI部署

8 repositories

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,557 1,506 Updated Jan 8, 2025

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Python 774 118 Updated Dec 19, 2024

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 697 92 Updated Jan 26, 2023

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,771 4,192 Updated Jan 7, 2025

Caffe: a fast open framework for deep learning.

C++ 4,775 1,670 Updated Apr 21, 2023

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…

C++ 3,054 468 Updated Jan 8, 2025

LLM inference in C/C++

C++ 70,364 10,163 Updated Jan 8, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,754 3,412 Updated Jan 8, 2025