This directory contains self-documented cloud integrations and demos to illustrate examples of DeepSparse usage.
Open a Pull Request to contribute your own examples.
Example | Description |
---|---|
AWS Marketplace | How to launch a DeepSparse integrated instance via the AWS Marketplace. |
AWS Sagemaker Integration | How to deploy a DeepSparse inference server on SageMaker. |
AWS Serverless Integration | How to deploy a DeepSparse pipeline for batch or real-time inference on select serverless services. |
Azure VM | How to launch a DeepSparse image in an Azure virtual machine. |
DigitalOcean Marketplace | How to launch a DeepSparse integrated instance via the DigitalOcean Marketplace. |
GCP Marketplace | How to launch a DeepSparse integrated instance on the Google Cloud Marketplace. |
Google Cloud Run | How to deploy the DeepSparse Server on Cloud Run. |
Google Kubernetes Engine | How to deploy the DeepSparse Server on GKE. |
Example | Description |
---|---|
Benchmark and ONNX Model Correctness | Comparing predictions and benchmark performance between DeepSparse Engine and ONNXRuntime. |
Benchmark UI | How to deploy a gradio UI for benchmarking SparseZoo models on a DigitalOcean instance or local machine. |
ChatGPT Cheat Sheet | User guide for prompting ChatGPT segmented by use-case. |
Hugging Face Transformers | Serving, benchmarking, and running NLP models from Hugging Face. |
YOLOv3 and YOLOv5 | Serving, benchmarking, and running annotation inferences with YOLOv3 and YOLOv5 models. |
Image Classification | How to use image classification models from SparseZoo to perform inference and benchmarking with the DeepSparse Engine. |
Object Detection | How to use object detection models from SparseZoo to perform inference and benchmarking with the DeepSparse Engine. |
Instance Segmentation | How to use an optimized YOLACT model and the DeepSparse Engine to perform real-time instance segmentation. |
SparseServer.UI | A Streamlit app for deploying the DeepSparse Server to compare the latency and accuracy of sparse BERT models. |
Twitter Sentiment Analysis | Example of scraping, processing, and classifying Twitter data using the DeepSparse Engine for 10x faster performance on CPUs. |
Flask Model Server | Simple model server and client example, showing how to use the DeepSparse Engine as an inference backend for a real-time inference server. |