Update NVIDIA Jetson TensorRT Benchmarks (ultralytics#16156)
Co-authored-by: Glenn Jocher <[email protected]>
lakshanthad and glenn-jocher authored Sep 9, 2024
1 parent be71a43 commit fbc6bae
Showing 1 changed file with 71 additions and 61 deletions.
132 changes: 71 additions & 61 deletions docs/en/guides/nvidia-jetson.md
@@ -287,7 +287,7 @@ YOLOv8 benchmarks were run by the Ultralytics team on 10 different model formats
Although all model exports work on NVIDIA Jetson, we have only included **PyTorch, TorchScript, TensorRT** in the comparison chart below because they make use of the GPU on the Jetson and are guaranteed to produce the best results. All the other exports only utilize the CPU, and their performance is not as good as that of the above three. You can find benchmarks for all exports in the section after this chart.

<div style="text-align: center;">
<img width="800" src="https://github.com/ultralytics/docs/releases/download/0/nvidia-jetson-ecosystem-1.avif" alt="NVIDIA Jetson Ecosystem">
<img width="800" src="https://github.com/ultralytics/docs/releases/download/0/nvidia-jetson-ecosystem-2.avif" alt="NVIDIA Jetson Ecosystem">
</div>

### Detailed Comparison Table
@@ -298,78 +298,88 @@ The below table represents the benchmark results for five different models (YOLO

=== "YOLOv8n"

| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|---------------|--------|-----------|-------------|------------------------|
| PyTorch | ✅ | 6.2 | 0.6381 | 14.3 |
| TorchScript | ✅ | 12.4 | 0.6117 | 13.3 |
| ONNX | ✅ | 12.2 | 0.6092 | 70.6 |
| OpenVINO | ✅ | 12.3 | 0.6092 | 104.2 |
| TensorRT | ✅ | 13.6 | 0.6117 | 8.9 |
| TF SavedModel | ✅ | 30.6 | 0.6092 | 141.74 |
| TF GraphDef | ✅ | 12.3 | 0.6092 | 199.93 |
| TF Lite | ✅ | 12.3 | 0.6092 | 349.18 |
| PaddlePaddle | ✅ | 24.4 | 0.6030 | 555 |
| NCNN | ✅ | 12.2 | 0.6092 | 32 |
| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|-----------------|--------|-------------------|-------------|------------------------|
| PyTorch | ✅ | 6.2 | 0.6381 | 14.3 |
| TorchScript | ✅ | 12.4 | 0.6117 | 13.3 |
| ONNX | ✅ | 12.2 | 0.6092 | 70.6 |
| OpenVINO | ✅ | 12.3 | 0.6092 | 104.2 |
| TensorRT (FP32) | ✅ | 16.1 | 0.6091 | 8.01 |
| TensorRT (FP16) | ✅ | 9.2 | 0.6093 | 4.55 |
| TensorRT (INT8) | ✅ | 5.9 | 0.2759 | 4.09 |
| TF SavedModel | ✅ | 30.6 | 0.6092 | 141.74 |
| TF GraphDef | ✅ | 12.3 | 0.6092 | 199.93 |
| TF Lite | ✅ | 12.3 | 0.6092 | 349.18 |
| PaddlePaddle | ✅ | 24.4 | 0.6030 | 555 |
| NCNN | ✅ | 12.2 | 0.6092 | 32 |
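
The latency column is easier to compare when converted to throughput. The sketch below does that arithmetic for the GPU-backed rows of the updated YOLOv8n table. It assumes images are processed one at a time, so that frames per second is simply `1000 / ms`; the helper names `to_fps` and `speedup_over` are illustrative and are not part of any Ultralytics API.

```python
# Inference times (ms per image) copied from the updated YOLOv8n table.
latency_ms = {
    "PyTorch": 14.3,
    "TorchScript": 13.3,
    "TensorRT (FP32)": 8.01,
    "TensorRT (FP16)": 4.55,
    "TensorRT (INT8)": 4.09,
}


def to_fps(ms_per_image: float) -> float:
    """Convert per-image latency in milliseconds to images per second.

    Assumption: one image at a time, no batching or pipelining.
    """
    return 1000.0 / ms_per_image


def speedup_over(baseline_ms: float, candidate_ms: float) -> float:
    """Return how many times faster the candidate is than the baseline."""
    return baseline_ms / candidate_ms


baseline = latency_ms["PyTorch"]

# Map each format to (throughput in FPS, speedup relative to PyTorch).
summary = {
    name: (round(to_fps(ms), 1), round(speedup_over(baseline, ms), 2))
    for name, ms in latency_ms.items()
}

for name, (fps, speedup) in summary.items():
    print(f"{name:<16} {fps:>6.1f} FPS  {speedup:.2f}x vs PyTorch")
```

These figures only restate the table's inference time; they say nothing about pre-processing, post-processing, or accuracy, so the mAP column still needs to be read alongside them.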

=== "YOLOv8s"

| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|---------------|--------|-----------|-------------|------------------------|
| PyTorch | ✅ | 21.5 | 0.6967 | 18 |
| TorchScript | ✅ | 43.0 | 0.7136 | 23.81 |
| ONNX | ✅ | 42.8 | 0.7136 | 185.55 |
| OpenVINO | ✅ | 42.9 | 0.7136 | 243.97 |
| TensorRT | ✅ | 44.0 | 0.7136 | 14.82 |
| TF SavedModel | ✅ | 107 | 0.7136 | 260.03 |
| TF GraphDef | ✅ | 42.8 | 0.7136 | 423.4 |
| TF Lite | ✅ | 42.8 | 0.7136 | 1046.64 |
| PaddlePaddle | ✅ | 85.5 | 0.7140 | 1464 |
| NCNN | ✅ | 42.7 | 0.7200 | 63 |
| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|-----------------|--------|-------------------|-------------|------------------------|
| PyTorch | ✅ | 21.5 | 0.6967 | 18 |
| TorchScript | ✅ | 43.0 | 0.7136 | 23.81 |
| ONNX | ✅ | 42.8 | 0.7136 | 185.55 |
| OpenVINO | ✅ | 42.9 | 0.7136 | 243.97 |
| TensorRT (FP32) | ✅ | 46.4 | 0.7136 | 14.01 |
| TensorRT (FP16) | ✅ | 24.2 | 0.722 | 7.16 |
| TensorRT (INT8) | ✅ | 13.7 | 0.4233 | 5.49 |
| TF SavedModel | ✅ | 107 | 0.7136 | 260.03 |
| TF GraphDef | ✅ | 42.8 | 0.7136 | 423.4 |
| TF Lite | ✅ | 42.8 | 0.7136 | 1046.64 |
| PaddlePaddle | ✅ | 85.5 | 0.7140 | 1464 |
| NCNN | ✅ | 42.7 | 0.7200 | 63 |

=== "YOLOv8m"

| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|---------------|--------|-----------|-------------|------------------------|
| PyTorch | ✅ | 49.7 | 0.7370 | 36.4 |
| TorchScript | ✅ | 99.2 | 0.7285 | 53.58 |
| ONNX | ✅ | 99 | 0.7280 | 452.09 |
| OpenVINO | ✅ | 99.1 | 0.7280 | 544.36 |
| TensorRT | ✅ | 100.3 | 0.7285 | 33.21 |
| TF SavedModel | ✅ | 247.5 | 0.7280 | 543.65 |
| TF GraphDef | ✅ | 99 | 0.7280 | 906.63 |
| TF Lite | ✅ | 99 | 0.7280 | 2758.08 |
| PaddlePaddle | ✅ | 197.9 | 0.7280 | 3678 |
| NCNN | ✅ | 98.9 | 0.7260 | 135 |
| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|-----------------|--------|-------------------|-------------|------------------------|
| PyTorch | ✅ | 49.7 | 0.7370 | 36.4 |
| TorchScript | ✅ | 99.2 | 0.7285 | 53.58 |
| ONNX | ✅ | 99 | 0.7280 | 452.09 |
| OpenVINO | ✅ | 99.1 | 0.7280 | 544.36 |
| TensorRT (FP32) | ✅ | 102.4 | 0.7285 | 31.51 |
| TensorRT (FP16) | ✅ | 52.6 | 0.7324 | 14.88 |
| TensorRT (INT8) | ✅ | 28.6 | 0.3283 | 10.89 |
| TF SavedModel | ✅ | 247.5 | 0.7280 | 543.65 |
| TF GraphDef | ✅ | 99 | 0.7280 | 906.63 |
| TF Lite | ✅ | 99 | 0.7280 | 2758.08 |
| PaddlePaddle | ✅ | 197.9 | 0.7280 | 3678 |
| NCNN | ✅ | 98.9 | 0.7260 | 135 |

=== "YOLOv8l"

| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|---------------|--------|-----------|-------------|------------------------|
| PyTorch | ✅ | 83.7 | 0.7768 | 61.3 |
| TorchScript | ✅ | 167.2 | 0.7554 | 87.9 |
| ONNX | ✅ | 166.8 | 0.7551 | 852.29 |
| OpenVINO | ✅ | 167 | 0.7551 | 1012.6 |
| TensorRT | ✅ | 168.4 | 0.7554 | 51.23 |
| TF SavedModel | ✅ | 417.2 | 0.7551 | 990.45 |
| TF GraphDef | ✅ | 166.9 | 0.7551 | 1649.86 |
| TF Lite | ✅ | 166.9 | 0.7551 | 5652.37 |
| PaddlePaddle | ✅ | 333.6 | 0.7551 | 7114.67 |
| NCNN | ✅ | 166.8 | 0.7685 | 231.9 |
| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|-----------------|--------|-------------------|-------------|------------------------|
| PyTorch | ✅ | 83.7 | 0.7768 | 61.3 |
| TorchScript | ✅ | 167.2 | 0.7554 | 87.9 |
| ONNX | ✅ | 166.8 | 0.7551 | 852.29 |
| OpenVINO | ✅ | 167 | 0.7551 | 1012.6 |
| TensorRT (FP32) | ✅ | 170.5 | 0.7554 | 49.79 |
| TensorRT (FP16) | ✅ | 86.1 | 0.7535 | 22.89 |
| TensorRT (INT8) | ✅ | 46.4 | 0.4048 | 14.61 |
| TF SavedModel | ✅ | 417.2 | 0.7551 | 990.45 |
| TF GraphDef | ✅ | 166.9 | 0.7551 | 1649.86 |
| TF Lite | ✅ | 166.9 | 0.7551 | 5652.37 |
| PaddlePaddle | ✅ | 333.6 | 0.7551 | 7114.67 |
| NCNN | ✅ | 166.8 | 0.7685 | 231.9 |

=== "YOLOv8x"

| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|---------------|--------|-----------|-------------|------------------------|
| PyTorch | ✅ | 130.5 | 0.7759 | 93 |
| TorchScript | ✅ | 260.7 | 0.7472 | 135.1 |
| ONNX | ✅ | 260.4 | 0.7479 | 1296.13 |
| OpenVINO | ✅ | 260.6 | 0.7479 | 1502.15 |
| TensorRT | ✅ | 261.8 | 0.7469 | 84.53 |
| TF SavedModel | ✅ | 651.1 | 0.7479 | 1451.76 |
| TF GraphDef | ✅ | 260.5 | 0.7479 | 4029.36 |
| TF Lite | ✅ | 260.4 | 0.7479 | 8772.86 |
| PaddlePaddle | ✅ | 520.8 | 0.7479 | 10619.53 |
| NCNN | ✅ | 260.4 | 0.7646 | 376.38 |
| Format | Status | Size on disk (MB) | mAP50-95(B) | Inference time (ms/im) |
|-----------------|--------|-------------------|-------------|------------------------|
| PyTorch | ✅ | 130.5 | 0.7759 | 93 |
| TorchScript | ✅ | 260.7 | 0.7472 | 135.1 |
| ONNX | ✅ | 260.4 | 0.7479 | 1296.13 |
| OpenVINO | ✅ | 260.6 | 0.7479 | 1502.15 |
| TensorRT (FP32) | ✅ | 264.0 | 0.7469 | 80.01 |
| TensorRT (FP16) | ✅ | 133.3 | 0.7513 | 40.76 |
| TensorRT (INT8) | ✅ | 70.2 | 0.4277 | 22.08 |
| TF SavedModel | ✅ | 651.1 | 0.7479 | 1451.76 |
| TF GraphDef | ✅ | 260.5 | 0.7479 | 4029.36 |
| TF Lite | ✅ | 260.4 | 0.7479 | 8772.86 |
| PaddlePaddle | ✅ | 520.8 | 0.7479 | 10619.53 |
| NCNN | ✅ | 260.4 | 0.7646 | 376.38 |
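
The three TensorRT rows added to each table invite a precision trade-off comparison. The following is a minimal sketch that takes FP32 as the baseline and computes, for FP16 and INT8, the latency speedup, the on-disk size ratio, and the absolute change in mAP50-95, using only the numbers from the updated tables. The `Row` type and `compare` helper are hypothetical names introduced for illustration. The tables do not state how the INT8 engines were calibrated, so the code reports the accuracy change without attributing a cause.

```python
from typing import NamedTuple


class Row(NamedTuple):
    size_mb: float      # "Size on disk (MB)"
    map50_95: float     # "mAP50-95(B)"
    latency_ms: float   # "Inference time (ms/im)"


# TensorRT rows copied from the updated tables, one entry per model.
tensorrt = {
    "YOLOv8n": {"FP32": Row(16.1, 0.6091, 8.01), "FP16": Row(9.2, 0.6093, 4.55), "INT8": Row(5.9, 0.2759, 4.09)},
    "YOLOv8s": {"FP32": Row(46.4, 0.7136, 14.01), "FP16": Row(24.2, 0.722, 7.16), "INT8": Row(13.7, 0.4233, 5.49)},
    "YOLOv8m": {"FP32": Row(102.4, 0.7285, 31.51), "FP16": Row(52.6, 0.7324, 14.88), "INT8": Row(28.6, 0.3283, 10.89)},
    "YOLOv8l": {"FP32": Row(170.5, 0.7554, 49.79), "FP16": Row(86.1, 0.7535, 22.89), "INT8": Row(46.4, 0.4048, 14.61)},
    "YOLOv8x": {"FP32": Row(264.0, 0.7469, 80.01), "FP16": Row(133.3, 0.7513, 40.76), "INT8": Row(70.2, 0.4277, 22.08)},
}


def compare(baseline: Row, candidate: Row) -> dict:
    """Compare a reduced-precision engine against the FP32 baseline."""
    return {
        "speedup": baseline.latency_ms / candidate.latency_ms,   # >1 means faster
        "size_ratio": candidate.size_mb / baseline.size_mb,      # <1 means smaller
        "map_delta": candidate.map50_95 - baseline.map50_95,     # <0 means less accurate
    }


tradeoff = {
    model: {prec: compare(rows["FP32"], rows[prec]) for prec in ("FP16", "INT8")}
    for model, rows in tensorrt.items()
}

for model, by_precision in tradeoff.items():
    for prec, t in by_precision.items():
        print(
            f"{model} {prec}: {t['speedup']:.2f}x faster, "
            f"{t['size_ratio']:.2f}x size, mAP change {t['map_delta']:+.4f}"
        )
```

On these figures, FP16 stays within 0.01 mAP of FP32 for every model while roughly halving size and latency, whereas INT8 is the fastest and smallest option but loses about 0.29 to 0.40 mAP.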

[Explore more benchmarking efforts by Seeed Studio](https://www.seeedstudio.com/blog/2023/03/30/yolov8-performance-benchmarks-on-nvidia-jetson-devices) running on different versions of NVIDIA Jetson hardware.
