Name		Name	Last commit message	Last commit date
parent directory ..
docker		docker
examples/llama		examples/llama
README.md		README.md
destruct_env.ps1		destruct_env.ps1
setup_build_env.ps1		setup_build_env.ps1
setup_env.ps1		setup_env.ps1

README.md

TensorRT-LLM for Windows

  The Windows release of TensorRT-LLM is currently in beta. We recommend using the `rel` branch for the most stable experience.

TensorRT-LLM is supported on bare-metal Windows for single-GPU inference. The release supports GeForce 40-series GPUs.

The release wheel for Windows can be installed with pip. Alternatively, you can build TensorRT-LLM for Windows from the source. Building from the source is an advanced option and is not necessary for building or running LLM engines. It is, however, required if you plan to use the C++ runtime directly or run C++ benchmarks.

Getting Started

To get started with TensorRT-LLM on Windows, visit our documentation:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

windows

windows

README.md

TensorRT-LLM for Windows

Getting Started

Files

windows

Directory actions

More options

Directory actions

More options

Latest commit

History

windows

Folders and files

parent directory

README.md

TensorRT-LLM for Windows

Getting Started