GitHub - vllm-project/vllm-gaudi: Community maintained hardware plugin for vLLM on Intel Gaudi

Welcome to vLLM x Intel Gaudi

| Documentation | Intel® Gaudi® Documentation | Optimizing Training Platform Guide |

Latest News 🔥

[2025/06] We introduced an early developer preview of the vLLM Gaudi Plugin. It is not yet intended for general use. For a more stable experience, consider using the HabanaAI/vllm-fork or the in-tree Gaudi implementation available in vllm-project/vllm.

About

vLLM Gaudi plugin (vllm-gaudi) integrates Intel Gaudi accelerators with vLLM to optimize large language model inference.

This plugin follows the [RFC]: Hardware pluggable and [RFC]: Enhancing vLLM Plugin Architecture principles, providing a modular interface for Intel Gaudi hardware.

Learn more: 🚀 vLLM Plugin System Overview

Getting Started

Preparation of the Setup

To set up the execution environment, follow the instructions in the Gaudi Installation Guide. To achieve the best performance on HPU, follow the methods outlined in the Optimizing Training Platform Guide.

Get the last good vLLM commit. NOTE: vllm-gaudi always follows the latest vLLM commit. However, upstream vLLM API updates may break vllm-gaudi, so the commit saved here is one that is verified against vllm-gaudi on an hourly basis.
```
git clone https://github.com/vllm-project/vllm-gaudi
cd vllm-gaudi
export VLLM_COMMIT_HASH=$(git show "origin/vllm/last-good-commit-for-vllm-gaudi:VLLM_STABLE_COMMIT" 2>/dev/null)
```
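Because the `git show` command above discards errors with `2>/dev/null`, `VLLM_COMMIT_HASH` can end up empty if the branch or file is not reachable. The check below is a minimal sketch for catching that before the later `git checkout`. The helper name `is_commit_hash` is hypothetical, and it assumes the `VLLM_STABLE_COMMIT` file holds a plain hexadecimal commit hash of 7 to 40 characters.

```shell
# Hypothetical helper (not part of vllm-gaudi): succeeds only if the
# argument looks like a Git commit hash (7 to 40 lowercase hex characters).
# Assumption: VLLM_STABLE_COMMIT stores the hash in this form.
is_commit_hash() {
  printf '%s' "$1" | grep -Eq '^[0-9a-f]{7,40}$'
}

if is_commit_hash "${VLLM_COMMIT_HASH:-}"; then
  echo "Using vLLM commit ${VLLM_COMMIT_HASH}"
else
  echo "VLLM_COMMIT_HASH is empty or not a commit hash; check that the origin remote and branch are reachable" >&2
fi
```

If the check reports a problem, running the `git show` command again without `2>/dev/null` shows the underlying Git error.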

Install vLLM with pip or from source:

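```
# Build vLLM from source for empty platform, reusing existing torch installation
git clone https://github.com/vllm-project/vllm
cd vllm
git checkout $VLLM_COMMIT_HASH
# Drop the torch requirement so the existing torch installation is kept.
# '/^torch/d' matches lines starting with "torch"; the bracket form '/^[torch]/d'
# would also delete any line starting with t, o, r, c, or h (for example cmake).
pip install -r <(sed '/^torch/d' requirements/build.txt)
VLLM_TARGET_DEVICE=empty pip install --no-build-isolation -e .
cd ..
```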

Install vLLM-Gaudi from source:
```
cd vllm-gaudi
pip install -e .
cd ..
```
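After both installs, a quick import check can confirm that the packages are visible to the Python interpreter. This is a sketch, not an official verification step: the helper `check_python_module` is hypothetical, the module name `vllm_gaudi` is taken from the repository's package directory, and a successful import may additionally depend on the Gaudi software stack being present on the machine.

```shell
# Hypothetical helper (not part of vllm-gaudi): succeeds only if python3
# can import the module whose name is passed as the first argument.
check_python_module() {
  python3 -c "import importlib, sys; importlib.import_module(sys.argv[1])" "$1" >/dev/null 2>&1
}

# Report the import status of vLLM and the Gaudi plugin package.
for module in vllm vllm_gaudi; do
  if check_python_module "$module"; then
    echo "ok: $module"
  else
    echo "missing: $module"
  fi
done
```

A `missing` line only says that the import failed. Running `python3 -c "import vllm_gaudi"` directly prints the full traceback.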
For all installation methods, such as NixL, follow the link.

Contributing

We welcome and value any contributions and collaborations.

Contact Us

For technical questions and feature requests, please use GitHub Issues
For discussing with fellow users, please use the vLLM Forum
For coordinating contributions and development, please use Slack
For security disclosures, please use GitHub's Security Advisories feature

