Home Assistant Add-on: Whisper

Installation

Follow these steps to get the add-on installed on your system:

Navigate in your Home Assistant frontend to Settings -> Add-ons -> Add-on store.
Find the "Whisper" add-on and click it.
Click on the "INSTALL" button.

How to use

After this add-on is installed and running, it will be automatically discovered by the Wyoming integration in Home Assistant. To finish the setup, click the following my button:

Alternatively, you can install the Wyoming integration manually. See the Wyoming integration documentation for more information.

Configuration

Option: `language`

Default language for the add-on. In Home Assistant 2023.8+, multiple languages can be used simultaneously by different Assist pipelines.

If you select "auto", the model will run much slower but will auto-detect the spoken language.

Performance of supported languages

List of two-letter language codes

Option: `model`

Whisper model that will be used for transcription. Choose `custom` to use the model named in `custom_model`, which may be a HuggingFace model ID like "Systran/faster-distil-whisper-small.en".

The default model is tiny-int8, a compressed version of the smallest Whisper model, which is able to run on a Raspberry Pi 4. Compressed models (int8) are slightly less accurate than their uncompressed counterparts, but smaller and faster. Distilled models are not compressed, but are faster and smaller than their non-distilled counterparts.

Available models:

tiny-int8 (compressed)
tiny
tiny.en (English only)
base-int8 (compressed)
base
base.en (English only)
small-int8 (compressed)
distil-small.en (distilled, English only)
small
small.en (English only)
medium-int8 (compressed)
distil-medium.en (distilled, English only)
medium
medium.en (English only)
large
large-v1
distil-large-v2 (distilled, English only)
large-v2
distil-large-v3 (distilled, English only)
large-v3

Option: `custom_model`

Path to a converted model directory, or a CTranslate2-converted Whisper model ID from the HuggingFace Hub like "Systran/faster-distil-whisper-small.en".
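
As an illustration, selecting a custom model might look like the fragment below when the add-on configuration is edited as YAML. The option names and the model ID are the ones given on this page; the layout is a sketch under that assumption, not a verified configuration.

```yaml
# Sketch only: with model set to custom, the add-on uses the
# model named in custom_model (here the HuggingFace ID quoted above).
model: custom
custom_model: Systran/faster-distil-whisper-small.en
```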

Option: `beam_size`

Number of candidates to consider simultaneously during transcription (see beam search).

Increasing the beam size can improve accuracy, but makes transcription slower.

Option: `initial_prompt`

Description of audio that can help Whisper transcribe unusual words better. See this discussion for an example.
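
Put together, the options described in this section could be written as the following YAML. This is a minimal sketch: the keys are the option names from this page, while the values (in particular `beam_size: 1` and the empty strings) are illustrative assumptions rather than documented defaults.

```yaml
# Illustrative values only; adjust to your hardware and language.
language: en          # two-letter language code, or "auto" (much slower)
model: tiny-int8      # the default model named in this document
custom_model: ""      # only consulted when model is set to custom
beam_size: 1          # assumed example value; larger values are slower
initial_prompt: ""    # optional description of the audio
```

A smaller model with a small beam size is the safer starting point on low-powered hardware such as a Raspberry Pi 4; move to a larger model or beam size only if accuracy is not good enough.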

Backups

Whisper model files can be quite large, so they are automatically excluded from backups. The models will be re-downloaded when the backup is restored.

Support

Got questions?

You have several options to get them answered:

The Home Assistant Discord Chat Server.
The Home Assistant Community Forum.
The Reddit subreddit /r/homeassistant.

In case you've found a bug, please open an issue on our GitHub.
