Follow these steps to get the add-on installed on your system:
- Navigate in your Home Assistant frontend to Settings -> Add-ons -> Add-on store.
- Find the "Whisper" add-on and click it.
- Click on the "INSTALL" button.
After this add-on is installed and running, it will be automatically discovered by the Wyoming integration in Home Assistant. To finish the setup, click the following my button:
Alternatively, you can install the Wyoming integration manually, see the Wyoming integration documentation for more information.
Default language for the add-on. In Home Assist 2023.8+, multiple languages can be used simultaneously by different Assist pipelines.
If you select "auto", the model will run much slower but will auto-detect the spoken language.
Performance of supported languages
List of two-letter language codes
Whisper model that will be used for transcription. Choose custom
to use the model name in custom_model
, which may be a HuggingFace model ID like "Systran/faster-distil-whisper-small.en".
The default model is tiny-int8
, a compressed version of the smallest Whisper model which is able to run on a Raspberry Pi 4.
Compressed models (int8
) are slightly less accurate than their counterparts, but smaller and faster. Distilled models are not compressed, but are faster and smaller than their non-distilled counterparts.
Available models:
tiny-int8
(compressed)tiny
tiny.en
(English only)base-int8
(compressed)base
base.en
(English only)small-int8
(compressed)distil-small.en
(distilled, English only)small
small.en
(English only)medium-int8
(compressed)distil-medium.en
(distilled, English only)medium
medium.en
(English only)large
large-v1
distil-large-v2
(distilled, English only)large-v2
distil-large-v3
(distilled, English only)large-v3
Path to a converted model directory, or a CTranslate2-converted Whisper model ID from the HuggingFace Hub like "Systran/faster-distil-whisper-small.en".
Number of candidates to consider simultaneously during transcription (see beam search).
Increasing the beam size will increase accuracy at the cost of performance.
Description of audio that can help Whisper transcribe unusual words better. See this discussion for an example.
Whisper model files can be quite large, so they are automatically excluded from backups. The models will be re-downloaded when the backup is restored.
Got questions?
You have several options to get them answered:
- The Home Assistant Discord Chat Server.
- The Home Assistant Community Forum.
- Join the Reddit subreddit in /r/homeassistant
In case you've found an bug, please open an issue on our GitHub.