Skip to content

woheller69/whisperIME

Repository files navigation

Donate

Send a coffee to 
[email protected] 


  
Or via this link (with fees)

Voice recognition based on Whisper

Whisper is an input method editor (IME) that leverages voice recognition technology based on the Whisper engine. It offers a seamless user experience, functioning both as a standalone application and an integrated IME that can be activated, e.g. via the microphone button in HeliBoard. As a standalone app Whisper can also translate any supported language to English.

Besides providing an IME, whisper can also be selected as system-wide voice input (RecognitionService) and it supports calls via intent (RecognizerIntent.ACTION_RECOGNIZE_SPEECH).

Initial Setup

Upon launching Whisper for the first time, the app will download the necessary Whisper models (~370 MB) from Hugging Face. Please note that this is the only instance where internet permission is required. Once the models are downloaded, voice recognition works entirely offline, ensuring your privacy and convenience.

Please note that for use as voice input (not as IME) there is a separate settings activity which can be accessed from Android settings (System > Languages > Speech > Voice Input). There you can activate the app as voice input and then click the settings button. In settings you can then select the model for voice input and the maximum recording time (default: 30s).

Model Selection

Whisper offers two models to choose from: a compact English-only model that prioritizes speed and a more comprehensive multi-lingual model that, while much slower, supports a broader range of languages. Select your preferred model within the app, and it will be applied consistently across all uses, including when used as an IME.

Using Whisper

To get the most out of Whisper, follow these simple tips:

  • Press and hold the button while speaking
  • Pause briefly before starting to speak
  • Speak clearly, loudly, and at a moderate pace
  • Please note that there is a limit of 30s for each recording

By following these guidelines, you'll be able to enjoy accurate and efficient voice recognition with Whisper.

Contribute

For translations use https://toolate.othing.xyz/projects/whisperime/

License

This work is licensed under MIT license, © woheller69

  • This app is built on the Whisper-Android project, and OpenAI Whisper published under MIT license
  • It uses Android VAD, which is published under MIT license
  • It uses Opencc4j, for Chinese conversions, published under Apache-2.0 license
  • At first start it downloads the Whisper TFLite models from Hugging Face, which is published under MIT license

OTHER APPS

RadarWeather Gas Prices Smart Eggtimer
Bubble hEARtest GPS Cockpit
Audio Analyzer LavSeeker TimeLapseCam
Arity Cirrus solXpect
gptAssist dumpSeeker huggingAssist
FREE Browser whoBIRD PeakOrama
Whisper Seamless SherpaTTS