WHISPERFILE(1)              General Commands Manual             WHISPERFILE(1)

NAME
       whisperfile — speech to text engine

SYNOPSIS
       whisperfile [flags...] -m model.gguf -f audio.wav
       whisperfile [flags...] --server -m model.gguf

DESCRIPTION
       whisperfile is a high-performance implementation of OpenAI's Whisper
       model, based on the whisper.cpp software written by Georgi Gerganov,
       et al. It converts speech to text, and it can also translate speech
       in other languages into English while transcribing.

OPTIONS
       The following options are available:

       --version
               Print version and exit.

       -h, --help
               Show help message and exit.

       --server
               Puts program in HTTP server mode.

       -m FNAME, --model FNAME
               Path of Whisper model weights. See
               https://huggingface.co/ggerganov/whisper.cpp

       -f FNAME, --file FNAME
               Path of audio file to transcribe. The preferred audio format is
               a 16 kHz 16-bit signed linear WAV file, which can be stereo or
               mono. It's also permissible to pass an MP3, FLAC, or OGG file,
               in which case it'll be converted to a .wav file in your temp
               directory before transcribing.
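
               As a minimal sketch, a file can be checked against the
               preferred format before it is passed to -f. The example uses
               only Python's standard wave module. The helper names
               is_preferred_wav and write_silence are hypothetical and are not
               part of whisperfile.

```python
import os
import tempfile
import wave


def is_preferred_wav(path):
    """Return True if path is a 16 kHz, 16-bit PCM WAV file (mono or stereo)."""
    try:
        with wave.open(path, "rb") as w:
            return (
                w.getframerate() == 16000
                and w.getsampwidth() == 2       # 2 bytes per sample = 16-bit
                and w.getnchannels() in (1, 2)  # mono or stereo
            )
    except (wave.Error, EOFError):
        # Not a WAV file the standard library can parse (e.g. MP3, FLAC, OGG).
        return False


def write_silence(path, rate, seconds=1):
    """Write a mono 16-bit WAV of silence, used here only as demo input."""
    with wave.open(path, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(rate)
        w.writeframes(b"\x00\x00" * rate * seconds)


tmpdir = tempfile.mkdtemp()
good = os.path.join(tmpdir, "good.wav")
bad = os.path.join(tmpdir, "bad.wav")
write_silence(good, 16000)   # matches the preferred format
write_silence(bad, 44100)    # wrong sample rate
print(is_preferred_wav(good), is_preferred_wav(bad))
```

               A file that fails this check may still be usable; it is just
               not in the format this page describes as preferred.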

       -tr, --translate
               Translate audio into English text.

       -ot N, --offset-t N
               Time offset in milliseconds at which processing of the audio
               starts.

       -d N, --duration N
               Duration of audio to process in milliseconds.
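
               Both -ot and -d take milliseconds, so timestamps have to be
               converted first. The sketch below assumes that -d counts from
               the offset given by -ot, which this page does not state
               explicitly. The helper names to_ms and segment_flags are
               hypothetical.

```python
def to_ms(timestamp):
    """Convert 'HH:MM:SS', 'MM:SS', or 'SS' (fractions allowed) to milliseconds."""
    seconds = 0.0
    for part in timestamp.split(":"):
        seconds = seconds * 60 + float(part)
    return round(seconds * 1000)


def segment_flags(start, end):
    """Build -ot/-d arguments covering the audio between two timestamps."""
    start_ms, end_ms = to_ms(start), to_ms(end)
    if end_ms <= start_ms:
        raise ValueError("end must come after start")
    # Assumption: the duration is measured from the offset, not from zero.
    return ["-ot", str(start_ms), "-d", str(end_ms - start_ms)]


print(segment_flags("00:01:30", "00:02:00"))
# prints ['-ot', '90000', '-d', '30000']
```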

       -np, --no-prints
               Do not print anything other than the results.

       -pc, --print-colors
               Enables CLI printing of ANSI color codes.

               Transcribed text will appear in the terminal on a  spectrum  of
               color  ranging  from  green to red. Green represents confidence
               whereas red represents uncertainty.

       -t N, --threads N
               Overrides number of threads to use.

               The default is the number of cores available on the current
               system. On systems with hyperthreading, that's half the number
               of CPUs reported by the system. On systems that have efficiency
               cores, e.g. Intel Alder Lake, the default behavior is to use
               only the performance cores.
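
               The options above combine on one command line. As a rough
               sketch, a script could assemble the arguments as follows,
               using only flags documented on this page. The function name
               whisperfile_argv is hypothetical, and the file names are the
               placeholders from the SYNOPSIS. Whether every flag is
               meaningful in server mode is not stated here, so treat that
               combination as an assumption.

```python
def whisperfile_argv(model, audio=None, server=False, threads=None,
                     translate=False, quiet=False):
    """Assemble an argv list using only flags documented on this page."""
    argv = ["whisperfile", "-m", model]
    if server:
        argv.append("--server")        # HTTP server mode, no input file
    elif audio is not None:
        argv += ["-f", audio]
    else:
        raise ValueError("pass an audio file or set server=True")
    if threads is not None:
        if threads < 1:
            raise ValueError("threads must be at least 1")
        argv += ["-t", str(threads)]   # override the default thread count
    if translate:
        argv.append("-tr")             # translate audio into English text
    if quiet:
        argv.append("-np")             # print nothing other than the results
    return argv


argv = whisperfile_argv("model.gguf", "audio.wav", threads=4, quiet=True)
print(" ".join(argv))
# prints whisperfile -m model.gguf -f audio.wav -t 4 -np
```

               The resulting list can be handed to subprocess.run(argv),
               assuming a whisperfile executable is on the PATH.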

DOCUMENTATION
       Read  our Markdown documentation for additional help and tutorials. See
       whisper.cpp/doc/index.md in the llamafile source repository on GitHub.

SEE ALSO
       llamafile(1), zipalign(1), unzip(1)

Mozilla Ocho                    August 17, 2024                 WHISPERFILE(1)