WHISPERFILE(1) General Commands Manual WHISPERFILE(1)
NAME
whisperfile — speech to text engine
SYNOPSIS
whisperfile [flags...] -m model.gguf -f audio.wav
whisperfile [flags...] --server -m model.gguf
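The two invocation forms above can be assembled programmatically. The following Python sketch is illustrative only: it assumes the executable is installed on PATH under the name whisperfile, and the helper names transcribe_argv and server_argv are hypothetical. It uses only the flags shown in the synopsis.

```python
import shlex

# Assumption: the executable is on PATH under this name.
WHISPERFILE = "whisperfile"


def transcribe_argv(model, audio, extra_flags=()):
    """Build argv for the first synopsis form: [flags...] -m MODEL -f AUDIO."""
    return [WHISPERFILE, *extra_flags, "-m", str(model), "-f", str(audio)]


def server_argv(model, extra_flags=()):
    """Build argv for the second synopsis form: [flags...] --server -m MODEL."""
    return [WHISPERFILE, *extra_flags, "--server", "-m", str(model)]


if __name__ == "__main__":
    # Print shell-quoted commands; pass the lists to subprocess.run() to execute.
    print(shlex.join(transcribe_argv("model.gguf", "audio.wav")))
    print(shlex.join(server_argv("model.gguf")))
```

Building the command as a list rather than a single string avoids shell quoting problems when model or audio paths contain spaces.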
DESCRIPTION
whisperfile is a high-performance implementation of OpenAI's Whisper
model, based on the whisper.cpp software written by Georgi Gerganov
et al. It converts speech to text. It can also translate speech in
other languages into English while transcribing.
OPTIONS
The following options are available:
--version
Print version and exit.
-h, --help
Show help message and exit.
--server
Puts program in HTTP server mode.
-m FNAME, --model FNAME
Path of Whisper model weights. See
https://huggingface.co/ggerganov/whisper.cpp
-f FNAME, --file FNAME
Path of audio file to transcribe. The preferred audio format is
a 16 kHz 16-bit signed linear WAV file, which can be stereo or
mono. It's also permissible to pass an MP3, FLAC, or OGG file,
in which case it'll be converted to a .wav file in your temp
directory before transcribing.
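To tell in advance whether a file already matches the preferred format for -f, its header can be inspected. The sketch below uses Python's standard wave module. The function name is_preferred_wav is hypothetical, and the check is an illustration, not a description of how whisperfile itself validates input.

```python
import wave


def is_preferred_wav(path):
    """Return True if `path` is a 16 kHz, 16-bit PCM WAV file (mono or stereo)."""
    try:
        with wave.open(str(path), "rb") as wav:
            return (
                wav.getframerate() == 16000        # 16 kHz sample rate
                and wav.getsampwidth() == 2        # 2 bytes per sample = 16-bit
                and wav.getnchannels() in (1, 2)   # mono or stereo
            )
    except (wave.Error, EOFError):
        # Not a PCM WAV file the wave module can parse (e.g. MP3, FLAC, OGG).
        return False
```

A file for which this returns False may still be accepted if it is MP3, FLAC, or OGG, at the cost of the conversion step described above.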
-tr, --translate
Translate audio into English text.
-ot N, --offset-t N
Time offset in milliseconds.
-d N, --duration N
Duration of audio to process in milliseconds.
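Because -ot and -d both take milliseconds, a small conversion helper can prevent unit mistakes. The Python sketch below is illustrative; the names to_milliseconds and time_window_flags are hypothetical, and the returned list is meant to be spliced into a whisperfile command line.

```python
def to_milliseconds(hours=0, minutes=0, seconds=0.0):
    """Convert an hours/minutes/seconds time to whole milliseconds."""
    return round((hours * 3600 + minutes * 60 + seconds) * 1000)


def time_window_flags(start_ms, length_ms):
    """Return the -ot/-d flags selecting `length_ms` of audio from `start_ms`."""
    if start_ms < 0 or length_ms < 0:
        raise ValueError("offset and duration must not be negative")
    return ["-ot", str(start_ms), "-d", str(length_ms)]


if __name__ == "__main__":
    # Skip the first 1 min 30 s, then process the following 45 s.
    start = to_milliseconds(minutes=1, seconds=30)
    length = to_milliseconds(seconds=45)
    print(time_window_flags(start, length))
```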
-np, --no-prints
Do not print anything other than the results.
-pc, --print-colors
Enables CLI printing of ANSI color codes.
Transcribed text will appear in the terminal on a spectrum of
color ranging from green to red. Green represents confidence
whereas red represents uncertainty.
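The idea of a green-to-red confidence spectrum can be illustrated with a short Python sketch. This is not whisperfile's actual palette or mapping, which this page does not specify: the linear interpolation, the 24-bit ANSI color sequences, and the token probabilities in the example are all assumptions made for illustration.

```python
RESET = "\x1b[0m"  # ANSI sequence that restores the default text color


def confidence_color(p):
    """Map a confidence in [0, 1] to a 24-bit ANSI foreground color code.

    0.0 gives pure red (uncertain) and 1.0 gives pure green (confident).
    """
    p = min(max(p, 0.0), 1.0)          # clamp out-of-range inputs
    red = round(255 * (1.0 - p))
    green = round(255 * p)
    return f"\x1b[38;2;{red};{green};0m"


def colorize(tokens):
    """Render (text, confidence) pairs as one colored string."""
    return "".join(f"{confidence_color(p)}{text}{RESET}" for text, p in tokens)


if __name__ == "__main__":
    # Hypothetical tokens with made-up confidence values.
    print(colorize([("Hello", 0.95), (" wrold", 0.20)]))
```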
-t N, --threads N
Overrides number of threads to use.
The default is the number of cores available on the current
system. On systems with hyperthreading, that's half the number
of CPUs reported by the system. On systems that have efficiency
cores, e.g. Intel Alder Lake, the default behavior is to use
only the performance cores.
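The default thread count described for -t can be modeled roughly as follows. This Python sketch is a simplified model, not whisperfile's detection code: the function name default_threads is hypothetical, the caller must supply the hardware facts (Python's standard library cannot distinguish core types), and it assumes efficiency cores each report a single CPU.

```python
def default_threads(reported_cpus, hyperthreading=False, efficiency_cores=0):
    """Estimate the default thread count from caller-supplied hardware facts.

    reported_cpus:    logical CPUs reported by the system
    hyperthreading:   True if each performance core reports two CPUs
    efficiency_cores: number of efficiency cores (assumed one CPU each)
    """
    # Only the performance cores are used, so drop the efficiency cores.
    performance_cpus = reported_cpus - efficiency_cores
    # With hyperthreading, each performance core shows up as two CPUs.
    cores = performance_cpus // 2 if hyperthreading else performance_cpus
    return max(1, cores)


if __name__ == "__main__":
    # 8 cores with hyperthreading: 16 reported CPUs.
    print(default_threads(16, hyperthreading=True))
    # 8 performance cores with hyperthreading plus 8 efficiency cores.
    print(default_threads(24, hyperthreading=True, efficiency_cores=8))
```

Passing -t explicitly remains the way to override whatever default the program picks.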
DOCUMENTATION
Read our Markdown documentation for additional help and tutorials. See
whisper.cpp/doc/index.md in the llamafile source repository on GitHub.
SEE ALSO
llamafile(1), zipalign(1), unzip(1)
Mozilla Ocho August 17, 2024 WHISPERFILE(1)