Converts audio into text transcripts. Output transcripts in .vtt files. (feature: highlighting key words in transcript.) Real time transcription. Uses Whisper.