Infos

Still in early development, use with caution

Infos

Wit.ai is free, but only allows only files with max-length 10s. This script will iteratively try to cut audio of unlimited size into chunks of appropriate size, while (hopefully) respecting word boundaries. Then the chunks are sent to wit and a textfile is exported.

Install:

You need Python 3
download this repo
cd into it
pip3 install -r requirements.txt

Usage:

Change WIT_KEY to your wit.ai key (free)
Put files of most audio formats into "input" folder
run python transcribe.py
output file should be in "output" folder

For most Youtube Videos Youtube already created captions. You can download those with yt_captions.py.

Create a file input.txt with one Youtube Vid per Line
Run python yt_captions.py
SRT Files are being create

TODO:

~~Empty Cache after run~~
~~Different File Types~~
~~Performance Improvements on first split~~
Reuse Info from first split?
Files via Command Line Arguments
maximize use of rate limit of wit.ai

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
cache		cache
input		input
output		output
.gitignore		.gitignore
__init__.py		__init__.py
readme.md		readme.md
requirements.txt		requirements.txt
transcribe.py		transcribe.py
yt_captions.py		yt_captions.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Infos

Install:

Usage:

TODO:

About

Releases

Packages

Languages

pkreissel/SpeechToText

Folders and files

Latest commit

History

Repository files navigation

Infos

Install:

Usage:

TODO:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages