This tool uses Mozilla DeepSpeech as an example to demonstrate some of the generic components of a speech-audio annotation tool, including an annotation and model inference/training loop that semi-automates the annotation process.
This repo contains the back-end component of the annotation system; the front-end component, speech-audio-annotation-ui, is also required to use the tool.
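The annotation and inference loop mentioned above can be sketched roughly as follows. This is an illustrative outline only: `transcribe` and `review` are hypothetical placeholders standing in for model inference (e.g. DeepSpeech) and the human-review step, not functions from this repo.

```python
def transcribe(audio_path):
    # Placeholder for a real model-inference call that proposes a transcript.
    return "proposed transcript for " + audio_path

def annotate(audio_paths, review):
    """Pre-fill each clip with a model transcript, then let a human review it."""
    corrected = []
    for path in audio_paths:
        draft = transcribe(path)      # model proposes a transcript
        final = review(path, draft)   # human corrects (or accepts) the draft
        corrected.append((path, final))
    # The corrected (audio, transcript) pairs can then feed back into training.
    return corrected

pairs = annotate(["clip1.wav"], review=lambda p, d: d.upper())
print(pairs)  # → [('clip1.wav', 'PROPOSED TRANSCRIPT FOR CLIP1.WAV')]
```

Each pass through the loop should shrink the amount of manual transcription needed, since the reviewer only corrects the model's drafts.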
- Clone the project git repo.
- Download and unzip the pre-trained DeepSpeech model to `<project_root_dir>/outputs`:

  ```
  cd <project_root_dir>/outputs
  curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/deepspeech-0.6.1-models.tar.gz
  tar xvf deepspeech-0.6.1-models.tar.gz
  ```
- Clone the DeepSpeech repo to `<project_root_dir>/models/deepspeech` (only required if model training is needed):

  ```
  git clone https://github.com/mozilla/DeepSpeech.git <project_root_dir>/models/deepspeech
  ```
- Build and run `docker-compose` in `<project_root_dir>` to bring up the containers:

  ```
  docker-compose up --build
  ```
- Clone the UI project repo outside the back-end project repo:

  ```
  git clone https://github.com/francesliang/speech-audio-annotation-ui
  ```
- Build and run `docker-compose` in the UI project's root directory to bring up its containers:

  ```
  docker-compose up --build
  ```
- The URL of the UI should be `localhost:3000`.
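DeepSpeech's `stt` API expects 16 kHz, 16-bit mono PCM audio, so clips submitted for inference need to be in that format. The sketch below uses only the Python standard library to read such samples from a WAV file; it writes a tiny synthetic clip first so it is self-contained. The final `Model(...).stt(...)` call is shown only in a comment, since it requires the `deepspeech` package and the downloaded model.

```python
import struct
import wave

def read_pcm16(path):
    """Return the raw 16-bit samples of a mono WAV file as a list of ints."""
    with wave.open(path, "rb") as w:
        assert w.getsampwidth() == 2 and w.getnchannels() == 1
        frames = w.readframes(w.getnframes())
    return list(struct.unpack("<%dh" % (len(frames) // 2), frames))

# Write a tiny 16 kHz mono clip so the example runs without external files.
with wave.open("clip.wav", "wb") as w:
    w.setnchannels(1)
    w.setsampwidth(2)          # 2 bytes per sample = 16-bit PCM
    w.setframerate(16000)
    w.writeframes(struct.pack("<4h", 0, 1000, -1000, 0))

samples = read_pcm16("clip.wav")
print(samples)  # → [0, 1000, -1000, 0]
# With the `deepspeech` package installed, these samples (as a numpy int16
# array) would be passed to Model(...).stt(...) to produce a draft transcript.
```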