asr_bleu_rm_silence

zhangshaolei

Jun 4, 2024

fca58d8 · Jun 4, 2024

Name	Last commit message	Last commit date
README.md	first commit	Jun 4, 2024
__init__.py	first commit	Jun 4, 2024
asr_model_cfgs.json	first commit	Jun 4, 2024
compute_asr_bleu.py	first commit	Jun 4, 2024
requirements.txt	first commit	Jun 4, 2024
test.txt	first commit	Jun 4, 2024
text.txt	first commit	Jun 4, 2024
utils.py	first commit	Jun 4, 2024

README.md

ASR-BLEU evaluation toolkit

This toolkit provides a set of public ASR models used to evaluate different speech-to-speech translation systems at Meta AI. It makes it easier to compare scores across different systems' outputs.

The ASRGenerator wraps different CTC-based ASR models from the HuggingFace and fairseq code bases. A torchaudio CTC decoder is built on top of it to decode the given audio files.

Please see asr_model_cfgs.json for the list of currently covered languages.
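To check which languages are covered without opening the file by hand, a small helper along the following lines may be enough. It is only a sketch: it assumes that the top-level keys of asr_model_cfgs.json are the language tags, which this README does not state, and the function name `list_covered_languages` is hypothetical rather than part of the toolkit.

```python
import json
from pathlib import Path


def list_covered_languages(cfg_path):
    """Return the sorted top-level keys of a JSON config file.

    Assumption (not confirmed by the README): the top-level keys of
    asr_model_cfgs.json are the supported language tags.
    """
    with Path(cfg_path).open(encoding="utf-8") as f:
        cfgs = json.load(f)
    if not isinstance(cfgs, dict):
        raise ValueError("expected a JSON object at the top level")
    return sorted(cfgs)


# Example call, run from the toolkit directory:
# print(list_covered_languages("asr_model_cfgs.json"))
```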

The high-level pipeline is simple by design: given a language tag, the script loads the corresponding ASR model, transcribes the audio predicted by the translation model, and computes the BLEU score against the provided reference translations using sacrebleu.
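The shape of this pipeline can be sketched in a few lines. The code below is an illustration under stated assumptions, not the code in compute_asr_bleu.py: the names `asr_bleu` and `sacrebleu_score` are hypothetical, the ASR model is represented by any callable that maps an audio path to a transcript, and the references are assumed to be aligned one-to-one with the audio files. Only the `sacrebleu.corpus_bleu` call is a real library API.

```python
def sacrebleu_score(hypotheses, references):
    """Corpus-level BLEU via sacrebleu.

    The import is lazy so the rest of the sketch runs without sacrebleu
    installed (install it with `pip install sacrebleu`).
    """
    import sacrebleu

    # corpus_bleu takes a list of hypotheses and a list of reference streams.
    return sacrebleu.corpus_bleu(list(hypotheses), [list(references)]).score


def asr_bleu(audio_paths, references, transcribe, score=sacrebleu_score):
    """Transcribe each predicted audio file, then score against references.

    transcribe: hypothetical callable (audio path -> transcript string),
                standing in for the loaded ASR model.
    score:      callable (hypotheses, references) -> float.
    Assumption: references[i] is the reference for audio_paths[i].
    """
    if len(audio_paths) != len(references):
        raise ValueError("need exactly one reference per audio file")
    hypotheses = [transcribe(path) for path in audio_paths]
    return score(hypotheses, references)
```

Passing the transcriber and the scorer as arguments keeps the sketch independent of any particular ASR backend and makes it easy to try with stand-in functions before loading a real model.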

Dependencies

Please see requirements.txt.

Usage examples

This toolkit has been used with:

Speechmatrix project: https://github.com/facebookresearch/fairseq/tree/ust/examples/speech_matrix.
Hokkien speech-to-speech translation project: https://github.com/facebookresearch/fairseq/tree/ust/examples/hokkien.

Standalone run example

A high-level example; substitute the arguments for your case:

python compute_asr_bleu.py --lang <LANG> \
--audio_dirpath <PATH_TO_AUDIO_DIR> \
--reference_path <PATH_TO_REFERENCES_FILE> \
--reference_format txt

For more details about the arguments, please see the script's argparser help.
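When the evaluation has to be launched from another Python program, the same command line can be assembled programmatically. The following is a minimal sketch that uses only the four flags shown above; the helper name `build_asr_bleu_command` is hypothetical, and the script may accept further arguments that are not covered here.

```python
import sys


def build_asr_bleu_command(lang, audio_dirpath, reference_path,
                           reference_format="txt",
                           script="compute_asr_bleu.py"):
    """Build the argument list for the standalone run shown above.

    Only the four flags from the README example are used. The values
    passed in are the caller's own; nothing is validated here.
    """
    return [
        sys.executable, script,
        "--lang", lang,
        "--audio_dirpath", str(audio_dirpath),
        "--reference_path", str(reference_path),
        "--reference_format", reference_format,
    ]


# Example call with placeholder values, run from the toolkit directory:
# import subprocess
# subprocess.run(build_asr_bleu_command("<LANG>", "<PATH_TO_AUDIO_DIR>",
#                                       "<PATH_TO_REFERENCES_FILE>"), check=True)
```

Building the command as a list rather than a single string avoids shell quoting problems when paths contain spaces.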
