Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
experiments		experiments
mbr		mbr
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Repository files navigation

Diverse Minimum Bayes Risk Decoding

This repository contains the code for the experiments in Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding.

The code is provided mostly as is with little effort on refactoring.

Installation

git clone [email protected]:CyberAgentAILab/diverse-mbr
cd diverse-mbr
pip install -r requirements.txt

Usage

The code runs in two steps.

sample.sh samples candidates.
run_mbr.sh computes the MBR candidate from the candidates sampled.

Sampling candidates

./experiments/sample.sh -d [DATASET] -s [NUMBER OF SAMPLES]

Computing Diverse MBR and KMBR

./experiments/run_mbr.sh -d [DATASET] -s [NUMBER OF SAMPLES] -a [ALGORITHM]

Example on WMT'19 En-De

Use sacrebleu to prepare the benchmark dataset.

sacrebleu -t wmt19 -l en-de --echo src > ./dataset/wmt19-text/wmt19.en-de.en
sacrebleu -t wmt19 -l en-de --echo ref > ./dataset/wmt19-text/wmt19.en-de.de

Sample candidates on WMT'19 En-De

./experiments/sample.sh -d wmt19.en-de

Computing the Diverse MBR output on WMT'19 En-De

./experiments/run_mbr.sh -d wmt19.en-de -a diverse

Computing the k-Medoid MBR output on WMT'19 En-De

./experiments/run_mbr.sh -d wmt19.en-de -a kmmbr

Reference

Jinnai, Y., Honda, U., Morimura, T., & Zhang, P. (2024). Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding. arXiv preprint arXiv:2401.05054.

Bibtex:

@article{jinnai2024generating,
      title={Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding}, 
      author={Yuu Jinnai and Ukyo Honda and Tetsuro Morimura and Peinan Zhang},
      year={2024},
      journal={arXiv preprint arXiv:2401.05054}
}

Contact

For any questions, feel free to raise an issue or contact me at [email protected].

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diverse Minimum Bayes Risk Decoding

Installation

Usage

Sampling candidates

Computing Diverse MBR and KMBR

Example on WMT'19 En-De

Reference

Contact

About

Contributors 2

Languages

License

CyberAgentAILab/diverse-mbr

Folders and files

Latest commit

History

Repository files navigation

Diverse Minimum Bayes Risk Decoding

Installation

Usage

Sampling candidates

Computing Diverse MBR and KMBR

Example on WMT'19 En-De

Reference

Contact

About

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages