Skip to content

NbAiLab/mmlu-translate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

58 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MMLU Translation Tools

This repository provides tools for translating the Massive Multitask Language Understanding (MMLU) dataset from English to Norwegian, along with evaluation scripts.

Contents

  • Research Protocol: Details the translation process, quality scoring, and evaluation strategy.
  • Translation Quality Evaluation: Evaluation of translation quality.
  • Translation Scripts: Tools to accurately translate MMLU questions while preserving structure and meaning.
  • Evaluation Tools: Built upon lm-evaluation-harness to assess translation quality and model performance on both Norwegian and English datasets.

Overview

The MMLU dataset includes over 14,000 multiple-choice questions across 57 subjects. High-quality translations ensure that the original difficulty and context are maintained for Norwegian audiences.

License

This project is licensed under the MIT License.

About

Translating the MMLU to Norwegian

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published