Welcome to the Auto Glossing Stem Translation repository! This repository contains the code accompanying our system description paper. Our system is based on the framework presented in Tü-CL at SIGMORPHON 2023: Straight-Through Gradient Estimation for Hard Attention.
To get started, follow these steps:

- Create a virtual environment: We recommend using Anaconda for managing environments. You can create a new environment named `glossing` with Python 3.9 and pip by running:

  ```
  conda create -n glossing python=3.9 pip
  ```

- Activate the environment: Activate the newly created environment with:

  ```
  conda activate glossing
  ```

- Install dependencies: Install the required dependencies listed in `requirements.txt` using pip:

  ```
  pip install -r requirements.txt
  ```
Ensure that you have a folder named `data` in the repository. You can place your own data into this folder. The data format should align with the format obtained from the shared task's main repository.
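For orientation, the shared task data uses a Toolbox-style interlinear format with backslash-prefixed lines: `\t` for the transcription, `\m` for the morpheme segmentation (typically present only in the open-track files), `\g` for the glosses, and `\l` for the free translation. The sketch below is only an illustrative placeholder with made-up tokens; consult the shared task's main repository for the exact conventions.

```
\t word1-sfx word2
\m word1 -sfx word2
\g stem1-GLOSS stem2
\l Free translation of the sentence.
```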
To train a single model and obtain predictions for the corresponding test set, execute the following command:

```
python main.py --language LANGUAGE --model MODEL --track TRACK
```

Replace `LANGUAGE` with the desired language from the shared task dataset. The `MODEL` parameter should be set to `morph` for the joint segmentation and glossing model. The `TRACK` parameter can be either `1` for the closed track or `2` for the open track. For additional hyperparameters, refer to the script's help output (`python main.py --help`).
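For example, a closed-track run of the joint model could look like the following; the language value `Tsez` is only a placeholder here, so use whichever language identifier matches your files from the shared task dataset:

```
python main.py --language Tsez --model morph --track 1
```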
To find the best hyperparameters, you can utilize the `hyperparameter_tuning.py` script:

```
python hyperparameter_tuning.py --language LANGUAGE --model MODEL --track TRACK --trials TRIALS
```

The `TRIALS` parameter specifies the number of evaluated hyperparameter combinations. We used 50 trials to obtain the hyperparameters provided in `best_hyperparameters.json`, which is included in this repository.
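As a concrete example, a 50-trial search for a single language on the closed track could be launched as follows (again, the language value is only a placeholder):

```
python hyperparameter_tuning.py --language Tsez --model morph --track 1 --trials 50
```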
To retrain all models with the best hyperparameters, run:
To obtain predictions for the test data from the trained models, use:

```
python predict_from_model.py
```
If you find this code useful, please consider citing our paper. The citation will be updated soon.