Skip to content

vadno/emmorph2ud

Repository files navigation

emmorph2ud

The script converts the output tag of emMorph morphological analyzer to the corresponding output tag of magyarlanc 3.0.

What's in this repo?

  • the main script of the converter: __main__.py (run it with python3 -m emmorph2ud)
  • auxiliary files in folder converterdata
  • license
  • this readme

The tagsets 🇭🇺

A detailed description of the tagsets is available here.

emMorph

emMorph is the current morphological analyzer for Hungarian and it is integrated into the e-magyar language processing toolchain. The list of emMorph tags is from here.

UD

UD follows Universal Dependencies, a framework for cross-linguistically consistent grammatical annotation. UD is the output of the third version of magyarlanc, a toolkit for linguistic processing of Hungarian texts and it is also an annotation scheme for a version of the largest fully manually annotated corpus of Hungarian, UD Hungarian Szeged.

How to use the converter?

  • standard input: token, lemma, emmorph tag separated by tab
  • standard output: ud tag

Dependencies

Python3

License

GNU General Public License v3.0

Our converters

About

morphology converter from emmorph to ud

Resources

License

Stars

Watchers

Forks

Packages

No packages published