ItalianFrameNet

A collection of data on Italian FrameNet that I manually annotated during my PhD.

There are currently two datasets available:

1.) ITA-EUROPARL_gold.xml: 987 Italian sentences extracted from the English-Italian bitext of Europarl, manually annotated with frames and frame elements (FEs) from FrameNet 1.3. The corresponding English sentences had previously been used to build an English-German and an English-French gold standard (available at http://www.nlpado.de/~sebastian/srl_data.html) and to evaluate transfer experiments on these two language pairs (Pado and Lapata, 2005; Pado and Pitel, 2007).

2.) MULTIBERKELEY_gold.xml: 391 sentences from the Berkeley FrameNet corpus, manually translated and then annotated with frames and FEs from FrameNet 1.3, one frame per sentence.
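
This README does not describe the XML schema of the two files, so a first step is usually to check which elements they contain. The following is a minimal sketch that makes no assumption about element or attribute names: it only counts how often each tag occurs. The function name `count_tags` is hypothetical and is not part of this repository.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_tags(source):
    """Count occurrences of every element tag in an XML file.

    `source` may be a file name or a binary file object.
    No particular schema is assumed; tags are reported as found.
    """
    counts = Counter()
    # iterparse streams the document, so the whole corpus
    # does not have to be held in memory at once.
    for _event, elem in ET.iterparse(source, events=("end",)):
        counts[elem.tag] += 1
        # Children have already been counted (their "end" events
        # fire before the parent's), so they can be released here.
        elem.clear()
    return counts
```

Calling `count_tags("ITA-EUROPARL_gold.xml").most_common()` from the repository folder lists the tags of that file from most to least frequent, which gives a quick overview of the markup before writing a schema-specific reader.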

For details on the two corpora, how they were created, and which frames they include, see my PhD thesis:

Sara Tonelli (2010). Semi-automatic Techniques for Extending the FrameNet Lexical Database to New Languages. PhD thesis, Dept. of Language Sciences, Università Ca’ Foscari, Venezia, Italy (available at: http://dspace.unive.it/handle/10579/1025)
