A multilingual evaluation dataset for monolingual word sense alignment
Ahmadi, Sina ; McCrae, John P. ; Nimb, Sanni ; Khan, Fahad ; Monachini, Monica ; Pedersen, Bolette S. ; Declerck, Thierry ; Wissik, Tanja ; Bellandi, Andrea ; Pisani, Irene ... show 10 more
Ahmadi, Sina
McCrae, John P.
Nimb, Sanni
Khan, Fahad
Monachini, Monica
Pedersen, Bolette S.
Declerck, Thierry
Wissik, Tanja
Bellandi, Andrea
Pisani, Irene
Loading...
Identifiers
Publication Date
2020-05-16
Type
Conference Paper
Downloads
Citation
Ahmadi, Sina, McCrae, John P. et al. (2020). A multilingual evaluation dataset for monolingual word sense alignment, Paper presented at the 12th International Conference on Language Resources and Evaluation (LREC), Marseille, France (11-16 May).
Abstract
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data will pave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriously requiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.
Publisher
National University of Ireland Galway
Publisher DOI
Rights
Attribution-NonCommercial-NoDerivs 3.0 Ireland