When Word Pairs Matter - Analysis of the English-Slovak Evaluation Dataset

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

DENISOVÁ Michaela RYCHLÝ Pavel

Year of publication 2021
Type Article in Proceedings
Conference Proceedings of the Fifteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2021
MU Faculty or unit

Faculty of Informatics

Citation
Web
Keywords Cross-lingual word embeddings; Ground truth dictionary; Evaluation; English; Slovak
Description Cross-lingual word embeddings facilitate the transfer of lexical knowledge across languages, and they are mainly used for finding transla- tion equivalents. Translation equivalents obtained in this way are usually evaluated with the help of ground truth dictionaries. However, the evalu- ation process, including the ground truth dictionaries, differs from model to model, impeding the correct interpretation of the results. Therefore, in this paper, we provide a thorough analysis of the English-Slovak ground truth dictionary and employ our analysis in evaluating two cross-lingual word embedding models. We show that word pairs choice is an important factor when accurately reflecting the model’s performance.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.