An Update of the Manually Annotated Amharic Corpus

Authors RYCHLÝ Pavel; LEMMA Gezahegn Tsegaye

Year of publication 2018
Type Article in Proceedings
Conference Proceedings of the Twelfth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2018
MU Faculty or unit Faculty of Informatics

Keywords text corpus; Amharic corpus; part-of-speech tagging
Description The paper describes an update of the manually annotated Amharic corpus WIC 2.0. It lists the problems of the previous version of the corpus and shows that even small changes in the corpus annotation could lead to a higher quality of trained part-of-speech taggers.