Sustainable long-term WordNet development and maintenance: Case study of the Czech WordNet


Authors

RAMBOUSEK Adam; PALA Karel; HORÁK Aleš

Year of publication 2018
Type Article in Periodical
Magazine / Source Cognitive Studies | Études cognitives
MU Faculty or unit Faculty of Informatics

Web Open Access Journal
Doi http://dx.doi.org/10.11649/cs.1715
Keywords EuroWordNet; BalkaNet; wordnet; Czech WordNet; DEBVisDic
Description Czech WordNet (CzWN) is one of the first national wordnets, created during the EuroWordNet and BalkaNet projects. However, the data contains various issues that affect the use of Czech WordNet in NLP applications. Since the publication of the first CzWN version, the semantic network has been augmented in several phases, but the complex final editing and publishing process has not been finished. In 2017, we started a project to evaluate and update the Czech WordNet, followed by a connection to the Collaborative Interlingual Index. In this paper, we provide an overview of Czech WordNet data updates and extensions, and present the roadmap for publishing a revised version of the Czech WordNet under an open license. Moreover, we introduce a concept for long-term updates and maintenance of the data based on crowdsourcing activities.