Prague Dependency Treebank Annotation Errors: A Preliminary Analysis



Year of publication 2009
Type Article in Proceedings
Conference RASLAN 2009 : Recent Advances in Slavonic Natural Language Processing
MU Faculty or unit Faculty of Informatics
Field Informatics
Keywords error in text; annotation; Prague Dependency Treebank; PDT
Description This paper presents a basic analysis of syntactic annotation errors and inconsistencies in the Prague Dependency Treebank (PDT), the largest corpus of Czech with manual syntactic annotation. The corpus is used to develop and test many syntactic analysers of Czech, so problems in its annotation substantially affect both the evaluation of parser quality and the results of precision measurements. We identify some of the basic annotation problems and, in some cases, outline possible solutions.