Czech Proofreading Rules

Authors

HLAVÁČKOVÁ Dana MACHURA Jakub ŽIŽKOVÁ Hana KOVÁŘ Vojtěch NEVĚŘILOVÁ Zuzana

Year of publication 2025
Type Prototype
MU Faculty or unit

Faculty of Arts

Citation
Description The collection describes proofreading errors in Czech covered by Opravidlo 1.0. It consists of: - the grammar rules applicable via the SET Czech syntactic parser - description of grammar rules with relation to ERRANT codes - extended ERRANT ontology, created from the original ERRANT [Bryant et al., 2017] and its Czech extension [Náplava et al., 2022] - Python script that demonstrates how to apply the SET rules to proofreading The dataset contains 6649 SET rules in main categories: agreement, capitals, commas, dependent clauses, non-grammatical structures, pronouns, spelling complex, and others. The error categories form a taxonomy with Czech and English descriptions, examples, and links to ERRANT codes, 175 classes in total.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.