Czech Proofreading Rules
| Authors | |
|---|---|
| Year of publication | 2025 |
| Type | Prototype |
| MU Faculty or unit | |
| Citation | |
| Description | The collection describes proofreading errors in Czech covered by Opravidlo 1.0. It consists of: (1) the grammar rules applicable via the SET Czech syntactic parser; (2) a description of the grammar rules in relation to ERRANT codes; (3) an extended ERRANT ontology, created from the original ERRANT [Bryant et al., 2017] and its Czech extension [Náplava et al., 2022]; (4) a Python script that demonstrates how to apply the SET rules to proofreading. The dataset contains 6649 SET rules in the following main categories: agreement, capitals, commas, dependent clauses, non-grammatical structures, pronouns, spelling complex, and others. The error categories form a taxonomy of 175 classes in total, with Czech and English descriptions, examples, and links to ERRANT codes. |
| Related projects | |