A Case Study in Word Sketches - Czech Verb vidět 'see'

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.

Authors

PALA Karel RYCHLÝ Pavel

Year of publication 2010
Type Chapter of a book
MU Faculty or unit

Faculty of Informatics

Citation
Description In this paper we discuss errors that can be found in word sketches. We have conceived it as a case study in which we describe collocational behaviour ob the czech verb vidět 'see' in new large Czech corpus Czes-Eso (84,602,174 tokens). The frequency of vidět in the Czes-Eso corpus is 33.275 tikens, thus some of our observations can be considered general enough. We also deal with errors found in word sketch table of the verb vidět, clasify them, and offer some solutions for correction, which should improve the quality of the data produced by the Sketch Engine.
Related projects: