Low-cost ontology development

This publication is affiliated with the Faculty of Informatics, not the Faculty of Arts. The official publication website can be found on muni.cz.

Authors

GRÁC Marek, RAMBOUSEK Adam

Type: Article in Proceedings
Conference: 6th International Global Wordnet Conference Proceedings
MU Faculty or unit: Faculty of Informatics

Field: Informatics
Keywords: ontology; WordNet; annotation; VerbaLex
Description: In this paper, we present a project to build a new lexical resource: a shallow ontology derived from corpora. The ontology is intended primarily for machine translation, syntactic parsing, and word sense disambiguation. The ontology is currently being developed for Czech, but the methodology and tools are suitable for other languages with a similar structure. The ontology is based on the BushBank corpus, which improves the handling of ambiguity in natural language. BushBank data and tools are application-driven, which reduces the time and cost needed to annotate corpora and develop new lexical resources.