Software and Data for Corpus Pattern Analysis

Authors

BAISA Vít EL MAAROUF Ismail RYCHLÝ Pavel RAMBOUSEK Adam

Year of publication 2015
Type Article in Proceedings
Conference Ninth Workshop on Recent Advances in Slavonic Natural Language Processing
MU Faculty or unit

Faculty of Arts

Citation
Field Linguistics
Keywords Corpus Pattern Analysis; Pattern Dictionary of English Verbs; Sketch Engine; linked open data; ontology; LEMON
Description This report describes the tools and resources developed to support Corpus Pattern Analysis (CPA)—a corpus-based method for building patterns dictionaries. The tools are an annotation of concordance in Sketch Engine, a special CPA editor for editing Pattern Dictionary of English Verbs (PDEV), dedicated servlets based on the Dictionary Editing and Browsing platform and a public interface for browsing the PDEV. The resources are SemEval 2015 Task 15 dataset and LEMON API.
Related projects: