Corpus Annotation Tool

Investor logo


This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on



Year of publication 2017
MU Faculty or unit

Faculty of Informatics

Description The main goal of this work is an annotation tool for easy and fast production of small annotated corpora for languages without linguistics resources. The tool is optimised for the following priorities: Simple tool for instant usage: The client part of the tool is a web application which works in any browser, it requires small amount of all resources (memory, internet connection, screen size), it could be used even from mobile devices (tablets, phones). Using small standard tag set: If possible, we have used Universal Dependencies (UD) tag set (version 2). It is well documented and used for many different languages. The main description of the tags is directly included in the tool with links to the UD site. Fast navigation: Users can use mouse or keyboard for navigation, only one or two mouse clicks or key strokes are needed for annotation of one token. All sentences are pre-annotated via an builtin adaptive tagger and only incorrect annotation have to be corrected by users. Clean texts: Users can (and are instructed to) reject to annotate a sentence if the sentence is not clear.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.