Czech Question Answering with Extended SQAD v3.0 Benchmark Dataset

This publication belongs to the Faculty of Informatics, not the Faculty of Arts. The official publication page is on the muni.cz website.

Authors

SABOL Radoslav, MEDVEĎ Marek, HORÁK Aleš

Year of publication: 2019
Type: Article in proceedings
Conference: Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2019
MU Faculty / Unit

Faculty of Informatics

Keywords: question answering; QA benchmark dataset; SQAD; Czech
Description: In this paper, we introduce a new version of the Simple Question Answering Database (SQAD). The main asset of the new version lies in increasing the number of records to a total of 13,473. Besides the database enlargement, the new version incorporates new restrictions that specify different formats of the expected answer for a given question. These new restrictions are connected with automatic database consistency checks, where new sub-processes safeguard the correctness and consistency of the database. We also introduce a new on-line annotation tool, which offered a unified environment for extending the SQAD data in a crowdsourcing experiment.