Improving RNN-based Answer Selection for Morphologically Rich Languages

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.

Authors

MEDVEĎ Marek HORÁK Aleš SABOL Radoslav

Year of publication 2020
Type Article in Proceedings
Conference Proceedings of the 12th International Conference on Agents and Artificial Intelligence
MU Faculty or unit

Faculty of Informatics

Citation
Keywords Question Answering; Question Classification; Answer Classification; Czech; Simple Question Answering Database; SQAD
Description Question answering systems have improved greatly during the last five years by employing architectures of deep neural networks such as attentive recurrent networks or transformer-based networks with pretrained con- textual information. In this paper, we present the results and detailed analysis of experiments with the largest question answering benchmark dataset for the Czech language. The best results evaluated in the text reach the accuracy of 72 %, which is a 4 % improvement to the previous best result. We also introduce the newest version of the Czech Question Answering benchmark dataset SQAD 3.0, which was substantially extended to more than 13,000 question-answer pairs, and we report the first answer selection results on this dataset which indicate that the size of the training data is important for the task.
Related projects: