LDA-Frames: An Unsupervised Approach to Generating Semantic Frames

Authors

MATERNA Jiří

Year of publication 2012
Type Article in Proceedings
Conference Computational Linguistics and Intelligent Text Processing, 13th International Conference, CICLing 2012, Part I
MU Faculty or unit

Faculty of Informatics

Citation
Doi http://dx.doi.org/10.1007/978-3-642-28604-9_31
Field Informatics
Keywords LDA-frames; semantic frame; Latent Dirichlet Allocation
Description In this paper we introduce a novel approach to identifying semantic frames in semantically unlabelled text corpora. Many frame formalisms exist, but most require that all frames be created manually and that the set of semantic roles be predefined. The LDA-Frames approach, based on Latent Dirichlet Allocation, avoids both problems by using statistics gathered from a syntactically tagged corpus. The only information that must be supplied is the number of semantic frames and the number of semantic roles to identify. The power of LDA-Frames is demonstrated first on a small sample corpus and then on the British National Corpus.
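To make the description concrete, the following is a minimal sketch of the kind of generative story an LDA-style frame model could assume. It is an illustration only, not the paper's implementation: this record does not describe the model details or the inference procedure. The sketch takes the two inputs the description names (a number of frames and a number of roles). Everything else is a hypothetical assumption chosen for illustration: the function names, the two syntactic slots (for example subject and object), and the Dirichlet parameters `alpha` and `beta`.

```python
import random


def sample_dirichlet(rng, concentration, size):
    """Draw one sample from a symmetric Dirichlet via normalized gamma draws."""
    draws = [rng.gammavariate(concentration, 1.0) for _ in range(size)]
    total = sum(draws)
    return [d / total for d in draws]


def generate_corpus(lexical_units, vocabulary, num_frames, num_roles,
                    num_slots=2, realizations_per_unit=5,
                    alpha=0.5, beta=0.5, seed=0):
    """Generate toy (unit, frame_id, slot_words) realizations.

    Assumed story (hypothetical, for illustration):
      - every semantic role is a distribution over words,
      - every frame assigns one role to each syntactic slot,
      - every lexical unit has its own distribution over frames.
    """
    rng = random.Random(seed)

    # One word distribution per semantic role.
    role_word_dist = [sample_dirichlet(rng, beta, len(vocabulary))
                      for _ in range(num_roles)]

    # A frame is a tuple of role ids, one per syntactic slot.
    frames = [tuple(rng.randrange(num_roles) for _ in range(num_slots))
              for _ in range(num_frames)]

    corpus = []
    for unit in lexical_units:
        # One frame distribution per lexical unit.
        frame_dist = sample_dirichlet(rng, alpha, num_frames)
        for _ in range(realizations_per_unit):
            frame_id = rng.choices(range(num_frames), weights=frame_dist)[0]
            # Fill every slot with a word drawn from that slot's role.
            words = tuple(
                rng.choices(vocabulary, weights=role_word_dist[role])[0]
                for role in frames[frame_id]
            )
            corpus.append((unit, frame_id, words))
    return frames, corpus


if __name__ == "__main__":
    frames, corpus = generate_corpus(
        lexical_units=["eat", "drive"],
        vocabulary=["man", "car", "apple", "dog"],
        num_frames=3,
        num_roles=2,
    )
    print(frames)
    for realization in corpus:
        print(realization)
```

An unsupervised method works in the opposite direction: it observes only the lexical units and the words filling their slots, and has to infer the frames and roles that plausibly produced them.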