Large Scale Keyword Extraction using a Finite State Backend
Autoři | |
---|---|
Rok publikování | 2016 |
Druh | Článek ve sborníku |
Konference | Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016 |
Fakulta / Pracoviště MU | |
Citace | JAKUBÍČEK, Miloš a Pavel ŠMERK. Large Scale Keyword Extraction using a Finite State Backend. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016. Brno: Tribun EU, 2016, s. 143-146. ISBN 978-80-263-1095-2. |
www | https://nlp.fi.muni.cz/raslan/2016/paper17-Jakubicek_Smerk.pdf |
Klíčová slova | terminology extraction; keyword extraction; fsa; Sketch Engine |
Popis | We present a novel method for performing fast keyword extraction from large text corpora using a finite state backend. The FSA3 package has been adopted for this purposes. We outline the basic approach and present a comparison with previous hash-based method as used in Sketch Engine. |
Související projekty: |