Určení tematické konzistence dokumentu

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Title in English Determining topic consistency of a document
Authors

MATERNA Jiří

Year of publication 2011
Type Article in Proceedings
Conference Znalosti 2011
MU Faculty or unit

Faculty of Informatics

Citation
Web http://znalosti.ics.upjs.sk
Field Informatics
Keywords fulltext search engine; topic consistency; backlinks
Description The aim of this work is to design and implement a tool, which should be able to assign a score reflecting topic consistency of any web document written in the Czech language. This score is dedicated to be used for deciding whether the document's hyperlinks are appropriate for computing relevancy of referenced documents. In fact, it turns out that inconsistent documents should not be used. The presented algorithm uses both statistical and heuristic methods and has the precision about 93.5 % on the set of 200 test documents.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.