Web Interface and Collection for Mathematical Retrieval : WebMIaS and MREC

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

LÍŠKA Martin SOJKA Petr RŮŽIČKA Michal MRAVEC Peter

Year of publication 2011
Type Article in Proceedings
Conference DML 2011: Towards a Digital Mathematics Library
MU Faculty or unit

Faculty of Informatics

Citation
Web http://dml.cz/handle/10338.dmlcz/702604
Field Informatics
Keywords math indexing and retrieval; mathematical digital libraries; information systems; information retrieval; mathematical content search; document ranking of mathematical papers; math text mining; WebMIaS; MIaS; Tralics; TeX; UMCL; Lucene
Description We demonstrate searching of mathematical expressions in technical digital libraries on a MREC collection of 439,423 real scientific documents with more than 158 million mathematical formulae. Our solution - the WebMIaS system - allows the retrieval of mathematical expressions written in TEX or MathML. TEX queries are converted on-the-fly into tree representations of Presentation MathML, which is used for indexing. WebMIaS allows complex queries composed of plain text and mathematical formulae, using MIaS (Math Indexer and Searcher), a math aware search engine based on the state-of-the-art system Lucene. MIaS implements proximity math indexing with a subformulae similarity search.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.