MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore...

Full description

Bibliographic Details
Published in:Proceedings of the 27th ACM International Conference on Information and Knowledge Management
Main Authors: Sojka Petr, Růžička Michal, Novotný Vít
Format: Article in Journal/Newspaper
Language:English
Published: Association for Computing Machinery 2018
Subjects:
DML
Online Access:https://is.muni.cz/publication/1430425
https://doi.org/10.1145/3269206.3269233
id ftmasarykis:oai:is.muni.cz:1430425
record_format openpolar
spelling ftmasarykis:oai:is.muni.cz:1430425 2023-10-25T01:38:03+02:00 MIaS: Math-Aware Retrieval in Digital Mathematical Libraries Sojka Petr Růžička Michal Novotný Vít 2018 4 https://is.muni.cz/publication/1430425 https://doi.org/10.1145/3269206.3269233 eng eng Association for Computing Machinery https://is.muni.cz/publication/1430425 info:eu-repo/semantics/restrictedAccess Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18) Math Information Retrieval DML EuDML Digital Mathematical Libraries vyhledávání matematiky digitální matematické knihovny info:eu-repo/semantics/article D 2018 ftmasarykis https://doi.org/10.1145/3269206.3269233 2023-09-28T15:14:34Z Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the full-text search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task. Article in Journal/Newspaper DML Masaryk University: Open Services of Information System Proceedings of the 27th ACM International Conference on Information and Knowledge Management 1923 1926
institution Open Polar
collection Masaryk University: Open Services of Information System
op_collection_id ftmasarykis
language English
topic Math Information Retrieval
DML
EuDML
Digital Mathematical Libraries
vyhledávání matematiky
digitální matematické knihovny
spellingShingle Math Information Retrieval
DML
EuDML
Digital Mathematical Libraries
vyhledávání matematiky
digitální matematické knihovny
Sojka Petr
Růžička Michal
Novotný Vít
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
topic_facet Math Information Retrieval
DML
EuDML
Digital Mathematical Libraries
vyhledávání matematiky
digitální matematické knihovny
description Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the full-text search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task.
format Article in Journal/Newspaper
author Sojka Petr
Růžička Michal
Novotný Vít
author_facet Sojka Petr
Růžička Michal
Novotný Vít
author_sort Sojka Petr
title MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
title_short MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
title_full MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
title_fullStr MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
title_full_unstemmed MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
title_sort mias: math-aware retrieval in digital mathematical libraries
publisher Association for Computing Machinery
publishDate 2018
url https://is.muni.cz/publication/1430425
https://doi.org/10.1145/3269206.3269233
genre DML
genre_facet DML
op_source Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18)
op_relation https://is.muni.cz/publication/1430425
op_rights info:eu-repo/semantics/restrictedAccess
op_doi https://doi.org/10.1145/3269206.3269233
container_title Proceedings of the 27th ACM International Conference on Information and Knowledge Management
container_start_page 1923
op_container_end_page 1926
_version_ 1780733040850370560