MIaS: Math-Aware Retrieval in Digital Mathematical Libraries
Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore...
Published in: | Proceedings of the 27th ACM International Conference on Information and Knowledge Management |
---|---|
Main Authors: | , , |
Format: | Article in Journal/Newspaper |
Language: | English |
Published: |
Association for Computing Machinery
2018
|
Subjects: | |
Online Access: | https://is.muni.cz/publication/1430425 https://doi.org/10.1145/3269206.3269233 |
id |
ftmasarykis:oai:is.muni.cz:1430425 |
---|---|
record_format |
openpolar |
spelling |
ftmasarykis:oai:is.muni.cz:1430425 2023-10-25T01:38:03+02:00 MIaS: Math-Aware Retrieval in Digital Mathematical Libraries Sojka Petr Růžička Michal Novotný Vít 2018 4 https://is.muni.cz/publication/1430425 https://doi.org/10.1145/3269206.3269233 eng eng Association for Computing Machinery https://is.muni.cz/publication/1430425 info:eu-repo/semantics/restrictedAccess Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18) Math Information Retrieval DML EuDML Digital Mathematical Libraries vyhledávání matematiky digitální matematické knihovny info:eu-repo/semantics/article D 2018 ftmasarykis https://doi.org/10.1145/3269206.3269233 2023-09-28T15:14:34Z Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the full-text search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task. Article in Journal/Newspaper DML Masaryk University: Open Services of Information System Proceedings of the 27th ACM International Conference on Information and Knowledge Management 1923 1926 |
institution |
Open Polar |
collection |
Masaryk University: Open Services of Information System |
op_collection_id |
ftmasarykis |
language |
English |
topic |
Math Information Retrieval DML EuDML Digital Mathematical Libraries vyhledávání matematiky digitální matematické knihovny |
spellingShingle |
Math Information Retrieval DML EuDML Digital Mathematical Libraries vyhledávání matematiky digitální matematické knihovny Sojka Petr Růžička Michal Novotný Vít MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |
topic_facet |
Math Information Retrieval DML EuDML Digital Mathematical Libraries vyhledávání matematiky digitální matematické knihovny |
description |
Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the full-text search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task. |
format |
Article in Journal/Newspaper |
author |
Sojka Petr Růžička Michal Novotný Vít |
author_facet |
Sojka Petr Růžička Michal Novotný Vít |
author_sort |
Sojka Petr |
title |
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |
title_short |
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |
title_full |
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |
title_fullStr |
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |
title_full_unstemmed |
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |
title_sort |
mias: math-aware retrieval in digital mathematical libraries |
publisher |
Association for Computing Machinery |
publishDate |
2018 |
url |
https://is.muni.cz/publication/1430425 https://doi.org/10.1145/3269206.3269233 |
genre |
DML |
genre_facet |
DML |
op_source |
Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18) |
op_relation |
https://is.muni.cz/publication/1430425 |
op_rights |
info:eu-repo/semantics/restrictedAccess |
op_doi |
https://doi.org/10.1145/3269206.3269233 |
container_title |
Proceedings of the 27th ACM International Conference on Information and Knowledge Management |
container_start_page |
1923 |
op_container_end_page |
1926 |
_version_ |
1780733040850370560 |