MathML-aware Article Conversion from LaTeX

summary:Publishing in Mathematics and theoretical areas in Computer Science and Physics has been predominantly using TeX/LaTeX as a formatting language in the last two decades. This large corpus of born-digital material is both a boon — LaTeX is semi-semantic format where the source often contains i...

Full description

Bibliographic Details
Main Authors: Stamerjohanns, Heinrich, Ginev, Deyan, David, Catalin, Misev, Dimitar, Zamdzhiev, Vladimir, Kohlhase, Michael
Format: Conference Object
Language:English
Published: Masaryk University Press 2009
Subjects:
DML
Online Access:http://hdl.handle.net/10338.dmlcz/702561
id ftdmlcz:oai:oai.dml.cz:10338.dmlcz/702561
record_format openpolar
institution Open Polar
collection DML-CZ (Czech Digital Mathematics Library)
op_collection_id ftdmlcz
language English
topic keyword:University of Western Ontario
keyword:XML
msc:68U10
msc:68U15
spellingShingle keyword:University of Western Ontario
keyword:XML
msc:68U10
msc:68U15
Stamerjohanns, Heinrich
Ginev, Deyan
David, Catalin
Misev, Dimitar
Zamdzhiev, Vladimir
Kohlhase, Michael
MathML-aware Article Conversion from LaTeX
topic_facet keyword:University of Western Ontario
keyword:XML
msc:68U10
msc:68U15
description summary:Publishing in Mathematics and theoretical areas in Computer Science and Physics has been predominantly using TeX/LaTeX as a formatting language in the last two decades. This large corpus of born-digital material is both a boon — LaTeX is semi-semantic format where the source often contains indications of the author’s intentions — and a problem — TeX is Turing-complete and authors use this freedom to use thousands of styles and millions of user macros. Several tools have been developed to convert TeX/LaTeX documents to XML-based — i.e. Web and DML-compatible formats. Different DML Projects use different tools, and the selection seems largely accidental. To put the choice of converters for DML projects onto a more solid footing and to encourage competition and feature convergence we survey the market. In this paper we investigate and compare five LaTeX-to-XML transformers in three dimensions: $a$) ergonomic factors like documentation, ease of installation, $b$) coverage, and $c$) quality of the resulting documents (in particular the MathML parts).
format Conference Object
author Stamerjohanns, Heinrich
Ginev, Deyan
David, Catalin
Misev, Dimitar
Zamdzhiev, Vladimir
Kohlhase, Michael
author_facet Stamerjohanns, Heinrich
Ginev, Deyan
David, Catalin
Misev, Dimitar
Zamdzhiev, Vladimir
Kohlhase, Michael
author_sort Stamerjohanns, Heinrich
title MathML-aware Article Conversion from LaTeX
title_short MathML-aware Article Conversion from LaTeX
title_full MathML-aware Article Conversion from LaTeX
title_fullStr MathML-aware Article Conversion from LaTeX
title_full_unstemmed MathML-aware Article Conversion from LaTeX
title_sort mathml-aware article conversion from latex
publisher Masaryk University Press
publishDate 2009
url http://hdl.handle.net/10338.dmlcz/702561
genre DML
genre_facet DML
op_relation isbn:978-80-210-4781-5
zbl:Zbl 1176.68233
reference:ABC+03. Ausbrooks, Ron: Mathematical Markup Language (MathML) version 2.0 (second edition). W3C recommendation, World Wide Web Consortium, 2003.
reference:Ang09a. Anghelache, Romeo: Hermes discontinued. project page at http://humanist.roua.org/2009/01/01/hermes-paused/, seen May 2009.
reference:Ang09b. Anghelache, Romeo: Hermes website. project page at http://hermes.roua.org/, seen May 2009.
reference:arX. : arxmliv build system. http://arxmliv.kwarc.info.
reference:ArX07. : arXiv.org e-Print archive., seen December2007. web page at http://www.arxiv.org.
reference:Bou08. Thierry, Bouche: Cedrics: When CEDRAM meets Tralics. In Sojka, Petr, editor, Towards Digital Mathematics Library, Proceedings of the DML 2008 workshop, pages 153–165. Masaryk University, Brno, 2008.
reference:CeC09. : Cecill license. http://www.cecill.info/, seen May 2009.
reference:DLM09. : Digital library of mathematical functions. project page at http://dlmf.nist.gov/, seen May 2009. Zbl 1130.65045
reference:Gri03. Grimm, Jose: Tralics, a latex to xml translator., 2003.
reference:KŞ06. Kohlhase, Michael, Şucan, Ioan: A search engine for mathematical formulae. In Ida, Tetsuo, Calmet, Jacques, and Wang, Dongming, editors, Proceedings of Artificial Intelligence and Symbolic Computation, AISC’2006, number 4120 in LNAI, pages 241–253. Springer Verlag, 2006. Zbl 1156.68306
reference:Mil09. Miller, Bruce: LaTeXML website. http://dlmf.nist.gov/LaTeXML/, seen May 2009.
reference:MM06. Munavalli, Rajesh, Miner, Robert: Mathfind: a math-aware search engine. In SIGIR ’06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 735–735, New York, NY, USA, 2006. ACM Press.
reference:PH. Plaice and Yannis Haralambous: Omega website.
reference:Sci09. EDP Sciences: lxir website. http://www.lxir-latex.org/, seen May 2009.
reference:SGD+09. Stamerjohanns, Heinrich, Ginev, Deyan, David, Catalin, Misev, Dimitar, Zamdzhiev, Vladimir, Kohlhase, Michael: A comparison study of mathml-aware LaTeX converters. Kwarc report, Jacobs University Bremen, 2009.
reference:SK08. Stamerjohanns, Heinrich, Kohlhase, Michael: Transforming the ar$\chi $iv to XML. In Autexier, Serge et al., editors, Intelligent Computer Mathematics, 9th International Conference, MKM 2008 Birmingham, UK, July 28 – August 1, 2008, Proceedings, number 5144 in LNAI, pages 574–582. Springer Verlag, 2008. Zbl 1166.68364
reference:Tex09. : TeX4HT website. http://www.cse.ohio-state.edu/~gurari/TeX4ht/, seen May 2009.
reference:Tra09. : Tralics website. http://www-sop.inria.fr/miaou/tralics/, seen May 2009.
reference:TtM09. : TtM website. project page at http://hutchinson.belmont.ma.us/tth/mml/, seen May 2009.
reference:Val09. : Validator website. http://homepage.mac.com/rcrews/software/validator/, seen May 2009.
reference:Wat09. Watt, Stephen: MathML at ORCCA. project page at http://www.orcca.on.ca/MathML/, seen May 2009.
reference:WG09. W3C Math WG: MathML software – converters. http://www.w3.org/Math/Software/mathml_software_cat_converters.html, seen May 2009.
op_rights access:Unrestricted
rights:DML-CZ Czech Digital Mathematics Library, http://dml.cz/
rights:Institute of Mathematics AS CR, http://www.math.cas.cz/
conditionOfUse:http://dml.cz/use
_version_ 1768386230545809408
spelling ftdmlcz:oai:oai.dml.cz:10338.dmlcz/702561 2023-06-11T04:11:17+02:00 MathML-aware Article Conversion from LaTeX Stamerjohanns, Heinrich Ginev, Deyan David, Catalin Misev, Dimitar Zamdzhiev, Vladimir Kohlhase, Michael 2009 application/pdf http://hdl.handle.net/10338.dmlcz/702561 eng eng Masaryk University Press isbn:978-80-210-4781-5 zbl:Zbl 1176.68233 reference:ABC+03. Ausbrooks, Ron: Mathematical Markup Language (MathML) version 2.0 (second edition). W3C recommendation, World Wide Web Consortium, 2003. reference:Ang09a. Anghelache, Romeo: Hermes discontinued. project page at http://humanist.roua.org/2009/01/01/hermes-paused/, seen May 2009. reference:Ang09b. Anghelache, Romeo: Hermes website. project page at http://hermes.roua.org/, seen May 2009. reference:arX. : arxmliv build system. http://arxmliv.kwarc.info. reference:ArX07. : arXiv.org e-Print archive., seen December2007. web page at http://www.arxiv.org. reference:Bou08. Thierry, Bouche: Cedrics: When CEDRAM meets Tralics. In Sojka, Petr, editor, Towards Digital Mathematics Library, Proceedings of the DML 2008 workshop, pages 153–165. Masaryk University, Brno, 2008. reference:CeC09. : Cecill license. http://www.cecill.info/, seen May 2009. reference:DLM09. : Digital library of mathematical functions. project page at http://dlmf.nist.gov/, seen May 2009. Zbl 1130.65045 reference:Gri03. Grimm, Jose: Tralics, a latex to xml translator., 2003. reference:KŞ06. Kohlhase, Michael, Şucan, Ioan: A search engine for mathematical formulae. In Ida, Tetsuo, Calmet, Jacques, and Wang, Dongming, editors, Proceedings of Artificial Intelligence and Symbolic Computation, AISC’2006, number 4120 in LNAI, pages 241–253. Springer Verlag, 2006. Zbl 1156.68306 reference:Mil09. Miller, Bruce: LaTeXML website. http://dlmf.nist.gov/LaTeXML/, seen May 2009. reference:MM06. Munavalli, Rajesh, Miner, Robert: Mathfind: a math-aware search engine. In SIGIR ’06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 735–735, New York, NY, USA, 2006. ACM Press. reference:PH. Plaice and Yannis Haralambous: Omega website. reference:Sci09. EDP Sciences: lxir website. http://www.lxir-latex.org/, seen May 2009. reference:SGD+09. Stamerjohanns, Heinrich, Ginev, Deyan, David, Catalin, Misev, Dimitar, Zamdzhiev, Vladimir, Kohlhase, Michael: A comparison study of mathml-aware LaTeX converters. Kwarc report, Jacobs University Bremen, 2009. reference:SK08. Stamerjohanns, Heinrich, Kohlhase, Michael: Transforming the ar$\chi $iv to XML. In Autexier, Serge et al., editors, Intelligent Computer Mathematics, 9th International Conference, MKM 2008 Birmingham, UK, July 28 – August 1, 2008, Proceedings, number 5144 in LNAI, pages 574–582. Springer Verlag, 2008. Zbl 1166.68364 reference:Tex09. : TeX4HT website. http://www.cse.ohio-state.edu/~gurari/TeX4ht/, seen May 2009. reference:Tra09. : Tralics website. http://www-sop.inria.fr/miaou/tralics/, seen May 2009. reference:TtM09. : TtM website. project page at http://hutchinson.belmont.ma.us/tth/mml/, seen May 2009. reference:Val09. : Validator website. http://homepage.mac.com/rcrews/software/validator/, seen May 2009. reference:Wat09. Watt, Stephen: MathML at ORCCA. project page at http://www.orcca.on.ca/MathML/, seen May 2009. reference:WG09. W3C Math WG: MathML software – converters. http://www.w3.org/Math/Software/mathml_software_cat_converters.html, seen May 2009. access:Unrestricted rights:DML-CZ Czech Digital Mathematics Library, http://dml.cz/ rights:Institute of Mathematics AS CR, http://www.math.cas.cz/ conditionOfUse:http://dml.cz/use keyword:University of Western Ontario keyword:XML msc:68U10 msc:68U15 type:math text:in_proceedings 2009 ftdmlcz 2023-04-24T16:24:59Z summary:Publishing in Mathematics and theoretical areas in Computer Science and Physics has been predominantly using TeX/LaTeX as a formatting language in the last two decades. This large corpus of born-digital material is both a boon — LaTeX is semi-semantic format where the source often contains indications of the author’s intentions — and a problem — TeX is Turing-complete and authors use this freedom to use thousands of styles and millions of user macros. Several tools have been developed to convert TeX/LaTeX documents to XML-based — i.e. Web and DML-compatible formats. Different DML Projects use different tools, and the selection seems largely accidental. To put the choice of converters for DML projects onto a more solid footing and to encourage competition and feature convergence we survey the market. In this paper we investigate and compare five LaTeX-to-XML transformers in three dimensions: $a$) ergonomic factors like documentation, ease of installation, $b$) coverage, and $c$) quality of the resulting documents (in particular the MathML parts). Conference Object DML DML-CZ (Czech Digital Mathematics Library)