DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language

This paper introduces a dynamic generalized parser aimed primarily at common natural mathematical language. Our algorithm combines the efficiency of GLR parsing, the dynamic extensibility of tableless approaches and the expressiveness of extended context-free grammars such as parallel multiple conte...

Full description

Bibliographic Details
Main Authors: Kevin Kofler, Arnold Neumaier
Other Authors: The Pennsylvania State University CiteSeerX Archives
Format: Text
Language:English
Subjects:
DML
Online Access:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.298.5379
http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf
id ftciteseerx:oai:CiteSeerX.psu:10.1.1.298.5379
record_format openpolar
spelling ftciteseerx:oai:CiteSeerX.psu:10.1.1.298.5379 2023-05-15T16:01:58+02:00 DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language Kevin Kofler Arnold Neumaier The Pennsylvania State University CiteSeerX Archives application/pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.298.5379 http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf en eng http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.298.5379 http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf Metadata may be used without restrictions as long as the oai identifier remains attached to it. http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf scannerless parser text ftciteseerx 2016-09-25T00:06:18Z This paper introduces a dynamic generalized parser aimed primarily at common natural mathematical language. Our algorithm combines the efficiency of GLR parsing, the dynamic extensibility of tableless approaches and the expressiveness of extended context-free grammars such as parallel multiple context-free grammars (PMCFGs). In particular, it supports efficient dynamic rule additions to the grammar at any moment. The algorithm is designed in a fully incremental way, allowing to resume parsing with additional tokens without restarting the parse process, and can predict possible next tokens. Additionally, we handle constraints on the token following a rule. This allows for grammatically correct English indefinite articles when working with word tokens. It can also represent typical operations for scannerless parsing such as maximal matches when working with character tokens. Our long-term goal is to computerize a large library of existing mathematical knowledge using the new parser, starting from natural language input as found in textbooks or in the papers collected by the digital mathematical library (DML) projects around the world. In this paper, we present the algorithmic ideas behind our approach, give a short overview of the implementation, and present some efficiency results. The new parser is available at Text DML Unknown
institution Open Polar
collection Unknown
op_collection_id ftciteseerx
language English
topic scannerless parser
spellingShingle scannerless parser
Kevin Kofler
Arnold Neumaier
DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language
topic_facet scannerless parser
description This paper introduces a dynamic generalized parser aimed primarily at common natural mathematical language. Our algorithm combines the efficiency of GLR parsing, the dynamic extensibility of tableless approaches and the expressiveness of extended context-free grammars such as parallel multiple context-free grammars (PMCFGs). In particular, it supports efficient dynamic rule additions to the grammar at any moment. The algorithm is designed in a fully incremental way, allowing to resume parsing with additional tokens without restarting the parse process, and can predict possible next tokens. Additionally, we handle constraints on the token following a rule. This allows for grammatically correct English indefinite articles when working with word tokens. It can also represent typical operations for scannerless parsing such as maximal matches when working with character tokens. Our long-term goal is to computerize a large library of existing mathematical knowledge using the new parser, starting from natural language input as found in textbooks or in the papers collected by the digital mathematical library (DML) projects around the world. In this paper, we present the algorithmic ideas behind our approach, give a short overview of the implementation, and present some efficiency results. The new parser is available at
author2 The Pennsylvania State University CiteSeerX Archives
format Text
author Kevin Kofler
Arnold Neumaier
author_facet Kevin Kofler
Arnold Neumaier
author_sort Kevin Kofler
title DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language
title_short DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language
title_full DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language
title_fullStr DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language
title_full_unstemmed DynGenPar -- A Dynamic Generalized Parser for Common Mathematical Language
title_sort dyngenpar -- a dynamic generalized parser for common mathematical language
url http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.298.5379
http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf
genre DML
genre_facet DML
op_source http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf
op_relation http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.298.5379
http://www.tigen.org/kevin.kofler/fmathl/dyngenpar-dml.pdf
op_rights Metadata may be used without restrictions as long as the oai identifier remains attached to it.
_version_ 1766397628725067776