From Scanned Image to Knowledge Sharing Formats and Technologies in the Digital Mathematics Library Project

Abstract The main obstacle to easy accessing the vast amount of knowledge is the fact that they are not available in well-designed, standard, fully indexed electronic form, together with detailed metadata and full-text search capabilities. This paper is a case study of design issues in a subproject...

Full description

Bibliographic Details
Main Author: Petr Sojka
Other Authors: The Pennsylvania State University CiteSeerX Archives
Format: Text
Language:English
Published: 2005
Subjects:
DML
Online Access:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.72.5913
http://www.fi.muni.cz/usr/sojka/papers/dmlcz-iknow.pdf
Description
Summary:Abstract The main obstacle to easy accessing the vast amount of knowledge is the fact that they are not available in well-designed, standard, fully indexed electronic form, together with detailed metadata and full-text search capabilities. This paper is a case study of design issues in a subproject of WDML (World Digital Mathematics Library) aimed at digitizing valuable mathematical journals and books published in the Czech and Slovak Republics, to make them publicly available in digital form. We discuss here the design of the work-flow aiming at having mathematical knowledge stored in digital library. The key concept is a gradual enhancement of the digital material by ‘knowledge enhancing ’ filters applied to the markup-rich XML data. Key Words: digital library; metadata handling; semantics of mathematical documents; knowledge management; digitization; MathML; visualization; portal-systems; repositories of knowledge; DML-CZ Category: H.3.7, H.5, H.3, H.4 We dreamed of making the incredible breadth of information that librarians so lovingly organize searchable online. —Larry Page, founder of Google 1