TU framework in automatic formatting a digital library

Bibliographic Details
Main Authors: Toschev A., Talanov M., Kurnosov V.
Format: Book Part
Language: unknown
Published: 2019
Subjects: DML
Online Access: https://openrepository.ru/article?id=198535
Description
Summary: © Springer Nature Singapore Pte Ltd. 2019. Intelligent search in digital libraries is important because efficient research depends on obtaining relevant information quickly. In this paper, we propose methods for automatically processing an institution library's online resources, using scanned copies and/or PDF files to build a MathML model and provide extended search capability. The key idea is to use the Thinking–Understanding (TU) framework for automatic document-type detection and processing, with the thinking flow combining different open-source engines, such as OCR, with approaches such as Word2vec.