Services for formation of digital documents metadata in the formats of international science-based databases

© 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digi...

Full description

Bibliographic Details
Format: Conference Object
Language:unknown
Published: 2018
Subjects:
DML
Online Access:https://openrepository.ru/article?id=189239
Description
Summary:© 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digital scientific collections, are presented. Features of the formation of metadata for international scientometric databases on mathematical and computer sciences are noted. An algorithm for the automated formation of metadata in the format of the Russian scientific citation index (RSCI) is given. To automatically parse the text of articles, several regular expression patterns have been created, with the help of which the main metadata groups were selected. The algorithm is implemented as a service, consisting of modules for analyzing the structure of documents, automatically selecting documents according to the established order (for example, lexicographic), extracting the annotation block, the alphabetical index generating module, creating a bibliographic description of the article for writing headers of this article, converting documents to the portable document format (pdf), according to the determined parameters. The final module is the formation of metadata for exports to the RSCI. Approbation of the algorithm for the collection of articles of the journal “Russian Digital Libraries” was noted. The service for the formation of metadata for the documents of the digital collection Lobachevskii DML, made in accordance with the diagrams of the fundamental metadata of the European Digital Mathematical Library (EuDML) and the bibliographic database DBLP, is presented. Templates for showing metadata of articles of digital collections Lobachevskii DML in accordance with the scheme NISO JATS V1.0 are prepared. Plugins for the Open Journal System, allowing generation of metadata for science-based databases for downloadable articles are developed.