Services for formation of digital documents metadata in the formats of international science-based databases

© 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digi...

Full description

Bibliographic Details
Format: Conference Object
Language:unknown
Published: 2018
Subjects:
DML
Online Access:https://dspace.kpfu.ru/xmlui/handle/net/148707
id ftkazanuniv:oai:dspace.kpfu.ru:net/148707
record_format openpolar
spelling ftkazanuniv:oai:dspace.kpfu.ru:net/148707 2023-05-15T16:01:50+02:00 Services for formation of digital documents metadata in the formats of international science-based databases 2018 https://dspace.kpfu.ru/xmlui/handle/net/148707 unknown CEUR Workshop Proceedings 2260 175 1613-0073 https://dspace.kpfu.ru/xmlui/handle/net/148707 SCOPUS16130073-2018-2260-SID85058176774 Computer sciences DBLP Digital collections EuDML European Digital Mathematical Library Mathematical Metadata Scientometric databases Semantic methods Structural Stylistic analysis of digital documents Conference Paper 2018 ftkazanuniv 2022-01-01T09:50:42Z © 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digital scientific collections, are presented. Features of the formation of metadata for international scientometric databases on mathematical and computer sciences are noted. An algorithm for the automated formation of metadata in the format of the Russian scientific citation index (RSCI) is given. To automatically parse the text of articles, several regular expression patterns have been created, with the help of which the main metadata groups were selected. The algorithm is implemented as a service, consisting of modules for analyzing the structure of documents, automatically selecting documents according to the established order (for example, lexicographic), extracting the annotation block, the alphabetical index generating module, creating a bibliographic description of the article for writing headers of this article, converting documents to the portable document format (pdf), according to the determined parameters. The final module is the formation of metadata for exports to the RSCI. Approbation of the algorithm for the collection of articles of the journal “Russian Digital Libraries” was noted. The service for the formation of metadata for the documents of the digital collection Lobachevskii DML, made in accordance with the diagrams of the fundamental metadata of the European Digital Mathematical Library (EuDML) and the bibliographic database DBLP, is presented. Templates for showing metadata of articles of digital collections Lobachevskii DML in accordance with the scheme NISO JATS V1.0 are prepared. Plugins for the Open Journal System, allowing generation of metadata for science-based databases for downloadable articles are developed. Conference Object DML Kazan Federal University Digital Repository
institution Open Polar
collection Kazan Federal University Digital Repository
op_collection_id ftkazanuniv
language unknown
topic Computer sciences
DBLP
Digital collections
EuDML
European Digital Mathematical Library
Mathematical
Metadata
Scientometric databases
Semantic methods
Structural
Stylistic analysis of digital documents
spellingShingle Computer sciences
DBLP
Digital collections
EuDML
European Digital Mathematical Library
Mathematical
Metadata
Scientometric databases
Semantic methods
Structural
Stylistic analysis of digital documents
Services for formation of digital documents metadata in the formats of international science-based databases
topic_facet Computer sciences
DBLP
Digital collections
EuDML
European Digital Mathematical Library
Mathematical
Metadata
Scientometric databases
Semantic methods
Structural
Stylistic analysis of digital documents
description © 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digital scientific collections, are presented. Features of the formation of metadata for international scientometric databases on mathematical and computer sciences are noted. An algorithm for the automated formation of metadata in the format of the Russian scientific citation index (RSCI) is given. To automatically parse the text of articles, several regular expression patterns have been created, with the help of which the main metadata groups were selected. The algorithm is implemented as a service, consisting of modules for analyzing the structure of documents, automatically selecting documents according to the established order (for example, lexicographic), extracting the annotation block, the alphabetical index generating module, creating a bibliographic description of the article for writing headers of this article, converting documents to the portable document format (pdf), according to the determined parameters. The final module is the formation of metadata for exports to the RSCI. Approbation of the algorithm for the collection of articles of the journal “Russian Digital Libraries” was noted. The service for the formation of metadata for the documents of the digital collection Lobachevskii DML, made in accordance with the diagrams of the fundamental metadata of the European Digital Mathematical Library (EuDML) and the bibliographic database DBLP, is presented. Templates for showing metadata of articles of digital collections Lobachevskii DML in accordance with the scheme NISO JATS V1.0 are prepared. Plugins for the Open Journal System, allowing generation of metadata for science-based databases for downloadable articles are developed.
format Conference Object
title Services for formation of digital documents metadata in the formats of international science-based databases
title_short Services for formation of digital documents metadata in the formats of international science-based databases
title_full Services for formation of digital documents metadata in the formats of international science-based databases
title_fullStr Services for formation of digital documents metadata in the formats of international science-based databases
title_full_unstemmed Services for formation of digital documents metadata in the formats of international science-based databases
title_sort services for formation of digital documents metadata in the formats of international science-based databases
publishDate 2018
url https://dspace.kpfu.ru/xmlui/handle/net/148707
genre DML
genre_facet DML
op_source SCOPUS16130073-2018-2260-SID85058176774
op_relation CEUR Workshop Proceedings
2260
175
1613-0073
https://dspace.kpfu.ru/xmlui/handle/net/148707
_version_ 1766397542540509184