Services for formation of digital documents metadata in the formats of international science-based databases
© 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digi...
Format: | Conference Object |
---|---|
Language: | unknown |
Published: |
2018
|
Subjects: | |
Online Access: | https://dspace.kpfu.ru/xmlui/handle/net/148707 |
id |
ftkazanuniv:oai:dspace.kpfu.ru:net/148707 |
---|---|
record_format |
openpolar |
spelling |
ftkazanuniv:oai:dspace.kpfu.ru:net/148707 2023-05-15T16:01:50+02:00 Services for formation of digital documents metadata in the formats of international science-based databases 2018 https://dspace.kpfu.ru/xmlui/handle/net/148707 unknown CEUR Workshop Proceedings 2260 175 1613-0073 https://dspace.kpfu.ru/xmlui/handle/net/148707 SCOPUS16130073-2018-2260-SID85058176774 Computer sciences DBLP Digital collections EuDML European Digital Mathematical Library Mathematical Metadata Scientometric databases Semantic methods Structural Stylistic analysis of digital documents Conference Paper 2018 ftkazanuniv 2022-01-01T09:50:42Z © 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digital scientific collections, are presented. Features of the formation of metadata for international scientometric databases on mathematical and computer sciences are noted. An algorithm for the automated formation of metadata in the format of the Russian scientific citation index (RSCI) is given. To automatically parse the text of articles, several regular expression patterns have been created, with the help of which the main metadata groups were selected. The algorithm is implemented as a service, consisting of modules for analyzing the structure of documents, automatically selecting documents according to the established order (for example, lexicographic), extracting the annotation block, the alphabetical index generating module, creating a bibliographic description of the article for writing headers of this article, converting documents to the portable document format (pdf), according to the determined parameters. The final module is the formation of metadata for exports to the RSCI. Approbation of the algorithm for the collection of articles of the journal “Russian Digital Libraries” was noted. The service for the formation of metadata for the documents of the digital collection Lobachevskii DML, made in accordance with the diagrams of the fundamental metadata of the European Digital Mathematical Library (EuDML) and the bibliographic database DBLP, is presented. Templates for showing metadata of articles of digital collections Lobachevskii DML in accordance with the scheme NISO JATS V1.0 are prepared. Plugins for the Open Journal System, allowing generation of metadata for science-based databases for downloadable articles are developed. Conference Object DML Kazan Federal University Digital Repository |
institution |
Open Polar |
collection |
Kazan Federal University Digital Repository |
op_collection_id |
ftkazanuniv |
language |
unknown |
topic |
Computer sciences DBLP Digital collections EuDML European Digital Mathematical Library Mathematical Metadata Scientometric databases Semantic methods Structural Stylistic analysis of digital documents |
spellingShingle |
Computer sciences DBLP Digital collections EuDML European Digital Mathematical Library Mathematical Metadata Scientometric databases Semantic methods Structural Stylistic analysis of digital documents Services for formation of digital documents metadata in the formats of international science-based databases |
topic_facet |
Computer sciences DBLP Digital collections EuDML European Digital Mathematical Library Mathematical Metadata Scientometric databases Semantic methods Structural Stylistic analysis of digital documents |
description |
© 2018 CEUR-WS. All rights reserved. This paper contains the review of modern scientometric databases. The specificity of the representation of scientific materials in them is highlighted. The integration methods based on automation of the process of creating metadata for documents, included in digital scientific collections, are presented. Features of the formation of metadata for international scientometric databases on mathematical and computer sciences are noted. An algorithm for the automated formation of metadata in the format of the Russian scientific citation index (RSCI) is given. To automatically parse the text of articles, several regular expression patterns have been created, with the help of which the main metadata groups were selected. The algorithm is implemented as a service, consisting of modules for analyzing the structure of documents, automatically selecting documents according to the established order (for example, lexicographic), extracting the annotation block, the alphabetical index generating module, creating a bibliographic description of the article for writing headers of this article, converting documents to the portable document format (pdf), according to the determined parameters. The final module is the formation of metadata for exports to the RSCI. Approbation of the algorithm for the collection of articles of the journal “Russian Digital Libraries” was noted. The service for the formation of metadata for the documents of the digital collection Lobachevskii DML, made in accordance with the diagrams of the fundamental metadata of the European Digital Mathematical Library (EuDML) and the bibliographic database DBLP, is presented. Templates for showing metadata of articles of digital collections Lobachevskii DML in accordance with the scheme NISO JATS V1.0 are prepared. Plugins for the Open Journal System, allowing generation of metadata for science-based databases for downloadable articles are developed. |
format |
Conference Object |
title |
Services for formation of digital documents metadata in the formats of international science-based databases |
title_short |
Services for formation of digital documents metadata in the formats of international science-based databases |
title_full |
Services for formation of digital documents metadata in the formats of international science-based databases |
title_fullStr |
Services for formation of digital documents metadata in the formats of international science-based databases |
title_full_unstemmed |
Services for formation of digital documents metadata in the formats of international science-based databases |
title_sort |
services for formation of digital documents metadata in the formats of international science-based databases |
publishDate |
2018 |
url |
https://dspace.kpfu.ru/xmlui/handle/net/148707 |
genre |
DML |
genre_facet |
DML |
op_source |
SCOPUS16130073-2018-2260-SID85058176774 |
op_relation |
CEUR Workshop Proceedings 2260 175 1613-0073 https://dspace.kpfu.ru/xmlui/handle/net/148707 |
_version_ |
1766397542540509184 |