Virtual definition of data sets according to RDA recommendations

Bibliographic Details
Main Author: Quinteros, Javier
Format: Article in Journal/Newspaper
Language: English
Published: 2022
Online Access: https://zenodo.org/record/7225818
https://doi.org/10.5281/zenodo.7225818
Description
Summary: At the GEOFON data centre it is very difficult to offer large pre-assembled datasets for download, because of the resources needed to store them. In this context, using a Data Collections System (DCS) to define and save this type of dataset is very appealing, because we can define collections containing only "pointers" (e.g. PIDs, URLs) to the files that are included. This requires almost no extra storage, as only the pointers are saved. We therefore implemented this DCS based on an extended version of the RDA WG specification on research data collections and made it generic enough to be adopted by different communities within EOSC. Currently, more than 6000 collections and more than 1.5 million members have been defined in the internal service.