A distributed data warehouse system for astroparticle physics

Building a distributed data warehouse system is a pressing issue in astroparticle physics. Well-known experiments such as TAIGA and KASCADE-Grande produce tens of terabytes of data measured by their instruments. It is critical to have a smart data warehouse system on-site to store the collect...

Bibliographic Details
Main Authors: Nguyen, Minh-Duc, Kryukov, Alexander, Dubenskaya, Julia, Korosteleva, Elena, Polyakov, Stanislav, Postnikov, Evgeny, Bychkov, Igor, Mikhailov, Andrey, Shigarov, Alexey, Fedorov, Oleg, Kazarina, Yulia, Shipilov, Dmitry, Zhurov, Dmitry
Format: Text
Language: unknown
Published: 2018
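Subjects: Astrophysics - Instrumentation and Methods for Astrophysics; Computer Science - Distributed, Parallel, and Cluster Computing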
Online Access: http://arxiv.org/abs/1812.01906
id ftarxivpreprints:oai:arXiv.org:1812.01906
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:1812.01906 2023-09-05T13:23:40+02:00 2018-12-05 2023-08-16T15:06:38Z
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Astrophysics - Instrumentation and Methods for Astrophysics
Computer Science - Distributed, Parallel, and Cluster Computing
description Building a distributed data warehouse system is a pressing issue in astroparticle physics. Well-known experiments such as TAIGA and KASCADE-Grande produce tens of terabytes of data measured by their instruments. It is critical to have a smart on-site data warehouse system that stores the collected data effectively for further distribution. It is also vital to provide scientists with a convenient, user-friendly interface for accessing the collected data with proper permissions, both on-site and online. Online access is particularly useful when scientists need to combine data from different experiments for analysis. In this work, we describe an approach to implementing a distributed data warehouse system that lets scientists acquire just the necessary data from different experiments via the Internet on demand. The implementation is based on CernVM-FS, with additional components that we developed to search through all available data sets and deliver subsets of them to users' computers. Comment: 5 pages, 3 figures, The 8th International Conference "Distributed Computing and Grid-technologies in Science and Education" (GRID 2018)
format Text
author Nguyen, Minh-Duc
Kryukov, Alexander
Dubenskaya, Julia
Korosteleva, Elena
Polyakov, Stanislav
Postnikov, Evgeny
Bychkov, Igor
Mikhailov, Andrey
Shigarov, Alexey
Fedorov, Oleg
Kazarina, Yulia
Shipilov, Dmitry
Zhurov, Dmitry
author_sort Nguyen, Minh-Duc
title A distributed data warehouse system for astroparticle physics
title_sort distributed data warehouse system for astroparticle physics
publishDate 2018
url http://arxiv.org/abs/1812.01906
genre taiga
op_relation http://arxiv.org/abs/1812.01906
_version_ 1776204256531972096