A distributed data warehouse system for astroparticle physics
A distributed data warehouse system is one of the actual issues in the field of astroparticle physics. Famous experiments, such as TAIGA, KASCADE-Grande, produce tens of terabytes of data measured by their instruments. It is critical to have a smart data warehouse system on-site to store the collect...
Main Authors: | , , , , , , , , , , , , |
---|---|
Format: | Text |
Language: | unknown |
Published: |
2018
|
Subjects: | |
Online Access: | http://arxiv.org/abs/1812.01906 |
id |
ftarxivpreprints:oai:arXiv.org:1812.01906 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:1812.01906 2023-09-05T13:23:40+02:00 A distributed data warehouse system for astroparticle physics Nguyen, Minh-Duc Kryukov, Alexander Dubenskaya, Julia Korosteleva, Elena Polyakov, Stanislav Postnikov, Evgeny Bychkov, Igor Mikhailov, Andrey Shigarov, Alexey Fedorov, Oleg Kazarina, Yulia Shipilov, Dmitry Zhurov, Dmitry 2018-12-05 http://arxiv.org/abs/1812.01906 unknown http://arxiv.org/abs/1812.01906 Astrophysics - Instrumentation and Methods for Astrophysics Computer Science - Distributed Parallel and Cluster Computing text 2018 ftarxivpreprints 2023-08-16T15:06:38Z A distributed data warehouse system is one of the actual issues in the field of astroparticle physics. Famous experiments, such as TAIGA, KASCADE-Grande, produce tens of terabytes of data measured by their instruments. It is critical to have a smart data warehouse system on-site to store the collected data for further distribution effectively. It is also vital to provide scientists with a handy and user-friendly interface to access the collected data with proper permissions not only on-site but also online. The latter case is handy when scientists need to combine data from different experiments for analysis. In this work, we describe an approach to implementing a distributed data warehouse system that allows scientists to acquire just the necessary data from different experiments via the Internet on demand. The implementation is based on CernVM-FS with additional components developed by us to search through the whole available data sets and deliver their subsets to users' computers. Comment: 5 pages, 3 figures, The 8th International Conference "Distributed Computing and Grid-technologies in Science and Education" (GRID 2018) Text taiga ArXiv.org (Cornell University Library) |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Astrophysics - Instrumentation and Methods for Astrophysics Computer Science - Distributed Parallel and Cluster Computing |
spellingShingle |
Astrophysics - Instrumentation and Methods for Astrophysics Computer Science - Distributed Parallel and Cluster Computing Nguyen, Minh-Duc Kryukov, Alexander Dubenskaya, Julia Korosteleva, Elena Polyakov, Stanislav Postnikov, Evgeny Bychkov, Igor Mikhailov, Andrey Shigarov, Alexey Fedorov, Oleg Kazarina, Yulia Shipilov, Dmitry Zhurov, Dmitry A distributed data warehouse system for astroparticle physics |
topic_facet |
Astrophysics - Instrumentation and Methods for Astrophysics Computer Science - Distributed Parallel and Cluster Computing |
description |
A distributed data warehouse system is one of the actual issues in the field of astroparticle physics. Famous experiments, such as TAIGA, KASCADE-Grande, produce tens of terabytes of data measured by their instruments. It is critical to have a smart data warehouse system on-site to store the collected data for further distribution effectively. It is also vital to provide scientists with a handy and user-friendly interface to access the collected data with proper permissions not only on-site but also online. The latter case is handy when scientists need to combine data from different experiments for analysis. In this work, we describe an approach to implementing a distributed data warehouse system that allows scientists to acquire just the necessary data from different experiments via the Internet on demand. The implementation is based on CernVM-FS with additional components developed by us to search through the whole available data sets and deliver their subsets to users' computers. Comment: 5 pages, 3 figures, The 8th International Conference "Distributed Computing and Grid-technologies in Science and Education" (GRID 2018) |
format |
Text |
author |
Nguyen, Minh-Duc Kryukov, Alexander Dubenskaya, Julia Korosteleva, Elena Polyakov, Stanislav Postnikov, Evgeny Bychkov, Igor Mikhailov, Andrey Shigarov, Alexey Fedorov, Oleg Kazarina, Yulia Shipilov, Dmitry Zhurov, Dmitry |
author_facet |
Nguyen, Minh-Duc Kryukov, Alexander Dubenskaya, Julia Korosteleva, Elena Polyakov, Stanislav Postnikov, Evgeny Bychkov, Igor Mikhailov, Andrey Shigarov, Alexey Fedorov, Oleg Kazarina, Yulia Shipilov, Dmitry Zhurov, Dmitry |
author_sort |
Nguyen, Minh-Duc |
title |
A distributed data warehouse system for astroparticle physics |
title_short |
A distributed data warehouse system for astroparticle physics |
title_full |
A distributed data warehouse system for astroparticle physics |
title_fullStr |
A distributed data warehouse system for astroparticle physics |
title_full_unstemmed |
A distributed data warehouse system for astroparticle physics |
title_sort |
distributed data warehouse system for astroparticle physics |
publishDate |
2018 |
url |
http://arxiv.org/abs/1812.01906 |
genre |
taiga |
genre_facet |
taiga |
op_relation |
http://arxiv.org/abs/1812.01906 |
_version_ |
1776204256531972096 |