HRDBMS: Combining the Best of Modern and Traditional Relational Databases

HRDBMS is a novel distributed relational database that uses a hybrid model combining the best of traditional distributed relational databases and Big Data analytics platforms such as Hive. This allows HRDBMS to leverage years worth of research regarding query optimization, while also taking advantag...

Full description

Bibliographic Details
Main Authors: Arnold, Jason, Glavic, Boris, Raicu, Ioan
Format: Report
Language:unknown
Published: arXiv 2019
Subjects:
DML
Online Access:https://dx.doi.org/10.48550/arxiv.1901.08666
https://arxiv.org/abs/1901.08666
id ftdatacite:10.48550/arxiv.1901.08666
record_format openpolar
spelling ftdatacite:10.48550/arxiv.1901.08666 2023-05-15T16:01:50+02:00 HRDBMS: Combining the Best of Modern and Traditional Relational Databases Arnold, Jason Glavic, Boris Raicu, Ioan 2019 https://dx.doi.org/10.48550/arxiv.1901.08666 https://arxiv.org/abs/1901.08666 unknown arXiv arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/ Databases cs.DB Distributed, Parallel, and Cluster Computing cs.DC FOS Computer and information sciences Preprint Article article CreativeWork 2019 ftdatacite https://doi.org/10.48550/arxiv.1901.08666 2022-04-01T08:47:12Z HRDBMS is a novel distributed relational database that uses a hybrid model combining the best of traditional distributed relational databases and Big Data analytics platforms such as Hive. This allows HRDBMS to leverage years worth of research regarding query optimization, while also taking advantage of the scalability of Big Data platforms. The system uses an execution framework that is tailored for relational processing, thus addressing some of the performance challenges of running SQL on top of platforms such as MapReduce and Spark. These include excessive materialization of intermediate results, lack of a global cost-based optimization, unnecessary sorting, lack of index support, no statistics, no support for DML and ACID, and excessive communication caused by the rigid communication patterns enforced by these platforms. : Oral Ph.D. Qualifier Report Report DML DataCite Metadata Store (German National Library of Science and Technology)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
topic Databases cs.DB
Distributed, Parallel, and Cluster Computing cs.DC
FOS Computer and information sciences
spellingShingle Databases cs.DB
Distributed, Parallel, and Cluster Computing cs.DC
FOS Computer and information sciences
Arnold, Jason
Glavic, Boris
Raicu, Ioan
HRDBMS: Combining the Best of Modern and Traditional Relational Databases
topic_facet Databases cs.DB
Distributed, Parallel, and Cluster Computing cs.DC
FOS Computer and information sciences
description HRDBMS is a novel distributed relational database that uses a hybrid model combining the best of traditional distributed relational databases and Big Data analytics platforms such as Hive. This allows HRDBMS to leverage years worth of research regarding query optimization, while also taking advantage of the scalability of Big Data platforms. The system uses an execution framework that is tailored for relational processing, thus addressing some of the performance challenges of running SQL on top of platforms such as MapReduce and Spark. These include excessive materialization of intermediate results, lack of a global cost-based optimization, unnecessary sorting, lack of index support, no statistics, no support for DML and ACID, and excessive communication caused by the rigid communication patterns enforced by these platforms. : Oral Ph.D. Qualifier Report
format Report
author Arnold, Jason
Glavic, Boris
Raicu, Ioan
author_facet Arnold, Jason
Glavic, Boris
Raicu, Ioan
author_sort Arnold, Jason
title HRDBMS: Combining the Best of Modern and Traditional Relational Databases
title_short HRDBMS: Combining the Best of Modern and Traditional Relational Databases
title_full HRDBMS: Combining the Best of Modern and Traditional Relational Databases
title_fullStr HRDBMS: Combining the Best of Modern and Traditional Relational Databases
title_full_unstemmed HRDBMS: Combining the Best of Modern and Traditional Relational Databases
title_sort hrdbms: combining the best of modern and traditional relational databases
publisher arXiv
publishDate 2019
url https://dx.doi.org/10.48550/arxiv.1901.08666
https://arxiv.org/abs/1901.08666
genre DML
genre_facet DML
op_rights arXiv.org perpetual, non-exclusive license
http://arxiv.org/licenses/nonexclusive-distrib/1.0/
op_doi https://doi.org/10.48550/arxiv.1901.08666
_version_ 1766397548948357120