Performance evaluation of the Orca shared-object system

Orca is a portable, object-based distributed shared memory (DSM) system. This article studies and evaluates the design choices made in the Orca system and compares Orca with other DSMs. The article gives a quantitative analysis of Orca's coherence protocol (based on write-updates with function...

Full description

Bibliographic Details
Published in:ACM Transactions on Computer Systems
Main Authors: Bal, Henri E., Bhoedjang, Raoul, Hofman, Rutger, Jacobs, Ceriel, Langendoen, Koen, Rühl, Tim, Kaashoek, M. Frans
Format: Article in Journal/Newspaper
Language:English
Published: Association for Computing Machinery (ACM) 1998
Subjects:
Online Access:http://dx.doi.org/10.1145/273011.273014
https://dl.acm.org/doi/pdf/10.1145/273011.273014
id cracm:10.1145/273011.273014
record_format openpolar
spelling cracm:10.1145/273011.273014 2024-06-02T08:12:43+00:00 Performance evaluation of the Orca shared-object system Bal, Henri E. Bhoedjang, Raoul Hofman, Rutger Jacobs, Ceriel Langendoen, Koen Rühl, Tim Kaashoek, M. Frans 1998 http://dx.doi.org/10.1145/273011.273014 https://dl.acm.org/doi/pdf/10.1145/273011.273014 en eng Association for Computing Machinery (ACM) ACM Transactions on Computer Systems volume 16, issue 1, page 1-40 ISSN 0734-2071 1557-7333 journal-article 1998 cracm https://doi.org/10.1145/273011.273014 2024-05-07T12:58:47Z Orca is a portable, object-based distributed shared memory (DSM) system. This article studies and evaluates the design choices made in the Orca system and compares Orca with other DSMs. The article gives a quantitative analysis of Orca's coherence protocol (based on write-updates with function shipping), the totally ordered group communication protocol, the strategy for object placement, and the all-software, user-space architecture. Performance measurements for 10 parallel applications illustrate the trade-offs made in the design of Orca and show that essentially the right design decisions have been made. A write-update protocol with function shipping is effective for Orca, especially since it is used in combination with techniques that avoid replicating objects that have a low read/write ratio. The overhead of totally ordered group communication on application performance is low. The Orca system is able to make near-optimal decisions for object placement and replication. In addition, the article compares the performance of Orca with that of a page-based DSM (TreadMarks) and another object-based DSM (CRL). It also analyzes the communication overhead of the DSMs for several applications. All performance measurements are done on a 32-node Pentium Pro cluster with Myrinet and Fast Ethernet networks. The results show that Orca programs send fewer messages and less data than the TreadMarks and CRL programs and obtain better speedups. Article in Journal/Newspaper Orca ACM Publications (Association for Computing Machinery) ACM Transactions on Computer Systems 16 1 1 40
institution Open Polar
collection ACM Publications (Association for Computing Machinery)
op_collection_id cracm
language English
description Orca is a portable, object-based distributed shared memory (DSM) system. This article studies and evaluates the design choices made in the Orca system and compares Orca with other DSMs. The article gives a quantitative analysis of Orca's coherence protocol (based on write-updates with function shipping), the totally ordered group communication protocol, the strategy for object placement, and the all-software, user-space architecture. Performance measurements for 10 parallel applications illustrate the trade-offs made in the design of Orca and show that essentially the right design decisions have been made. A write-update protocol with function shipping is effective for Orca, especially since it is used in combination with techniques that avoid replicating objects that have a low read/write ratio. The overhead of totally ordered group communication on application performance is low. The Orca system is able to make near-optimal decisions for object placement and replication. In addition, the article compares the performance of Orca with that of a page-based DSM (TreadMarks) and another object-based DSM (CRL). It also analyzes the communication overhead of the DSMs for several applications. All performance measurements are done on a 32-node Pentium Pro cluster with Myrinet and Fast Ethernet networks. The results show that Orca programs send fewer messages and less data than the TreadMarks and CRL programs and obtain better speedups.
format Article in Journal/Newspaper
author Bal, Henri E.
Bhoedjang, Raoul
Hofman, Rutger
Jacobs, Ceriel
Langendoen, Koen
Rühl, Tim
Kaashoek, M. Frans
spellingShingle Bal, Henri E.
Bhoedjang, Raoul
Hofman, Rutger
Jacobs, Ceriel
Langendoen, Koen
Rühl, Tim
Kaashoek, M. Frans
Performance evaluation of the Orca shared-object system
author_facet Bal, Henri E.
Bhoedjang, Raoul
Hofman, Rutger
Jacobs, Ceriel
Langendoen, Koen
Rühl, Tim
Kaashoek, M. Frans
author_sort Bal, Henri E.
title Performance evaluation of the Orca shared-object system
title_short Performance evaluation of the Orca shared-object system
title_full Performance evaluation of the Orca shared-object system
title_fullStr Performance evaluation of the Orca shared-object system
title_full_unstemmed Performance evaluation of the Orca shared-object system
title_sort performance evaluation of the orca shared-object system
publisher Association for Computing Machinery (ACM)
publishDate 1998
url http://dx.doi.org/10.1145/273011.273014
https://dl.acm.org/doi/pdf/10.1145/273011.273014
genre Orca
genre_facet Orca
op_source ACM Transactions on Computer Systems
volume 16, issue 1, page 1-40
ISSN 0734-2071 1557-7333
op_doi https://doi.org/10.1145/273011.273014
container_title ACM Transactions on Computer Systems
container_volume 16
container_issue 1
container_start_page 1
op_container_end_page 40
_version_ 1800759250026233856