Performance of a High-Level Parallel Language on a High-Speed Network
Clusters of workstations are often claimed to be a good platform for parallel processing, especially if a fast network is used to interconnect the workstations. Indeed, high performance can be obtained for low-level message passing primitives on modern networks like ATM and Myrinet. Most application...
Main Authors: | Henri Bal, Raoul Bhoedjang, Rutger Hofman, Ceriel Jacobs, Koen Langendoen, Tim Rühl, Kees Verstoep |
---|---|
Other Authors: | The Pennsylvania State University CiteSeerX Archives |
Format: | Text |
Language: | English |
Published: | 1997 |
Subjects: | |
Online Access: | http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.95.1069 http://www.cs.cornell.edu/raoul/papers/jpdc97.pdf |
id |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.95.1069 |
---|---|
record_format |
openpolar |
institution |
Open Polar |
collection |
Unknown |
op_collection_id |
ftciteseerx |
language |
English |
description |
Clusters of workstations are often claimed to be a good platform for parallel processing, especially if a fast network is used to interconnect the workstations. Indeed, high performance can be obtained for low-level message passing primitives on modern networks like ATM and Myrinet. Most application programmers, however, want to use higher-level communication primitives. Unfortunately, implementing such primitives efficiently on a modern network is a difficult task, because their relative software overhead is much higher than on a traditional, slow network (such as Ethernet). In this paper we investigate the issues involved in implementing a high-level programming environment on a fast network. We have implemented a portable runtime system for an object-based language (Orca) on a collection of processors connected by a Myrinet network. Many performance optimizations were required in order to let application programmers benefit sufficiently from the faster network. In particular, we have optimized message handling, multicasting, buffer management, fragmentation, marshalling, and various other issues. The paper analyzes the impact of these optimizations on the performance of the basic language primitives as well as parallel applications. Keywords: clusters, threads, communication protocols, multicast, Myrinet, Illinois Fast Messages. |
author2 |
The Pennsylvania State University CiteSeerX Archives |
format |
Text |
author |
Henri Bal, Raoul Bhoedjang, Rutger Hofman, Ceriel Jacobs, Koen Langendoen, Tim Rühl, Kees Verstoep |
author_sort |
Henri Bal |
title |
Performance of a High-Level Parallel Language on a High-Speed Network |
title_sort |
performance of a high-level parallel language on a high-speed network |
publishDate |
1997 |
url |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.95.1069 http://www.cs.cornell.edu/raoul/papers/jpdc97.pdf |
genre |
Orca |
genre_facet |
Orca |
op_source |
http://www.cs.cornell.edu/raoul/papers/jpdc97.pdf |
op_relation |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.95.1069 http://www.cs.cornell.edu/raoul/papers/jpdc97.pdf |
op_rights |
Metadata may be used without restrictions as long as the oai identifier remains attached to it. |
_version_ |
1766161536459472896 |