Low latency memory access and synchronization

A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple pro...

Full description

Bibliographic Details
Main Authors: Blumrich, Matthias A., Chen, Dong, Coteus, Paul W., Gara, Alan G., Giampapa, Mark E., Heidelberger, Philip, Hoenicke, Dirk, Ohmacht, Martin, Steinmacher-Burow, Burkhard D., Takken, Todd E., Vranas, Pavlos M.
Language:unknown
Published: 2023
Subjects:
Online Access:http://www.osti.gov/servlets/purl/949197
https://www.osti.gov/biblio/949197
id ftosti:oai:osti.gov:949197
record_format openpolar
spelling ftosti:oai:osti.gov:949197 2023-07-30T04:07:15+02:00 Low latency memory access and synchronization Blumrich, Matthias A. Chen, Dong Coteus, Paul W. Gara, Alan G. Giampapa, Mark E. Heidelberger, Philip Hoenicke, Dirk Ohmacht, Martin Steinmacher-Burow, Burkhard D. Takken, Todd E. Vranas, Pavlos M. 2023-01-23 application/pdf http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197 unknown http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197 2023 ftosti 2023-07-11T08:46:55Z A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple processors in the multiprocessor and the orderly sharing of the resources. A processor only has permission to access a resource when it owns the lock associated with that resource, and an attempt by a processor to own a lock requires only a single load operation, rather than a traditional atomic load followed by store, such that the processor only performs a read operation and the hardware locking device performs a subsequent write operation rather than the processor. A simple prefetching for non-contiguous data structures is also disclosed. A memory line is redefined so that in addition to the normal physical memory data, every line includes a pointer that is large enough to point to any other line in the memory, wherein the pointers to determine which memory line to prefetch rather than some other predictive algorithm. This enables hardware to effectively prefetch memory access patterns that are non-contiguous, but repetitive. Other/Unknown Material The Pointers SciTec Connect (Office of Scientific and Technical Information - OSTI, U.S. Department of Energy)
institution Open Polar
collection SciTec Connect (Office of Scientific and Technical Information - OSTI, U.S. Department of Energy)
op_collection_id ftosti
language unknown
description A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple processors in the multiprocessor and the orderly sharing of the resources. A processor only has permission to access a resource when it owns the lock associated with that resource, and an attempt by a processor to own a lock requires only a single load operation, rather than a traditional atomic load followed by store, such that the processor only performs a read operation and the hardware locking device performs a subsequent write operation rather than the processor. A simple prefetching for non-contiguous data structures is also disclosed. A memory line is redefined so that in addition to the normal physical memory data, every line includes a pointer that is large enough to point to any other line in the memory, wherein the pointers to determine which memory line to prefetch rather than some other predictive algorithm. This enables hardware to effectively prefetch memory access patterns that are non-contiguous, but repetitive.
author Blumrich, Matthias A.
Chen, Dong
Coteus, Paul W.
Gara, Alan G.
Giampapa, Mark E.
Heidelberger, Philip
Hoenicke, Dirk
Ohmacht, Martin
Steinmacher-Burow, Burkhard D.
Takken, Todd E.
Vranas, Pavlos M.
spellingShingle Blumrich, Matthias A.
Chen, Dong
Coteus, Paul W.
Gara, Alan G.
Giampapa, Mark E.
Heidelberger, Philip
Hoenicke, Dirk
Ohmacht, Martin
Steinmacher-Burow, Burkhard D.
Takken, Todd E.
Vranas, Pavlos M.
Low latency memory access and synchronization
author_facet Blumrich, Matthias A.
Chen, Dong
Coteus, Paul W.
Gara, Alan G.
Giampapa, Mark E.
Heidelberger, Philip
Hoenicke, Dirk
Ohmacht, Martin
Steinmacher-Burow, Burkhard D.
Takken, Todd E.
Vranas, Pavlos M.
author_sort Blumrich, Matthias A.
title Low latency memory access and synchronization
title_short Low latency memory access and synchronization
title_full Low latency memory access and synchronization
title_fullStr Low latency memory access and synchronization
title_full_unstemmed Low latency memory access and synchronization
title_sort low latency memory access and synchronization
publishDate 2023
url http://www.osti.gov/servlets/purl/949197
https://www.osti.gov/biblio/949197
genre The Pointers
genre_facet The Pointers
op_relation http://www.osti.gov/servlets/purl/949197
https://www.osti.gov/biblio/949197
_version_ 1772820472892751872