Low latency memory access and synchronization

A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple pro...

Full description

Bibliographic Details
Main Authors:	Blumrich, Matthias A., Chen, Dong, Coteus, Paul W., Gara, Alan G., Giampapa, Mark E., Heidelberger, Philip, Hoenicke, Dirk, Ohmacht, Martin, Steinmacher-Burow, Burkhard D., Takken, Todd E., Vranas, Pavlos M.
Language:	unknown
Published:	2023
Subjects:	The Pointers
Online Access:	http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197

id	ftosti:oai:osti.gov:949197
record_format	openpolar
spelling	ftosti:oai:osti.gov:949197 2023-07-30T04:07:15+02:00 Low latency memory access and synchronization Blumrich, Matthias A. Chen, Dong Coteus, Paul W. Gara, Alan G. Giampapa, Mark E. Heidelberger, Philip Hoenicke, Dirk Ohmacht, Martin Steinmacher-Burow, Burkhard D. Takken, Todd E. Vranas, Pavlos M. 2023-01-23 application/pdf http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197 unknown http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197 2023 ftosti 2023-07-11T08:46:55Z A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple processors in the multiprocessor and the orderly sharing of the resources. A processor only has permission to access a resource when it owns the lock associated with that resource, and an attempt by a processor to own a lock requires only a single load operation, rather than a traditional atomic load followed by store, such that the processor only performs a read operation and the hardware locking device performs a subsequent write operation rather than the processor. A simple prefetching for non-contiguous data structures is also disclosed. A memory line is redefined so that in addition to the normal physical memory data, every line includes a pointer that is large enough to point to any other line in the memory, wherein the pointers to determine which memory line to prefetch rather than some other predictive algorithm. This enables hardware to effectively prefetch memory access patterns that are non-contiguous, but repetitive. Other/Unknown Material The Pointers SciTec Connect (Office of Scientific and Technical Information - OSTI, U.S. Department of Energy)
institution	Open Polar
collection	SciTec Connect (Office of Scientific and Technical Information - OSTI, U.S. Department of Energy)
op_collection_id	ftosti
language	unknown
description	A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple processors in the multiprocessor and the orderly sharing of the resources. A processor only has permission to access a resource when it owns the lock associated with that resource, and an attempt by a processor to own a lock requires only a single load operation, rather than a traditional atomic load followed by store, such that the processor only performs a read operation and the hardware locking device performs a subsequent write operation rather than the processor. A simple prefetching for non-contiguous data structures is also disclosed. A memory line is redefined so that in addition to the normal physical memory data, every line includes a pointer that is large enough to point to any other line in the memory, wherein the pointers to determine which memory line to prefetch rather than some other predictive algorithm. This enables hardware to effectively prefetch memory access patterns that are non-contiguous, but repetitive.
author	Blumrich, Matthias A. Chen, Dong Coteus, Paul W. Gara, Alan G. Giampapa, Mark E. Heidelberger, Philip Hoenicke, Dirk Ohmacht, Martin Steinmacher-Burow, Burkhard D. Takken, Todd E. Vranas, Pavlos M.
spellingShingle	Blumrich, Matthias A. Chen, Dong Coteus, Paul W. Gara, Alan G. Giampapa, Mark E. Heidelberger, Philip Hoenicke, Dirk Ohmacht, Martin Steinmacher-Burow, Burkhard D. Takken, Todd E. Vranas, Pavlos M. Low latency memory access and synchronization
author_facet	Blumrich, Matthias A. Chen, Dong Coteus, Paul W. Gara, Alan G. Giampapa, Mark E. Heidelberger, Philip Hoenicke, Dirk Ohmacht, Martin Steinmacher-Burow, Burkhard D. Takken, Todd E. Vranas, Pavlos M.
author_sort	Blumrich, Matthias A.
title	Low latency memory access and synchronization
title_short	Low latency memory access and synchronization
title_full	Low latency memory access and synchronization
title_fullStr	Low latency memory access and synchronization
title_full_unstemmed	Low latency memory access and synchronization
title_sort	low latency memory access and synchronization
publishDate	2023
url	http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197
genre	The Pointers
genre_facet	The Pointers
op_relation	http://www.osti.gov/servlets/purl/949197 https://www.osti.gov/biblio/949197
_version_	1772820472892751872

Low latency memory access and synchronization

Similar Items