WHOLE: A Low Energy I-Cache with Separate Way History
Abstract — Set-associative instruction caches achieve low miss rates at the expense of significant energy dissipation. Previous energy-efficient approaches usually suffer from performance degradation and redundant extension bits. In this paper, we propose a Way History Oriented Low Energy Instructio...
Main Authors: | Zichao Xie, Dong Tong, Xu Cheng |
---|---|
Other Authors: | The Pennsylvania State University CiteSeerX Archives |
Format: | Text |
Language: | English |
Subjects: | |
Online Access: | http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825 http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf |
id |
ftciteseerx:oai:CiteSeerX.psu:10.1.1.398.1825 |
---|---|
record_format |
openpolar |
institution |
Open Polar |
collection |
Unknown |
op_collection_id |
ftciteseerx |
language |
English |
description |
Abstract: Set-associative instruction caches achieve low miss rates at the expense of significant energy dissipation. Previous energy-efficient approaches usually suffer from performance degradation and redundant extension bits. In this paper, we propose a Way History Oriented Low Energy Instruction Cache (WHOLE-Cache) design for single-issue, in-order processors. The WHOLE-Cache design substantially reduces the dynamic energy dissipation of a set-associative instruction cache without adding cycle penalties. Tag comparison results are stored in either the Branch Target Buffer (BTB) or the Instruction Cache (I-Cache) to avoid tag checks and unnecessary way activation on subsequent accesses to previously visited cache lines. The extended BTB holds way history bits for branch instructions, while the I-Cache extension bits are used when consecutive instructions reside in different cache lines. A valid flag is associated with each stored tag comparison result to indicate whether the instruction to be fetched still resides in the recorded location. A simple invalidation scheme is implemented in the cache miss replacement operation: whenever a cache line is replaced, the pointers to it, which reside in the BTB or in other I-Cache lines, are invalidated accordingly. We model the WHOLE-Cache design in Verilog. Using basic parameters derived from TSMC 65 nm technology, we use the Wattch simulator to evaluate the performance and energy reduction of the WHOLE-Cache in the instruction fetch stage, with SPEC2000 and Mediabench as benchmarks. Compared with a conventional 4-way set-associative I-Cache, the WHOLE-Cache reduces energy consumption by 65% without any performance penalty. |
author2 |
The Pennsylvania State University CiteSeerX Archives |
format |
Text |
author |
Zichao Xie, Dong Tong, Xu Cheng |
title |
WHOLE: A Low Energy I-Cache with Separate Way History |
url |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825 http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf |
genre |
The Pointers |
op_source |
http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf |
op_relation |
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825 http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf |
op_rights |
Metadata may be used without restrictions as long as the oai identifier remains attached to it. |
_version_ |
1766216913540612096 |
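
The way-history mechanism summarized in the abstract can be illustrated with a small behavioral sketch. This is not the authors' Verilog model. It is a toy Python approximation that assumes a round-robin replacement policy, a dictionary standing in for the BTB, and made-up sizes (`SETS`, `LINE_BYTES`); the names `WayHint`, `BTBEntry`, and `ToyWholeCache` are hypothetical. It only counts way activations and tag checks, to show how a valid recorded way skips the tag comparison and how replacement invalidates stale pointers.

```python
from dataclasses import dataclass, field

WAYS = 4         # associativity; the abstract's baseline is 4-way
SETS = 8         # hypothetical toy size, not taken from the paper
LINE_BYTES = 16  # hypothetical line size, not taken from the paper


@dataclass
class WayHint:
    """A stored tag comparison result plus its valid flag."""
    way: int = 0
    valid: bool = False


@dataclass
class Line:
    tag: int = -1  # -1 marks an empty line
    # Hint for the sequentially next cache line (the I-Cache extension bits).
    next_hint: WayHint = field(default_factory=WayHint)


@dataclass
class BTBEntry:
    target: int  # branch target address
    hint: WayHint = field(default_factory=WayHint)  # way history bits


class ToyWholeCache:
    def __init__(self):
        self.lines = [[Line() for _ in range(WAYS)] for _ in range(SETS)]
        self.btb = {}                # branch PC -> BTBEntry
        self.fill_ptr = [0] * SETS   # round-robin victim pointer (assumption)
        self.ways_activated = 0
        self.tag_checks = 0

    @staticmethod
    def split(addr):
        """Return (set index, tag) for a byte address."""
        line_no = addr // LINE_BYTES
        return line_no % SETS, line_no // SETS

    def access(self, addr, hint=None):
        """Fetch the line holding addr and return the way it occupies."""
        set_idx, tag = self.split(addr)
        if hint is not None and hint.valid:
            self.ways_activated += 1  # only the recorded way, no tag check
            return hint.way
        self.ways_activated += WAYS   # conventional access: all ways
        self.tag_checks += WAYS
        way = next((w for w, ln in enumerate(self.lines[set_idx])
                    if ln.tag == tag), None)
        if way is None:
            way = self._replace(set_idx, tag)
        if hint is not None:          # record the comparison result
            hint.way, hint.valid = way, True
        return way

    def _replace(self, set_idx, tag):
        way = self.fill_ptr[set_idx]
        self.fill_ptr[set_idx] = (way + 1) % WAYS
        self._invalidate_pointers(set_idx, way)
        self.lines[set_idx][way] = Line(tag=tag)  # new line starts with no hint
        return way

    def _invalidate_pointers(self, set_idx, way):
        """Clear every hint that points at the line being replaced."""
        for entry in self.btb.values():
            target_set, _ = self.split(entry.target)
            if target_set == set_idx and entry.hint.way == way:
                entry.hint.valid = False
        # Sequential predecessors always live in the previous set.
        for ln in self.lines[(set_idx - 1) % SETS]:
            if ln.next_hint.way == way:
                ln.next_hint.valid = False
```

In this sketch a caller passes `entry.hint` when fetching a branch target and `line.next_hint` when fetching crosses into the next sequential line; an access without a hint behaves like a conventional set-associative lookup. The invalidation walk is deliberately exhaustive for clarity and says nothing about how the paper implements it in hardware.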