WHOLE: A Low Energy I-Cache with Separate Way History

Abstract — Set-associative instruction caches achieve low miss rates at the expense of significant energy dissipation. Previous energy-efficient approaches usually suffer from performance degradation and redundant extension bits. In this paper, we propose a Way History Oriented Low Energy Instructio...

Full description

Bibliographic Details
Main Authors: Zichao Xie, Dong Tong, Xu Cheng
Other Authors: The Pennsylvania State University CiteSeerX Archives
Format: Text
Language:English
Subjects:
Online Access:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825
http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf
id ftciteseerx:oai:CiteSeerX.psu:10.1.1.398.1825
record_format openpolar
spelling ftciteseerx:oai:CiteSeerX.psu:10.1.1.398.1825 2023-05-15T18:32:44+02:00 WHOLE: A Low Energy I-Cache with Separate Way History Zichao Xie Dong Tong Xu Cheng The Pennsylvania State University CiteSeerX Archives application/pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825 http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf en eng http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825 http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf Metadata may be used without restrictions as long as the oai identifier remains attached to it. http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf text ftciteseerx 2016-09-25T00:09:26Z Abstract — Set-associative instruction caches achieve low miss rates at the expense of significant energy dissipation. Previous energy-efficient approaches usually suffer from performance degradation and redundant extension bits. In this paper, we propose a Way History Oriented Low Energy Instruction Cache (WHOLE-Cache) design for single issue and in-order execution processors. The WHOLE-Cache design not only achieves a significant portion of energy reduction by effectively reducing dynamic energy dissipation of set-associative instruction cache, but also leads to no additional cycle penalties. Tag comparison results are stored into either the Branch Target Buffer (BTB) or the Instruction Cache (I-Cache) to avoid tag checks and unnecessary way activation for subsequent accesses to visited cache lines. The extended BTB uses way history bits for branch instructions, while the I-Cache extension bits are used in case of fetching consecutive instructions resided in different cache lines. A valid flag is associated with each stored tag comparison result to indicate whether the instruction to be fetched is resided in the recorded location. A simple invalidation scheme is implemented in the cache miss replacement operation. Whenever a cache line is replaced, the pointers to it, which reside in the BTB or other I-cache lines, will be invalidated accordingly. We model the WHOLE-Cache design in Verilog. By deriving basic parameters from TSMC 65nm technology, we use Wattch simulator to evaluate the performance and energy reduction of the WHOLE-Cache in the instruction fetch stage. We use SPEC2000 and Mediabench as benchmarks. It is observed that compared with a conventional 4-way set-associative I-Cache, the energy consumption of the WHOLE-Cache is reduced by 65 % without any performance penalty. I. Text The Pointers Unknown
institution Open Polar
collection Unknown
op_collection_id ftciteseerx
language English
description Abstract — Set-associative instruction caches achieve low miss rates at the expense of significant energy dissipation. Previous energy-efficient approaches usually suffer from performance degradation and redundant extension bits. In this paper, we propose a Way History Oriented Low Energy Instruction Cache (WHOLE-Cache) design for single issue and in-order execution processors. The WHOLE-Cache design not only achieves a significant portion of energy reduction by effectively reducing dynamic energy dissipation of set-associative instruction cache, but also leads to no additional cycle penalties. Tag comparison results are stored into either the Branch Target Buffer (BTB) or the Instruction Cache (I-Cache) to avoid tag checks and unnecessary way activation for subsequent accesses to visited cache lines. The extended BTB uses way history bits for branch instructions, while the I-Cache extension bits are used in case of fetching consecutive instructions resided in different cache lines. A valid flag is associated with each stored tag comparison result to indicate whether the instruction to be fetched is resided in the recorded location. A simple invalidation scheme is implemented in the cache miss replacement operation. Whenever a cache line is replaced, the pointers to it, which reside in the BTB or other I-cache lines, will be invalidated accordingly. We model the WHOLE-Cache design in Verilog. By deriving basic parameters from TSMC 65nm technology, we use Wattch simulator to evaluate the performance and energy reduction of the WHOLE-Cache in the instruction fetch stage. We use SPEC2000 and Mediabench as benchmarks. It is observed that compared with a conventional 4-way set-associative I-Cache, the energy consumption of the WHOLE-Cache is reduced by 65 % without any performance penalty. I.
author2 The Pennsylvania State University CiteSeerX Archives
format Text
author Zichao Xie
Dong Tong
Xu Cheng
spellingShingle Zichao Xie
Dong Tong
Xu Cheng
WHOLE: A Low Energy I-Cache with Separate Way History
author_facet Zichao Xie
Dong Tong
Xu Cheng
author_sort Zichao Xie
title WHOLE: A Low Energy I-Cache with Separate Way History
title_short WHOLE: A Low Energy I-Cache with Separate Way History
title_full WHOLE: A Low Energy I-Cache with Separate Way History
title_fullStr WHOLE: A Low Energy I-Cache with Separate Way History
title_full_unstemmed WHOLE: A Low Energy I-Cache with Separate Way History
title_sort whole: a low energy i-cache with separate way history
url http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825
http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf
genre The Pointers
genre_facet The Pointers
op_source http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf
op_relation http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.398.1825
http://iccd.et.tudelft.nl/2009/proceedings/137Xie.pdf
op_rights Metadata may be used without restrictions as long as the oai identifier remains attached to it.
_version_ 1766216913540612096