TY - JOUR
T1 - Impact of spintronic memory on multicore cache hierarchy design
AU - Ma, Cong
AU - Tuohy, William
AU - Lilja, David J.
N1 - Publisher Copyright:
© The Institution of Engineering and Technology 2016.
PY - 2017/3/1
Y1 - 2017/3/1
N2 - Spintronic memory [spin-transfer torque-magnetic random access memory (STT-MRAM)] is an attractive alternative technology to CMOS since it offers higher density and virtually no leakage current. Spintronic memory continues to require higher write energy, however, presenting a challenge to memory hierarchy design when energy consumption is a concern. This study motivates the use of STT-MRAM for the first-level caches of a multicore processor to reduce energy consumption without significantly degrading the performance. The large STT-MRAM first-level cache implementation saves leakage power. Moreover, the use of small level-0 cache regains the performance drop due to STT-MRAM long write latencies. The combination of both reduces the energy-delay product by 65% on average compared with CMOS baseline. The proposed STT hierarchy also shows good scalability over the CMOS with a few benchmarks which scale significantly better. The PARSEC and Splash2 benchmark suites are analysed running on a modern multicore platform, comparing performance, energy consumption and scalability of the spintronic cache system to a CMOS design.
AB - Spintronic memory [spin-transfer torque-magnetic random access memory (STT-MRAM)] is an attractive alternative technology to CMOS since it offers higher density and virtually no leakage current. Spintronic memory continues to require higher write energy, however, presenting a challenge to memory hierarchy design when energy consumption is a concern. This study motivates the use of STT-MRAM for the first-level caches of a multicore processor to reduce energy consumption without significantly degrading the performance. The large STT-MRAM first-level cache implementation saves leakage power. Moreover, the use of small level-0 cache regains the performance drop due to STT-MRAM long write latencies. The combination of both reduces the energy-delay product by 65% on average compared with CMOS baseline. The proposed STT hierarchy also shows good scalability over the CMOS with a few benchmarks which scale significantly better. The PARSEC and Splash2 benchmark suites are analysed running on a modern multicore platform, comparing performance, energy consumption and scalability of the spintronic cache system to a CMOS design.
UR - http://www.scopus.com/inward/record.url?scp=85013898987&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85013898987&partnerID=8YFLogxK
U2 - 10.1049/iet-cdt.2015.0190
DO - 10.1049/iet-cdt.2015.0190
M3 - Article
AN - SCOPUS:85013898987
SN - 1751-8601
VL - 11
SP - 51
EP - 59
JO - IET Computers and Digital Techniques
JF - IET Computers and Digital Techniques
IS - 2
ER -