View : 431 Download: 0

CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs

Title
CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
Authors
Oh, YunhoJeong, IpoomRo, Won WooYoon, Myung Kuk
Ewha Authors
윤명국
SCOPUS Author ID
윤명국scopus
Issue Date
2022
Journal Title
IEEE EMBEDDED SYSTEMS LETTERS
ISSN
1943-0663JCR Link

1943-0671JCR Link
Citation
IEEE EMBEDDED SYSTEMS LETTERS vol. 14, no. 4, pp. 187 - 190
Keywords
Compiler-assistedgraphics processing units (GPUs)hierarchical register file (RF)spin-transfer torque magnetic random-access memory (STT-MRAM)SRAM
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Indexed
SCIE; SCOPUS WOS scopus
Document Type
Article
Abstract
Spin-transfer torque magnetic random-access memory (STT-MRAM) is an emerging nonvolatile memory technology that has been received significant attention due to its higher density and lower leakage current over SRAM. One compelling use case is to employ STT-MRAM as a graphics processing unit (GPU) register file (RF) to reduce its massive energy consumption. One critical challenge is that STT-MRAM has longer access latency and higher dynamic power consumption than SRAM, which motivates the hierarchical RF that places a small SRAM-based register cache (RC) between functional units and STT-MRAM RF. The RC acts as the write buffer, so all the writes on the RF are first performed on the RC. In the presence of a conflict miss, the RC writes back the corresponding cache line into the RF. In this work, we observe that a large amount of such write-back operations are unnecessary because they include register values that are never used again. Leveraging this observation, we propose a compiler-assisted hierarchical RF in GPUs (CASH-RF) that optimizes STT-MRAM accesses by removing dead register values. In CASH-RF, unnecessary write-back operations are detected by the compiler via the last consumer analysis. At runtime, the corresponding RC lines are discarded after the last references without being updated to the RF. Compared to the baseline GPUs, CASH-RF removes 59.5% of write-back operations, which leads to 54.7% lower RF energy consumption with only 2.6% of performance degradation.
DOI
10.1109/LES.2022.3163749
Appears in Collections:
인공지능대학 > 컴퓨터공학과 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE