TY - JOUR
T1 - Adapting the RACER Architecture to Integrate Improved In-ReRAM Logic Primitives
AU - Truong, Minh S.Q.
AU - Shen, Liting
AU - Glass, Alexander
AU - Hoffmann, Alison
AU - Carley, L. Richard
AU - Bain, James A.
AU - Ghose, Saugata
N1 - Publisher Copyright:
© 2011 IEEE.
PY - 2022/6/1
Y1 - 2022/6/1
N2 - Modern computing applications based upon machine learning can incur significant data movement overheads in state-of-the-art computers. Resistive-memory-based processing-using-memory (PUM) can mitigate this data movement by instead performing computation in situ (i.e., directly within memory cells), but device-level limitations restrict the practicality and/or performance of many PUM architecture proposals. The RACER architecture overcomes these limitations, by proposing efficient peripheral circuitry and the concept of bit-pipelining to enable high-performance, high-efficiency computation using small memory tiles. In this work, we extend RACER to adapt easily to different PUM logic families, by (1) modifying the device access circuitry to support a wide range of logic families, (2) evaluating three logic families proposed by prior work, and (3) proposing and evaluating a new logic family called OSCAR that significantly relaxes the switching voltage constraints required to perform logic with resistive memory devices. We show that the modified RACER architecture, using the OSCAR logic family, can enable practical PUM on real ReRAM devices while improving performance and energy savings by 30% and 37%, respectively, over the original RACER work.
AB - Modern computing applications based upon machine learning can incur significant data movement overheads in state-of-the-art computers. Resistive-memory-based processing-using-memory (PUM) can mitigate this data movement by instead performing computation in situ (i.e., directly within memory cells), but device-level limitations restrict the practicality and/or performance of many PUM architecture proposals. The RACER architecture overcomes these limitations, by proposing efficient peripheral circuitry and the concept of bit-pipelining to enable high-performance, high-efficiency computation using small memory tiles. In this work, we extend RACER to adapt easily to different PUM logic families, by (1) modifying the device access circuitry to support a wide range of logic families, (2) evaluating three logic families proposed by prior work, and (3) proposing and evaluating a new logic family called OSCAR that significantly relaxes the switching voltage constraints required to perform logic with resistive memory devices. We show that the modified RACER architecture, using the OSCAR logic family, can enable practical PUM on real ReRAM devices while improving performance and energy savings by 30% and 37%, respectively, over the original RACER work.
KW - Accelerator architectures
KW - memory architecture
KW - resistive RAM
UR - http://www.scopus.com/inward/record.url?scp=85132506204&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132506204&partnerID=8YFLogxK
U2 - 10.1109/JETCAS.2022.3171765
DO - 10.1109/JETCAS.2022.3171765
M3 - Article
AN - SCOPUS:85132506204
SN - 2156-3357
VL - 12
SP - 393
EP - 407
JO - IEEE Journal on Emerging and Selected Topics in Circuits and Systems
JF - IEEE Journal on Emerging and Selected Topics in Circuits and Systems
IS - 2
ER -