TY - GEN
T1 - PRIVE
T2 - 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023
AU - He, Wangxin
AU - Meng, Jian
AU - Gonugondla, Sujan Kumar
AU - Yu, Shimeng
AU - Shanbhag, Naresh R.
AU - Seo, Jae Sun
N1 - This work is partially supported by NSF grant 1652866, JUMP C-BRIC and ASCENT programs (SRC programs sponsored by DARPA). This work was done outside of Amazon. (E-mail: [email protected])
PY - 2023
Y1 - 2023
N2 - As deep neural networks (DNNs) have been success-fully developed in many applications with continuously increasing complexity, the number of weights in DNNs surges, leading to consistent demands for denser memories than SRAMs. RRAM-based in-memory computing (IMC) achieves high density and energy-efficiency for DNN inference, but RRAM programming remains to be a bottleneck due to high write latency and energy consumption. In this work, we present the Progressive-wRite In-memory program-VErify (PRIVE) scheme, which we verify with an RRAM testchip for IMC-based hardware acceleration for DNNs. We optimize the progressive write operations on different bit positions of RRAM weights to enable error compensation and reduce programming latency/energy, while achieving high DNN accuracy. For 5-bit precision DNNs, PRIVE reduces the RRAM programming energy by 1.82×, while maintaining high accuracy of 91.91% (VGG-7) and 71.47% (ResNet-18) on CIFAR-10 and CIFAR-100 datasets, respectively.
AB - As deep neural networks (DNNs) have been success-fully developed in many applications with continuously increasing complexity, the number of weights in DNNs surges, leading to consistent demands for denser memories than SRAMs. RRAM-based in-memory computing (IMC) achieves high density and energy-efficiency for DNN inference, but RRAM programming remains to be a bottleneck due to high write latency and energy consumption. In this work, we present the Progressive-wRite In-memory program-VErify (PRIVE) scheme, which we verify with an RRAM testchip for IMC-based hardware acceleration for DNNs. We optimize the progressive write operations on different bit positions of RRAM weights to enable error compensation and reduce programming latency/energy, while achieving high DNN accuracy. For 5-bit precision DNNs, PRIVE reduces the RRAM programming energy by 1.82×, while maintaining high accuracy of 91.91% (VGG-7) and 71.47% (ResNet-18) on CIFAR-10 and CIFAR-100 datasets, respectively.
KW - Deep neural network
KW - RRAM programming
KW - in-memory computing
KW - resistive RAM (RRAM)
KW - write-verify
UR - http://www.scopus.com/inward/record.url?scp=85162678594&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85162678594&partnerID=8YFLogxK
U2 - 10.23919/DATE56975.2023.10137266
DO - 10.23919/DATE56975.2023.10137266
M3 - Conference contribution
AN - SCOPUS:85162678594
T3 - Proceedings -Design, Automation and Test in Europe, DATE
BT - 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 17 April 2023 through 19 April 2023
ER -