TY - JOUR
T1 - A Hybrid Optical-Electrical Analog Deep Learning Accelerator Using Incoherent Optical Signals
AU - Yang, Mingdai
AU - Lou, Qiuwen
AU - Rajaei, Ramin
AU - Jokar, Mohammad Reza
AU - Qiu, Junyi
AU - Liu, Yuming
AU - Udupa, Aditi
AU - Chong, Frederic T.
AU - Dallesasse, John M.
AU - Feng, Milton
AU - Goddard, Lynford L.
AU - Hu, X. Sharon
AU - Li, Yanjing
N1 - Publisher Copyright:
© 2023 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2023/5/3
Y1 - 2023/5/3
N2 - Optical deep learning (DL) accelerators have attracted significant interests due to their latency and power advantages. In this article, we focus on incoherent optical designs. A significant challenge is that there is no known solution to perform single-wavelength accumulation (a key operation required for DL workloads) using incoherent optical signals efficiently. Therefore, we devise a hybrid approach, where accumulation is done in the electrical domain, and multiplication is performed in the optical domain. The key technology enabler of our design is the transistor laser, which performs electrical-to-optical and optical-to-electrical conversions efficiently. Through detailed design and evaluation of our design, along with a comprehensive benchmarking study against state-of-the-art RRAM-based designs, we derive the following key results:(1) For a four-layer multilayer perceptron network, our design achieves 115× and 17.11× improvements in latency and energy, respectively, compared to the RRAM-based design. We can take full advantage of the speed and energy benefits of the optical technology because the inference task can be entirely mapped onto our design.(2) For a complex workload (Resnet50), weight reprogramming is needed, and intermediate results need to be stored/re-fetched to/from memories. In this case, for the same area, our design still outperforms the RRAM-based design by 15.92× in inference latency, and 8.99× in energy.
AB - Optical deep learning (DL) accelerators have attracted significant interests due to their latency and power advantages. In this article, we focus on incoherent optical designs. A significant challenge is that there is no known solution to perform single-wavelength accumulation (a key operation required for DL workloads) using incoherent optical signals efficiently. Therefore, we devise a hybrid approach, where accumulation is done in the electrical domain, and multiplication is performed in the optical domain. The key technology enabler of our design is the transistor laser, which performs electrical-to-optical and optical-to-electrical conversions efficiently. Through detailed design and evaluation of our design, along with a comprehensive benchmarking study against state-of-the-art RRAM-based designs, we derive the following key results:(1) For a four-layer multilayer perceptron network, our design achieves 115× and 17.11× improvements in latency and energy, respectively, compared to the RRAM-based design. We can take full advantage of the speed and energy benefits of the optical technology because the inference task can be entirely mapped onto our design.(2) For a complex workload (Resnet50), weight reprogramming is needed, and intermediate results need to be stored/re-fetched to/from memories. In this case, for the same area, our design still outperforms the RRAM-based design by 15.92× in inference latency, and 8.99× in energy.
KW - Additional Key Words and PhrasesOptical computing
KW - deep learning accelerator
UR - http://www.scopus.com/inward/record.url?scp=85162030686&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85162030686&partnerID=8YFLogxK
U2 - 10.1145/3584183
DO - 10.1145/3584183
M3 - Article
AN - SCOPUS:85162030686
SN - 1550-4832
VL - 19
JO - ACM Journal on Emerging Technologies in Computing Systems
JF - ACM Journal on Emerging Technologies in Computing Systems
IS - 2
M1 - 14
ER -