TY - JOUR
T1 - Performance Evaluation of Spintronic-Based Spiking Neural Networks using Parallel Discrete-Event Simulation
AU - Cruz-Camacho, Elkin
AU - Qian, Siyuan
AU - Shukla, Ankit
AU - McGlohon, Neil
AU - Rakheja, Shaloo
AU - Carothers, Christopher
N1 - This material is based on research sponsored by Air Force Research Laboratory under agreement number FA8750-21-1- 1010. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of Air Force Research Laboratory or the U.S. Government. Additionally, computational time on the AiMOS supercomputer was provided by Rensselaer s Center for Computational Innovations. We thank the time and careful reading by the reviewers of the multiple versions of the manuscript. Without them, this paper would not be what it is.
This material is based on research sponsored by Air Force Research Laboratory under agreement number FA8750-21-1-1010. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of Air Force Research Laboratory or the U.S. Government. Additionally, computational time on the AiMOS supercomputer was provided by Rensselaer\u2019s Center for Computational Innovations.
PY - 2024/11/25
Y1 - 2024/11/25
N2 - Spintronic devices that use the spin of electrons as the information state variable have the potential to emulate neuro-synaptic dynamics and can be realized within a compact form-factor, while operating at ultra-low energy-delay point. In this paper, we benchmark the performance of a spintronics hardware platform designed for handling neuromorphic tasks.To explore the benefits of spintronics-based hardware on realistic neuromorphic workloads, we developed a Parallel Discrete-Event Simulation model called Doryta, which is further integrated with a materials-to-systems benchmarking framework. The benchmarking framework allows us to obtain quantitative metrics on the throughput and energy of spintronics-based neuromorphic computing and compare these against standard CMOS-based approaches. Although spintronics hardware offers significant energy and latency advantages, we find that for larger neuromorphic circuits, the performance is limited by the interconnection networks rather than the spintronics-based neurons and synapses. This limitation can be overcome by architectural changes to the network.Through Doryta we are also able to show the power of neuromorphic computing by simulating Conway's Game of Life (GoL), thus showing that it is Turing complete. We show that Doryta obtains over 300× speedup using 1,024 CPU cores when tested on a convolutional, sparse, neural architecture. When scaled-up 64 times, to a 200 million neuron model, the simulation ran in 3:42 minutes for a total of 2,000 virtual clock steps. The conservative approach of execution was found to be faster in most cases than the optimistic approach, even when a tie-breaking mechanism to guarantee deterministic execution, was deactivated.
AB - Spintronic devices that use the spin of electrons as the information state variable have the potential to emulate neuro-synaptic dynamics and can be realized within a compact form-factor, while operating at ultra-low energy-delay point. In this paper, we benchmark the performance of a spintronics hardware platform designed for handling neuromorphic tasks.To explore the benefits of spintronics-based hardware on realistic neuromorphic workloads, we developed a Parallel Discrete-Event Simulation model called Doryta, which is further integrated with a materials-to-systems benchmarking framework. The benchmarking framework allows us to obtain quantitative metrics on the throughput and energy of spintronics-based neuromorphic computing and compare these against standard CMOS-based approaches. Although spintronics hardware offers significant energy and latency advantages, we find that for larger neuromorphic circuits, the performance is limited by the interconnection networks rather than the spintronics-based neurons and synapses. This limitation can be overcome by architectural changes to the network.Through Doryta we are also able to show the power of neuromorphic computing by simulating Conway's Game of Life (GoL), thus showing that it is Turing complete. We show that Doryta obtains over 300× speedup using 1,024 CPU cores when tested on a convolutional, sparse, neural architecture. When scaled-up 64 times, to a 200 million neuron model, the simulation ran in 3:42 minutes for a total of 2,000 virtual clock steps. The conservative approach of execution was found to be faster in most cases than the optimistic approach, even when a tie-breaking mechanism to guarantee deterministic execution, was deactivated.
KW - Spiking neural networks
KW - Turing completeness
KW - chip performance
KW - energy estimation
KW - game of life
KW - parallel discrete event simulation
KW - spintronic devices
UR - http://www.scopus.com/inward/record.url?scp=85210762301&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85210762301&partnerID=8YFLogxK
U2 - 10.1145/3649464
DO - 10.1145/3649464
M3 - Article
AN - SCOPUS:85210762301
SN - 1049-3301
VL - 35
JO - ACM Transactions on Modeling and Computer Simulation
JF - ACM Transactions on Modeling and Computer Simulation
IS - 1
M1 - 4
ER -