TY - JOUR
T1 - MNSIM 2.0
T2 - A Behavior-Level Modeling Tool for Processing-In-Memory Architectures
AU - Zhu, Zhenhua
AU - Sun, Hanbo
AU - Xie, Tongxin
AU - Zhu, Yu
AU - Dai, Guohao
AU - Xia, Lixue
AU - Niu, Dimin
AU - Chen, Xiaoming
AU - Hu, Xiaobo Sharon
AU - Cao, Yu
AU - Xie, Yuan
AU - Yang, Huazhong
AU - Wang, Yu
N1 - Publisher Copyright:
© 1982-2012 IEEE.
PY - 2023/11/1
Y1 - 2023/11/1
N2 - In the age of artificial intelligence (AI), the huge data movements between memory and computing units become the bottleneck of von Neumann architectures, i.e., the 'memory wall' problem. In order to tackle this challenge, processing-in-memory (PIM) architectures are proposed, which perform in-situ computations in memory and give alternative solutions to boost the computing energy efficiency and performance. Because of the large-scale neural network (NN) algorithm models and the huge hardware design space, various factors affect computing accuracy and performance, bringing the need for efficient PIM modeling and evaluation tools. In this work, we propose a behavior-level modeling tool, MNSIM 2.0, to model the performance of PIM architectures efficiently. At the hardware level, MNSIM 2.0 provides a hierarchical PIM modeling structure with flexible architecture configurability and components extensibility. Moreover, the first unified PIM memory array model is proposed for describing both digital and analog PIM. At the algorithm level, MNSIM 2.0 supports the PIM-based NN computing accuracy simulation considering various architecture and device parameters. A PIM-oriented NN model training and quantization flow is also integrated to improve the performance gain brought by PIM. At the scheduling level, MNSIM 2.0 adopts a universal scheduling description compatible with different scheduling strategies. Validation using fabricated PIM macros shows the relative modeling error rate of MNSIM 2.0 is 3.8%-5.5%. Case studies show that MNSIM 2.0 enables PIM design space explorations, influences analysis of device parameters, and architecture design insight discoveries.
AB - In the age of artificial intelligence (AI), the huge data movements between memory and computing units become the bottleneck of von Neumann architectures, i.e., the 'memory wall' problem. In order to tackle this challenge, processing-in-memory (PIM) architectures are proposed, which perform in-situ computations in memory and give alternative solutions to boost the computing energy efficiency and performance. Because of the large-scale neural network (NN) algorithm models and the huge hardware design space, various factors affect computing accuracy and performance, bringing the need for efficient PIM modeling and evaluation tools. In this work, we propose a behavior-level modeling tool, MNSIM 2.0, to model the performance of PIM architectures efficiently. At the hardware level, MNSIM 2.0 provides a hierarchical PIM modeling structure with flexible architecture configurability and components extensibility. Moreover, the first unified PIM memory array model is proposed for describing both digital and analog PIM. At the algorithm level, MNSIM 2.0 supports the PIM-based NN computing accuracy simulation considering various architecture and device parameters. A PIM-oriented NN model training and quantization flow is also integrated to improve the performance gain brought by PIM. At the scheduling level, MNSIM 2.0 adopts a universal scheduling description compatible with different scheduling strategies. Validation using fabricated PIM macros shows the relative modeling error rate of MNSIM 2.0 is 3.8%-5.5%. Case studies show that MNSIM 2.0 enables PIM design space explorations, influences analysis of device parameters, and architecture design insight discoveries.
KW - Hardware modeling tool
KW - processing-in-memory (PIM)
KW - software-hardware co-optimization
UR - http://www.scopus.com/inward/record.url?scp=85149457607&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149457607&partnerID=8YFLogxK
U2 - 10.1109/TCAD.2023.3251696
DO - 10.1109/TCAD.2023.3251696
M3 - Article
AN - SCOPUS:85149457607
SN - 0278-0070
VL - 42
SP - 4112
EP - 4125
JO - IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
JF - IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
IS - 11
ER -