Abstract
Resistive random-access memory (RRAM) is a promising technology for in-memory computing with high storage density, fast inference, and good compatibility with CMOS. However, the mapping of a pre-trained deep neural network (DNN) model on RRAM suffers from realistic device issues, especially the variation and quantization error, resulting in a significant reduction in inference accuracy. In this work, we first extract these statistical properties from 65 nm RRAM data on 300mm wafers. The RRAM data present 10-levels in quantization and 50% variance, resulting in an accuracy drop to 31.76% and 10.49% for MNIST and CIFAR-10 datasets, respectively. Based on the experimental data, we propose a combination of machine learning algorithms and on-line adaptation to recover the accuracy with the minimum overhead. The recipe first applies Knowledge Distillation (KD) to transfer an ideal model into a student model with statistical variations and 10 levels. Furthermore, an on-line sparse adaptation (OSA) method is applied to the DNN model mapped on to the RRAM array. Using importance sampling, OSA adds a small SRAM array that is sparsely connected to the main RRAM array; only this SRAM array is updated to recover the accuracy. As demonstrated on MNIST and CIFAR-10 datasets, a 7.86% area cost is sufficient to achieve baseline accuracy for the 65 nm RRAM devices.
Original language | English (US) |
---|---|
Title of host publication | 2020 57th ACM/IEEE Design Automation Conference, DAC 2020 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781450367257 |
DOIs | |
State | Published - Jul 2020 |
Externally published | Yes |
Event | 57th ACM/IEEE Design Automation Conference, DAC 2020 - Virtual, San Francisco, United States Duration: Jul 20 2020 → Jul 24 2020 |
Publication series
Name | Proceedings - Design Automation Conference |
---|---|
Volume | 2020-July |
ISSN (Print) | 0738-100X |
Conference
Conference | 57th ACM/IEEE Design Automation Conference, DAC 2020 |
---|---|
Country/Territory | United States |
City | Virtual, San Francisco |
Period | 7/20/20 → 7/24/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.
Keywords
- In-memory computing
- Knowledge Distillation
- On-line adaptation
- Resistive random access memory (RRAM)
- Robustness