Interconnect-Aware Area and Energy Optimization for In-Memory Acceleration of DNNs

Gokul Krishnan, Sumit K. Mandal, Chaitali Chakrabarti, Jae Sun Seo, Umit Y. Ogras, Yu Cao

Research output: Contribution to journalArticlepeer-review

27 Scopus citations

Abstract

State-of-the-art in-memory computing (IMC) architectures employ an array of homogeneous tiles and severely underutilize processing elements (PEs). In this article, the researchers propose an area and energy optimization methodology to generate a heterogeneous IMC architecture coupled with an optimized Network-on-Chip (NoC) for deep neural network (DNN) acceleration. The researchers first propose an area-aware optimization technique that improves the PE array utilization. This is achieved by generating a heterogeneous tile-based IMC architecture that consists of tiles of different sizes, with different numbers of PEs where each PE is of the same size. They minimize the communication energy across a large number of tiles using an NoC architecture with optimized tile-to-router mapping and scheduling. Overall, our proposed area and energy optimization methodology generates a heterogeneous IMC architecture coupled with an optimized NoC for DNN acceleration.

Original languageEnglish (US)
Article number9114969
Pages (from-to)79-87
Number of pages9
JournalIEEE Design and Test
Volume37
Issue number6
DOIs
StatePublished - Dec 2020
Externally publishedYes

Bibliographical note

Funding Information:
This work was supported in part by the Center for Brain-Inspired Computing (C-BRIC), one of the six centers in JUMP; in part by the Semiconductor Research Corporation program sponsored by the Defense Advanced Research Projects Agency (DARPA), National Science Foundation (NSF) CAREER under Award CNS-1651624; and in part by the Semiconductor Research Corporation under Grant 2938.001. Gokul Krishnan and Sumit K. Man-dal contributed equally to this work.

Keywords

  • Deep Neural Networks
  • In-Memory Computing
  • Interconnect
  • Network-on-Chip
  • Neural Network Accelerator
  • RRAM

Fingerprint

Dive into the research topics of 'Interconnect-Aware Area and Energy Optimization for In-Memory Acceleration of DNNs'. Together they form a unique fingerprint.

Cite this