Abstract
With the popularity of battery-powered edge computing, an important yet under-explored problem is the supporting of DNNs for diverse edge devices. On the one hand, different edge platforms have various runtime requirements and computation/memory capabilities. Deploying the same DNN model is unsatisfiable, while designing a specialized DNN for each platform is prohibitively expensive. On the other hand, for a single edge device, DVFS is leveraged to prolong the battery, incurring significant inference speed variation for the same DNN and consequently poor user experience. To tackle this, we propose Condense, a framework providing a single adaptive model that can be reconfigured (switch to various sub-networks with different computations/parameters) instantly for diverse devices and execution frequencies without any retraining. Experiments demonstrate that Condense can simultaneously provide vast high-accuracy sub-networks with different computations and parameters corresponding to various sparsity ratios to support diverse edge devices with different runtime requirements, and reduce the speed variation under varying frequencies on each device, with a memory cost of only one set of weights.
Original language | English (US) |
---|---|
Title of host publication | 2023 60th ACM/IEEE Design Automation Conference, DAC 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350323481 |
DOIs | |
State | Published - 2023 |
Externally published | Yes |
Event | 60th ACM/IEEE Design Automation Conference, DAC 2023 - San Francisco, United States Duration: Jul 9 2023 → Jul 13 2023 |
Publication series
Name | Proceedings - Design Automation Conference |
---|---|
Volume | 2023-July |
ISSN (Print) | 0738-100X |
Conference
Conference | 60th ACM/IEEE Design Automation Conference, DAC 2023 |
---|---|
Country/Territory | United States |
City | San Francisco |
Period | 7/9/23 → 7/13/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.