Abstract
Stochastic rounding is crucial in the low-bit (e.g., 8-bit) training of deep neural networks (DNNs) to achieve high accuracy. One of the drawbacks of prior studies is that they require a large number of high-precision stochastic rounding units (SRUs) to guarantee low-bit DNN accuracy, which involves considerable hardware overhead. In this paper, we use extremely low-bit SRUs (ESRUs) to save a large number of hardware resources during low-bit DNN training. However, a naively designed ESRU introduces a biased distribution of random numbers, causing accuracy degradation. To address this issue, we further propose an ESRU design with a plateau-shape distribution. The plateau-shape distribution in our ESRU design is implemented with the combination of an LFSR (linear-feedback shift register) and an inverted LFSR, which avoids LFSR packing and turns an inherent LFSR drawback into an advantage in our efficient ESRU design. Experimental results using state-of-the-art DNN models demonstrate that, compared to the prior 24-bit SRU with 24-bit pseudo-random number generators (PRNG), our 8-bit ESRU with 3-bit PRNG reduces the SRU hardware resource usage by 9.75x while achieving slightly higher accuracy.
| Original language | English (US) |
|---|---|
| Title of host publication | 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9783981926378 |
| DOIs | |
| State | Published - 2023 |
| Externally published | Yes |
| Event | 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023 - Antwerp, Belgium Duration: Apr 17 2023 → Apr 19 2023 |
Publication series
| Name | Proceedings -Design, Automation and Test in Europe, DATE |
|---|---|
| Volume | 2023-April |
| ISSN (Print) | 1530-1591 |
Conference
| Conference | 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023 |
|---|---|
| Country/Territory | Belgium |
| City | Antwerp |
| Period | 4/17/23 → 4/19/23 |
Bibliographical note
Publisher Copyright:© 2023 EDAA.
Keywords
- DNNs
- low-bit training
- stochastic rounding
Fingerprint
Dive into the research topics of 'ESRU: Extremely Low-Bit and Hardware-Efficient Stochastic Rounding Unit Design for Low-Bit DNN Training'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS