Abstract
Artificial intelligence has made a significant impact on fields like computer vision, Natural Language Processing (NLP), healthcare, and robotics. However, recent AI models, such as GPT-4 and LLaMAv3, demand significant number of computational resources, pushing monolithic chips to their technological and practical limits. 2.5D chiplet-based heterogeneous architectures have been proposed to address these technological and practical limits. While chiplet optimization for models like Convolutional Neural Networks (CNNs) is well-established, scaling this approach to accommodate diverse AI inference models with different computing primitives, data volumes, and different chiplet sizes is very challenging. A set of hardened IPs and chiplet libraries optimized for a broad range of AI applications is proposed in this work. We derive the set of chiplet configurations that are composable, scalable and reusable by employing an analytical framework trained on a diverse set of AI algorithms. Testing these set of library synthesized configurations on a different set of algorithms, we achieve a 1.99×-3.99× improvement in non-recurring engineering (NRE) chiplet design costs, with minimal performance overhead compared to custom chiplet-based ASIC designs. Similar to soft IPs for SoC development, the library of chiplets improves flexibility, reusability, and efficiency for AI hardware designs.
Original language | English (US) |
---|---|
Title of host publication | 2025 Design, Automation and Test in Europe Conference, DATE 2025 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9783982674100 |
DOIs | |
State | Published - 2025 |
Event | 2025 Design, Automation and Test in Europe Conference, DATE 2025 - Lyon, France Duration: Mar 31 2025 → Apr 2 2025 |
Publication series
Name | Proceedings -Design, Automation and Test in Europe, DATE |
---|---|
ISSN (Print) | 1530-1591 |
Conference
Conference | 2025 Design, Automation and Test in Europe Conference, DATE 2025 |
---|---|
Country/Territory | France |
City | Lyon |
Period | 3/31/25 → 4/2/25 |
Bibliographical note
Publisher Copyright:© 2025 EDAA.
Keywords
- 2.5D
- Accelerator
- Chiplet Libraries
- Hard IP
- Neural Networks
- Optimal Chiplet Size