DeCOS: Data-Efficient Reinforcement Learning for Compiler Optimization Selection Ignited by LLM

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Machine learning methods have proven their effectiveness in a wide range of program optimization tasks. These methods selectively map program feature spaces to carefully defined optimization spaces to identify effective optimizations. However, the size and complexity of these spaces often necessitate large amounts of training data to achieve effective mappings. For certain optimization tasks, obtaining accurate training data can be costly, making data efficiency a critical concern. Reinforcement learning (RL) offers a promising solution by dynamically adjusting exploration strategies and selectively requesting training data. In this paper, we propose leveraging reinforcement learning to optimize compilation sequences.This paper presents the Data-efficient Compiler Optimization Selection (DeCOS) system, which utilizes a reinforcement learning engine to perform a guided search of the optimization spaces. To improve the data efficiency in training DeCOS, we utilize synthesized data to configure the RL-architecture; and incorporate simulation results to refine profiling information. To overcome the slow start-up issue in RL-processes, we integrate an LLM into the workflow, leveraging its knowledge to accelerate the initial training phase of the RL-agent. Our experiments show that DeCOS efficiently generates compiler optimization sequences that either match or outperform that of the state-of-the-art optimizer Opentuner. Furthermore, the DeCOS reinforcement learning engine, once trained, demonstrates its versatility by showing portability across different target applications and hardware platforms, highlighting its broad applicability and adaptability.

Original languageEnglish (US)
Title of host publicationACM ICS 2025 - Proceedings of the 39th ACM International Conference on Supercomputing
PublisherAssociation for Computing Machinery
Pages943-958
Number of pages16
ISBN (Electronic)9798400715372
DOIs
StatePublished - Aug 22 2025
Event39th ACM International Conference on Supercomputing, ICS 2025 - Lake City, United States
Duration: Jun 8 2025Jun 11 2025

Publication series

NameProceedings of the International Conference on Supercomputing
VolumePart of 213821

Conference

Conference39th ACM International Conference on Supercomputing, ICS 2025
Country/TerritoryUnited States
CityLake City
Period6/8/256/11/25

Bibliographical note

Publisher Copyright:
© 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Keywords

  • Compilers
  • Language models
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'DeCOS: Data-Efficient Reinforcement Learning for Compiler Optimization Selection Ignited by LLM'. Together they form a unique fingerprint.

Cite this