EVeREST: An Effective and Versatile Runtime Energy Saving Tool for GPUs

Anna Yue, Pen Chung Yew, Sanyam Mehta

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Amid conflicting demands for ever-improving performance and maximizing energy savings, it is important to have a tool that automatically identifies opportunities to save power/energy at runtime without compromising performance. GPUs in particular present challenges due to (1) reduced savings available from memory bound applications, and (2) limited availability of low overhead performance counters. Thus, a successful tool must address these issues while still tackling the challenges of dynamic application characterization, versatility across processors from different vendors, and effectiveness at making the right power-performance tradeoffs for desired energy savings. We propose Everest, a tool that automatically finds energy saving opportunities across GPUs at runtime. Specifically, Everest finds two unique avenues for saving energy using DVFS in GPUs in addition to the traditional method of lowering core clock for memory bound phases. Everest has very low overhead and works across different GPUs given its reliance on the minimum possible performance events for the needed characterization. Everest works at a finer granularity of individual application phases and utilizes built-in performance estimation to provide desired performance guarantees for an effective solution that outperforms existing solutions on the latest NVIDIA and AMD GPUs.

Original languageEnglish (US)
Title of host publicationPPoPP 2025 - Proceedings of the 2025 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming
PublisherAssociation for Computing Machinery
Pages57-69
Number of pages13
ISBN (Electronic)9798400714436
DOIs
StatePublished - Feb 28 2025
Event30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2025 - Las Vegas, United States
Duration: Mar 1 2025Mar 5 2025

Publication series

NameProceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
ISSN (Print)1542-0205

Conference

Conference30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2025
Country/TerritoryUnited States
CityLas Vegas
Period3/1/253/5/25

Bibliographical note

Publisher Copyright:
© 2025 Copyright held by the owner/author(s).

Keywords

  • DVFS
  • Energy-efficiency
  • GPU
  • Power

Fingerprint

Dive into the research topics of 'EVeREST: An Effective and Versatile Runtime Energy Saving Tool for GPUs'. Together they form a unique fingerprint.

Cite this