Abstract
We describe the process and outcome of our efforts to port a legacy Fortran benchmark code to heterogeneous GPU-accelerated computing architectures using OpenMP. The benchmark code is one of the multi-zone NAS Parallel Benchmarks (NPB-MZ) called SP-MZ. This “mini-app” mimics the computation and data movement found in popular legacy and modern implicit computational fluid dynamics (CFD) solvers. Our objective was to examine how efficiently legacy Fortran codes can be ported to accelerators by leveraging OpenMP directives. We describe the development and optimization process and demonstrate the performance impact of various code modifications. We show selected profiling results from the NVIDIA Visual Profiler (nvvp) to help others diagnose and overcome performance issues in their own applications. We present results for two compute systems equipped with NVIDIA V100 accelerators.
Original language | English (US) |
---|---|
Title of host publication | High Performance Computing - 35th International Conference, ISC High Performance 2020, Proceedings |
Editors | Ponnuswamy Sadayappan, Bradford L. Chamberlain, Guido Juckeland, Hatem Ltaief |
Publisher | Springer |
Pages | 270-287 |
Number of pages | 18 |
ISBN (Print) | 9783030507428 |
State | Published - 2020 |
Event | 35th International Conference on High Performance Computing, ISC High Performance 2020 - Frankfurt, Germany; Duration: Jun 22 2020 → Jun 25 2020 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12151 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 35th International Conference on High Performance Computing, ISC High Performance 2020 |
---|---|
Country/Territory | Germany |
City | Frankfurt |
Period | 6/22/20 → 6/25/20 |
Bibliographical note
Publisher Copyright: © Springer Nature Switzerland AG 2020.
Keywords
- Accelerator
- Fortran
- GPU
- Implicit CFD
- OpenMP