NWPerf

A system wide performance monitoring tool for large Linux clusters

Ryan Mooney, Kenneth P. Schmidt, Scott Studham, Jarek Nieplocha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Citations (Scopus)

Abstract

We present NWPerf, a new system for analyzing fine granularity performance metric data on large-scale supercomputing clusters. This tool is able to measure application efficiency on a system wide basis from both a global system perspective as well as providing a detailed view of individual applications. NWPerf provides this service while minimizing the impact on the performance of user applications. We describe the type of information that can be derived from the system, and demonstrate how the system was used detect and eliminate a performance problem in an application application that improved performance by up to several thousand percent. The NWPerf architecture has proven to be a stable and scalable platform for gathering performance data on a large 1954-CPU production Linux cluster at PNNL

Original languageEnglish (US)
Title of host publication2004 IEEE International Conference on Cluster Computing, ICCC 2004
Pages379-389
Number of pages11
DOIs
StatePublished - Dec 1 2004
Event2004 IEEE International Conference on Cluster Computing, ICCC 2004 - San Diego, CA, United States
Duration: Sep 20 2004Sep 23 2004

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
ISSN (Print)1552-5244

Other

Other2004 IEEE International Conference on Cluster Computing, ICCC 2004
CountryUnited States
CitySan Diego, CA
Period9/20/049/23/04

Fingerprint

Monitoring
Program processors
Linux

Cite this

Mooney, R., Schmidt, K. P., Studham, S., & Nieplocha, J. (2004). NWPerf: A system wide performance monitoring tool for large Linux clusters. In 2004 IEEE International Conference on Cluster Computing, ICCC 2004 (pp. 379-389). (Proceedings - IEEE International Conference on Cluster Computing, ICCC). https://doi.org/10.1109/CLUSTR.2004.1392637

NWPerf : A system wide performance monitoring tool for large Linux clusters. / Mooney, Ryan; Schmidt, Kenneth P.; Studham, Scott; Nieplocha, Jarek.

2004 IEEE International Conference on Cluster Computing, ICCC 2004. 2004. p. 379-389 (Proceedings - IEEE International Conference on Cluster Computing, ICCC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mooney, R, Schmidt, KP, Studham, S & Nieplocha, J 2004, NWPerf: A system wide performance monitoring tool for large Linux clusters. in 2004 IEEE International Conference on Cluster Computing, ICCC 2004. Proceedings - IEEE International Conference on Cluster Computing, ICCC, pp. 379-389, 2004 IEEE International Conference on Cluster Computing, ICCC 2004, San Diego, CA, United States, 9/20/04. https://doi.org/10.1109/CLUSTR.2004.1392637
Mooney R, Schmidt KP, Studham S, Nieplocha J. NWPerf: A system wide performance monitoring tool for large Linux clusters. In 2004 IEEE International Conference on Cluster Computing, ICCC 2004. 2004. p. 379-389. (Proceedings - IEEE International Conference on Cluster Computing, ICCC). https://doi.org/10.1109/CLUSTR.2004.1392637
Mooney, Ryan ; Schmidt, Kenneth P. ; Studham, Scott ; Nieplocha, Jarek. / NWPerf : A system wide performance monitoring tool for large Linux clusters. 2004 IEEE International Conference on Cluster Computing, ICCC 2004. 2004. pp. 379-389 (Proceedings - IEEE International Conference on Cluster Computing, ICCC).
@inproceedings{39dcc739ccfc4bf6aecf9f5ace0a2649,
title = "NWPerf: A system wide performance monitoring tool for large Linux clusters",
abstract = "We present NWPerf, a new system for analyzing fine granularity performance metric data on large-scale supercomputing clusters. This tool is able to measure application efficiency on a system wide basis from both a global system perspective as well as providing a detailed view of individual applications. NWPerf provides this service while minimizing the impact on the performance of user applications. We describe the type of information that can be derived from the system, and demonstrate how the system was used detect and eliminate a performance problem in an application application that improved performance by up to several thousand percent. The NWPerf architecture has proven to be a stable and scalable platform for gathering performance data on a large 1954-CPU production Linux cluster at PNNL",
author = "Ryan Mooney and Schmidt, {Kenneth P.} and Scott Studham and Jarek Nieplocha",
year = "2004",
month = "12",
day = "1",
doi = "10.1109/CLUSTR.2004.1392637",
language = "English (US)",
isbn = "0780386949",
series = "Proceedings - IEEE International Conference on Cluster Computing, ICCC",
pages = "379--389",
booktitle = "2004 IEEE International Conference on Cluster Computing, ICCC 2004",

}

TY - GEN

T1 - NWPerf

T2 - A system wide performance monitoring tool for large Linux clusters

AU - Mooney, Ryan

AU - Schmidt, Kenneth P.

AU - Studham, Scott

AU - Nieplocha, Jarek

PY - 2004/12/1

Y1 - 2004/12/1

N2 - We present NWPerf, a new system for analyzing fine granularity performance metric data on large-scale supercomputing clusters. This tool is able to measure application efficiency on a system wide basis from both a global system perspective as well as providing a detailed view of individual applications. NWPerf provides this service while minimizing the impact on the performance of user applications. We describe the type of information that can be derived from the system, and demonstrate how the system was used detect and eliminate a performance problem in an application application that improved performance by up to several thousand percent. The NWPerf architecture has proven to be a stable and scalable platform for gathering performance data on a large 1954-CPU production Linux cluster at PNNL

AB - We present NWPerf, a new system for analyzing fine granularity performance metric data on large-scale supercomputing clusters. This tool is able to measure application efficiency on a system wide basis from both a global system perspective as well as providing a detailed view of individual applications. NWPerf provides this service while minimizing the impact on the performance of user applications. We describe the type of information that can be derived from the system, and demonstrate how the system was used detect and eliminate a performance problem in an application application that improved performance by up to several thousand percent. The NWPerf architecture has proven to be a stable and scalable platform for gathering performance data on a large 1954-CPU production Linux cluster at PNNL

UR - http://www.scopus.com/inward/record.url?scp=20444502552&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=20444502552&partnerID=8YFLogxK

U2 - 10.1109/CLUSTR.2004.1392637

DO - 10.1109/CLUSTR.2004.1392637

M3 - Conference contribution

SN - 0780386949

T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC

SP - 379

EP - 389

BT - 2004 IEEE International Conference on Cluster Computing, ICCC 2004

ER -