hfplayer: Scalable Replay for Intensive Block I/O Workloads

Alireza Haghdoost, Weiping He, Jerry Fredin, David H Du

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

We introduce new methods to replay intensive block I/O workloads more accurately. These methods can be used to reproduce realistic workloads for benchmarking, performance validation, and tuning of a high-performance block storage device/system. In this article, we study several sources in the stock operating system that introduce uncertainty in the workload replay. Based on the remedies of these findings, we design and develop a new replay tool called hfplayer that replays intensive block I/O workloads in a similar unscaled environment with more accuracy. To replay a given workload trace in a scaled environment with faster storage or host server, the dependency between I/O requests becomes crucial since the timing and ordering of I/O requests is expected to change according to these dependencies. Therefore, we propose a heuristic way of speculating I/O dependencies in a block I/O trace. Using the generated dependency graph, hfplayer tries to propagate I/O related performance gains appropriately along the I/O dependency chains and mimics the original application behavior when it executes in a scaled environment with slower or faster storage system and servers. We evaluate hfplayer with a wide range of workloads using several accuracy metrics and find that it produces better accuracy when compared to other replay approaches.

Original languageEnglish (US)
Article numberY
JournalACM Transactions on Storage
Volume13
Issue number4
DOIs
StatePublished - Dec 2017

Bibliographical note

Funding Information:
This work was supported by NSF I/UCRC Center for Research in Intelligent Storage (CRIS) and the National Science Foundation (NSF) under awards 130523, 1439622, and 1525617, as well as by NetApp. Authors’ addresses: A. Haghdoost, W. He, and D. H. C. Du, 200 Union Street SE Minneapolis, MN 55455; emails: {alireza, weihe, du}@cs.umn.edu; J. Fredin, 3718 N Rock Rd, Wichita, KS 67226; email: jerry@jerryfredin.com. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. © 2017 ACM 1553-3077/2017/12-ART39 $15.00 https://doi.org/10.1145/3149392

Publisher Copyright:
© 2017 ACM.

Keywords

  • I/O workload
  • Storage area network
  • Storage performance
  • Workload replay

Fingerprint

Dive into the research topics of 'hfplayer: Scalable Replay for Intensive Block I/O Workloads'. Together they form a unique fingerprint.

Cite this