Exploring a Forecasting Structure for the Capacity Usage in Backup Storage Systems

Yaobin Qin, Brandon Hoffmann, Yuwei Wang, David J. Lilja

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Protecting data from loss is important in many domains, particularly business. Businesses generally employ third-party protection services to routinely back up data to storage. As the scale of the backup system and the volume of backup data continue to grow significantly, backup jobs fail increasingly frequently. In this study, we present an analysis of data protection system reports written over two years and collected from 3,500 backup systems. We found that inadequate capacity is among the two most frequent causes of the failure of backups. Little research has targeted the development of a forecasting tool to predict the backup storage capacity to mitigate this failure, and accurate prediction becomes more challenging as the number of clients served by the storage increases. Our research highlights the characteristics of enterprise backup storage data and uses the information examined here to develop a forecasting structure. We tested our forecasting structure on empirically obtained data. In comparison with a prevalent backup system forecasting tool, our structure provides better and more robust predictions of the capacity of backup storage systems as well as the size of data generated by a given number of clients and the associated deduplication ratios.

Original language: English (US)
Title of host publication: 2019 IEEE 10th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference, UEMCON 2019
Editors: Satyajit Chakrabarti, Himadri Nath Saha
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 126-134
Number of pages: 9
ISBN (Electronic): 9781728138855
State: Published - Oct 2019
Event: 10th IEEE Annual Ubiquitous Computing, Electronics and Mobile Communication Conference, UEMCON 2019 - New York City, United States
Duration: Oct 10 2019 to Oct 12 2019

Publication series

Name: 2019 IEEE 10th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference, UEMCON 2019

Conference

Conference: 10th IEEE Annual Ubiquitous Computing, Electronics and Mobile Communication Conference, UEMCON 2019
Country: United States
City: New York City
Period: 10/10/19 to 10/12/19

Keywords

  • Backup Storages
  • Backup Systems
  • Capacity Usage
  • Machine Learning
  • Storage Capacity Forecasting
