Resource bundles: Using aggregation for statistical large-scale resource discovery and management

Michael Cardosa, Abhishek Chandra

Research output: Contribution to journalArticlepeer-review

13 Scopus citations


Resource discovery is an important process for finding suitable nodes that satisfy application requirements in large loosely coupled distributed systems. Besides internode heterogeneity, many of these systems also show a high degree of intranode dynamism, so that selecting nodes based only on their recently observed resource capacities can lead to poor deployment decisions resulting in application failures or migration overheads. However, most existing resource discovery mechanisms rely mainly on recent observations to achieve scalability in large systems. In this paper, we propose the notion of a resource bundlea representative resource usage distribution for a group of nodes with similar resource usage patternsthat employs two complementary techniques to overcome the limitations of existing techniques: resource usage histograms to provide statistical guarantees for resource capacities and clustering-based resource aggregation to achieve scalability. Using trace-driven simulations and data analysis of a month-long PlanetLab trace, we show that resource bundles are able to provide high accuracy for statistical resource discovery, while achieving high scalability. We also show that resource bundles are ideally suited for identifying group-level characteristics (e.g., hot spots, total group capacity). To automatically parameterize the bundling algorithm, we present an adaptive algorithm that can detect online fluctuations in resource heterogeneity.

Original languageEnglish (US)
Article number5226621
Pages (from-to)1089-1102
Number of pages14
JournalIEEE Transactions on Parallel and Distributed Systems
Issue number8
StatePublished - 2010

Bibliographical note

Funding Information:
This work was supported in part by the US National Science Foundation (NSF) CAREER Award CNS-0643505.


  • Resource discovery
  • aggregation
  • machine learning
  • resource management


Dive into the research topics of 'Resource bundles: Using aggregation for statistical large-scale resource discovery and management'. Together they form a unique fingerprint.

Cite this