Scalable Load Balancing in the Presence of Heterogeneous Servers

Kristen Gardner, Jazeem Abdul Jaleel, Alexander Wickeham, Sherwin Doroudi

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

In large-scale computer systems, deciding how to dispatch arriving jobs to servers is a primary factor affecting system performance. Consequently, there is a wealth of literature on designing, analyzing, and evaluating the performance of load balancing policies. For analytical tractability, most existing work on dispatching in large-scale systems makes a key assumption: that the servers are homogeneous, meaning that they all have the same speeds, capabilities, and available resources. But this assumption is not accurate in practice. Modern computer systems are instead heterogeneous: server farms may consist of multiple generations of hardware, servers with varied resources, or even virtual machines running in a cloud environment. Given the ubiquity of heterogeneity in today's systems, it is critically important to develop load balancing policies that perform well in heterogeneous environments. In this paper, we focus on systems in which server speeds are heterogeneous.

Original languageEnglish (US)
Pages (from-to)37-38
Number of pages2
JournalPerformance Evaluation Review
Volume48
Issue number3
DOIs
StatePublished - Mar 5 2021

Bibliographical note

Publisher Copyright:
© 2021 Copyright is held by the owner/author(s).

Fingerprint

Dive into the research topics of 'Scalable Load Balancing in the Presence of Heterogeneous Servers'. Together they form a unique fingerprint.

Cite this