Abstract
We consider a large distributed service system consisting of n homogeneous servers with infinite capacity FIFO queues. Jobs arrive as a Poisson process of rate λ n/kn (for some positive constant λ and integer kn). Each incoming job consists of kn identical tasks that can be executed in parallel, and that can be encoded into at least kn "replicas" of the same size (by introducing redundancy) so that the job is considered to be completed when any kn replicas associated with it finish their service. Moreover, we assume that servers can experience random slowdowns in their processing rate so that the service time of a replica is the product of its size and a random slowdown. First, we assume that the server slowdowns are shifted exponential and independent of the replica sizes. In this setting we show that the delay of a typical job is asymptotically minimized (as n∞) when the number of replicas per task is a constant that only depends on the arrival rate λ, and on the expected slowdown of servers. Second, we introduce a new model for the server slowdowns in which larger tasks experience less variable slowdowns than smaller tasks. In this setting we show that, under the class of policies where all replicas start their service at the same time, the delay of a typical job is asymptotically minimized (as n\→\∞) when the number of replicas per task is made to depend on the actual size of the tasks being replicated, with smaller tasks being replicated more than larger tasks.
Original language | English (US) |
---|---|
Pages (from-to) | 39-40 |
Number of pages | 2 |
Journal | Performance Evaluation Review |
Volume | 48 |
Issue number | 1 |
DOIs | |
State | Published - Jun 8 2020 |
Externally published | Yes |
Event | 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS 2020 - Boston, United States Duration: Jun 8 2020 → Jun 12 2020 |
Bibliographical note
Funding Information:This work was partially supported by ONR grant N00014-17-1-2790.
Publisher Copyright:
© 2020 Copyright is held by the owner/author(s).
Keywords
- minimizing delay
- partial fork-join
- random slowdowns
- replication