The merit of simple policies: buying performance with parallelism and system architecture / Yildiz, Mert; Rolich, Alexey; Baiocchi, Andrea. - (2025), pp. 1-6. (INFOCOM 2025 ICCN: International Workshop on Intelligent Cloud Computing and Networking, London, UK) [10.1109/INFOCOMWKSHPS65812.2025.11152765].

The merit of simple policies: buying performance with parallelism and system architecture

Mert Yildiz (first author); Alexey Rolich; Andrea Baiocchi
2025

Abstract

While the scheduling and dispatching of computational workloads are well-investigated subjects, only recently has Google publicly released a large, high-resolution measurement dataset of its cloud workloads. We revisit dispatching and scheduling algorithms, feeding them with traffic workloads derived from those measurements. The main finding is that, when the number of servers in the computing cluster is varied while the overall computational budget is held constant, the mean job response time attains a minimum. Moreover, for suitably high degrees of parallelism, simple policies such as Join Idle Queue appear to attain the same performance as more complex, size-based policies. Better performance still, clearly exceeding that of size-based dispatching policies, is obtained with multistage server clusters, even under very simple policies such as Round Robin. The takeaway is that, under realistic workload traffic, the parallelism and architecture of computing systems may be powerful knobs for controlling performance, even more so than policies.
INFOCOM 2025 ICCN: International Workshop on Intelligent Cloud Computing and Networking
data centers; scheduling; dispatching; large scale multi server system; workload traffic measurements
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this item

File: Yildiz_the-merit-of_2025.pdf
Access: archive administrators only
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 447.79 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1741140