Modern parallel/distributed simulations can produce large amounts of data. The historical approach of performing analyses at the end of the simulation is unlikely to cope with modern, extremely large-scale analytics jobs. Indeed, the I/O subsystem can quickly become the global bottleneck. Similarly, processing onthe-fly the data produced by simulations can significantly impair the performance in terms of computational capacity and network load. We present a methodology and reference architecture for constructing an autonomic control system to determine at runtime the best placement for data processing (on simulation nodes or a set of external nodes). This allows for a good tradeoff between the load on the simulation’s critical path and the data communication system. Our preliminary experimentation shows that autonomic orchestration is crucial to improve the global performance of a data analysis system, especially when the simulation node’s rate of data production varies during simulation.
Autonomic Orchestration Of In-Situ and In-Transit Data Analytics For Simulation Studies / Du, Xiaorui; Pimpini, Adriano; Piccione, Andrea; Meng, Zhioxiao; Siguenza-Torres, Anibal; Bortoli, Stefano; Knoll, Alois; Pellegrini, Alessandro. - (2023), pp. 781-792. (Intervento presentato al convegno 2023 Winter Simulation Conference, WSC 2023 tenutosi a San Antonio, Texas, USA) [10.1109/WSC60868.2023.10408191].
Autonomic Orchestration Of In-Situ and In-Transit Data Analytics For Simulation Studies
Adriano Pimpini
Secondo
;Andrea Piccione;Alessandro PellegriniUltimo
2023
Abstract
Modern parallel/distributed simulations can produce large amounts of data. The historical approach of performing analyses at the end of the simulation is unlikely to cope with modern, extremely large-scale analytics jobs. Indeed, the I/O subsystem can quickly become the global bottleneck. Similarly, processing onthe-fly the data produced by simulations can significantly impair the performance in terms of computational capacity and network load. We present a methodology and reference architecture for constructing an autonomic control system to determine at runtime the best placement for data processing (on simulation nodes or a set of external nodes). This allows for a good tradeoff between the load on the simulation’s critical path and the data communication system. Our preliminary experimentation shows that autonomic orchestration is crucial to improve the global performance of a data analysis system, especially when the simulation node’s rate of data production varies during simulation.File | Dimensione | Formato | |
---|---|---|---|
Du_postprint_Autonomic-Orchestration_2023.pdf.pdf
accesso aperto
Note: DOI: 10.1109/WSC60868.2023.10408191
Tipologia:
Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
3.8 MB
Formato
Adobe PDF
|
3.8 MB | Adobe PDF | |
Du_Autonomic-Orchestration_2023.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.44 MB
Formato
Adobe PDF
|
2.44 MB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.