Failures in computer systems can be often tracked down to software anomalies of various kinds. In many scenarios, it could be difficult, unfeasible, or unprofitable to carry out extensive debugging activity to spot the causes of anomalies and remove them. In other cases, taking corrective actions may led to undesirable service downtime. In this article we propose an alternative approach to cope with the problem of software anomalies in cloud-based applications, and we present the design of a distributed autonomic framework that implements our approach. It exploits the elastic capabilities of cloud infrastructures, and relies on machine learning models, proactive rejuvenation techniques and a new load balancing approach. By putting together all these elements, we show that it is possible to improve both availability and performance of applications deployed over heterogeneous cloud regions and subject to frequent failures. Overall, our study demonstrates the viability of our approach, thus opening the way towards it adoption, and encouraging further studies and practical experiences to evaluate and improve it.
Autonomic Rejuvenation of Cloud Applications as a Countermeasure to Software Anomalies / DI SANZO, Pierangelo; Avresky, Dimiter R.; Pellegrini, Alessandro. - In: SOFTWARE, PRACTICE AND EXPERIENCE. - ISSN 1097-024X. - 51:1(2021), pp. 46-71. [10.1002/spe.2908]
Autonomic Rejuvenation of Cloud Applications as a Countermeasure to Software Anomalies
Pierangelo Di Sanzo;Alessandro Pellegrini
2021
Abstract
Failures in computer systems can be often tracked down to software anomalies of various kinds. In many scenarios, it could be difficult, unfeasible, or unprofitable to carry out extensive debugging activity to spot the causes of anomalies and remove them. In other cases, taking corrective actions may led to undesirable service downtime. In this article we propose an alternative approach to cope with the problem of software anomalies in cloud-based applications, and we present the design of a distributed autonomic framework that implements our approach. It exploits the elastic capabilities of cloud infrastructures, and relies on machine learning models, proactive rejuvenation techniques and a new load balancing approach. By putting together all these elements, we show that it is possible to improve both availability and performance of applications deployed over heterogeneous cloud regions and subject to frequent failures. Overall, our study demonstrates the viability of our approach, thus opening the way towards it adoption, and encouraging further studies and practical experiences to evaluate and improve it.File | Dimensione | Formato | |
---|---|---|---|
DiSanzo_Autonomic_2021.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.17 MB
Formato
Adobe PDF
|
2.17 MB | Adobe PDF | Contatta l'autore |
DiSanzo_postprint_Autonomic_2021.pdf
accesso aperto
Note: https://doi-org.ezproxy.uniroma1.it/10.1002/spe.2908
Tipologia:
Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
1.33 MB
Formato
Adobe PDF
|
1.33 MB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.