Autoencoders have been recently applied to outlier detection. However, neural networks are known to be vulnerable to overfitting, and therefore have limited potential in the unsupervised outlier detection setting. The majority of existing deep learning methods for anomaly detection is sensitive to contamination of the training data to anomalous instances. To overcome the aforementioned limitations we develop a Boosting-based Autoencoder Ensemble approach (BAE). BAE is an unsupervised ensemble method that, similarly to boosting, builds an adaptive cascade of autoencoders to achieve improved and robust results. BAE trains the autoencoder components sequentially by performing a weighted sampling of the data, aimed at reducing the amount of outliers used during training, and at injecting diversity in the ensemble. We perform extensive experiments and show that the proposed methodology outperforms state-of-the-art approaches under a variety of conditions.

Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection / Sarvari, H.; Domeniconi, C.; Prenkaj, B.; Stilo, G.. - 12712 LNAI:(2021), pp. 91-103. (Intervento presentato al convegno Pacific-Asia Conference on Knowledge Discovery and Data Mining tenutosi a Dehli, India) [10.1007/978-3-030-75762-5_8].

Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection

Prenkaj, B.
Penultimo
Software
;
Stilo, G.
Ultimo
Supervision
2021

Abstract

Autoencoders have been recently applied to outlier detection. However, neural networks are known to be vulnerable to overfitting, and therefore have limited potential in the unsupervised outlier detection setting. The majority of existing deep learning methods for anomaly detection is sensitive to contamination of the training data to anomalous instances. To overcome the aforementioned limitations we develop a Boosting-based Autoencoder Ensemble approach (BAE). BAE is an unsupervised ensemble method that, similarly to boosting, builds an adaptive cascade of autoencoders to achieve improved and robust results. BAE trains the autoencoder components sequentially by performing a weighted sampling of the data, aimed at reducing the amount of outliers used during training, and at injecting diversity in the ensemble. We perform extensive experiments and show that the proposed methodology outperforms state-of-the-art approaches under a variety of conditions.
2021
Pacific-Asia Conference on Knowledge Discovery and Data Mining
anomaly detection; boosting mechanism
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection / Sarvari, H.; Domeniconi, C.; Prenkaj, B.; Stilo, G.. - 12712 LNAI:(2021), pp. 91-103. (Intervento presentato al convegno Pacific-Asia Conference on Knowledge Discovery and Data Mining tenutosi a Dehli, India) [10.1007/978-3-030-75762-5_8].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1699645
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 6
social impact