new robust model based clustering method is proposed, which is based on trimming and reweighting. In parallel with the case of location and scatter estimation, reweighting allows to achieve high breakdown together with high efficiency. Additionally, the user does not need to set the trimming level in advance. The method proceeds by fitting an initial robust finite mixture model based on a high trimming level. Observations with high robust Mahalanobis distance from the closest centroid are then flagged as outlying, and centroid estimation is repeated based on the other ones. The procedure is iterated until convergence. It is shown formally, with examples, and with extensive simulation studies that the resulting rtclust procedure can resist to different outlier generating schemes, and is highly efficient in the presence of little or no contamination. Additionally, the new procedure compares well and does not need much tuning.
The rtclust procedure for robust clustering / Farcomeni, Alessio; Dotto, Francesco; Garcia Escudero, Luis Angel; Mayo Iscar, Agustin. - STAMPA. - (2015), pp. 36-36. (Intervento presentato al convegno Computational and Financial Econometrics (CFE 2015) tenutosi a Londra nel 12-14/12/2015).
The rtclust procedure for robust clustering
FARCOMENI, Alessio;DOTTO, FRANCESCO;
2015
Abstract
new robust model based clustering method is proposed, which is based on trimming and reweighting. In parallel with the case of location and scatter estimation, reweighting allows to achieve high breakdown together with high efficiency. Additionally, the user does not need to set the trimming level in advance. The method proceeds by fitting an initial robust finite mixture model based on a high trimming level. Observations with high robust Mahalanobis distance from the closest centroid are then flagged as outlying, and centroid estimation is repeated based on the other ones. The procedure is iterated until convergence. It is shown formally, with examples, and with extensive simulation studies that the resulting rtclust procedure can resist to different outlier generating schemes, and is highly efficient in the presence of little or no contamination. Additionally, the new procedure compares well and does not need much tuning.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.