We introduce the concept of snipping, complementing that of trimming, in robust cluster analysis. An observation is snipped when some of its dimensions are discarded, but the remaining are used for clustering and estimation. Snipped k-means is performed through a probabilistic optimization algorithm which is guaranteed to converge to the global optimum. We show global robustness properties of our snipped k-means procedure. Simulations and a real data application to optical recognition of handwritten digits are used to illustrate and compare the approach. © 2013 Springer Science+Business Media New York.
Snipping for robust k-means clustering under component-wise contamination / Farcomeni, Alessio. - In: STATISTICS AND COMPUTING. - ISSN 1573-1375. - STAMPA. - 24:6(2014), pp. 907-919. [10.1007/s11222-013-9410-8]
Snipping for robust k-means clustering under component-wise contamination
FARCOMENI, Alessio
2014
Abstract
We introduce the concept of snipping, complementing that of trimming, in robust cluster analysis. An observation is snipped when some of its dimensions are discarded, but the remaining are used for clustering and estimation. Snipped k-means is performed through a probabilistic optimization algorithm which is guaranteed to converge to the global optimum. We show global robustness properties of our snipped k-means procedure. Simulations and a real data application to optical recognition of handwritten digits are used to illustrate and compare the approach. © 2013 Springer Science+Business Media New York.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.