Principal-Component Analysis (PCA) is a fundamental tool in data science and machine learning, used for compressing, analyzing, visualizing, and processing large datasets. At the same time, temporal segmentation is important for coherent component analysis of big data collections generated by time-varying distributions. However, both segmentation and PCA can be critically affected and misled by corrupted points that often exist in big data collections. To address these issues, we propose a novel and robust method for joint segmentation and principal-component analysis of time-varying data, based on L1-norm formulations. Our proposed method estimates robust L1-norm principal components (L1-PCs) over different temporal horizons and combines them to perform outlier detection, data segmentation, and subspace estimation. Numerical studies on real-world data, including videos and smartphone-sensed human body motion measurements, corroborate the merits of the proposed method in terms of segmentation, PCA, and outlier detection/removal.
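As an illustration of the L1-norm principal component mentioned in the abstract, the sketch below computes the first L1-PC of a small data window and compares it with the standard (L2) principal component when one column is corrupted. It relies on the known identity that maximizing ||X^T q||_1 over unit vectors q is equivalent to maximizing ||X b||_2 over sign vectors b, with q = X b / ||X b||_2. The function name `l1_pc_exhaustive`, the synthetic data, and the single-window setup are assumptions made for this example; this is not the joint segmentation procedure proposed in the paper, and the exhaustive search is practical only for a handful of samples.

```python
import itertools
import numpy as np


def l1_pc_exhaustive(X):
    """First L1-norm principal component of X (D features x N samples).

    Hypothetical helper for illustration: searches all sign vectors b in
    {-1, +1}^N for the maximizer of ||X b||_2 and returns X b / ||X b||_2.
    The cost grows as 2^N, so N must be small. Assumes X is not all zeros.
    """
    _, n_samples = X.shape
    best_val, best_b = -1.0, None
    # Fix b[0] = +1, since b and -b yield the same component up to sign.
    for tail in itertools.product((-1.0, 1.0), repeat=n_samples - 1):
        b = np.array((1.0,) + tail)
        val = np.linalg.norm(X @ b)
        if val > best_val:
            best_val, best_b = val, b
    q = X @ best_b
    return q / np.linalg.norm(q)


# Synthetic window (an assumption, not data from the paper): ten 2-D points
# close to a line, with the first point replaced by a large outlier.
rng = np.random.default_rng(0)
direction = np.array([1.0, 0.2])
X = np.outer(direction, rng.standard_normal(10))
X += 0.05 * rng.standard_normal((2, 10))
X[:, 0] = [0.0, 25.0]

q_l1 = l1_pc_exhaustive(X)

# Standard L2 principal component: dominant left singular vector of X.
q_l2 = np.linalg.svd(X, full_matrices=False)[0][:, 0]

print("L1-PC:", q_l1)
print("L2-PC:", q_l2)
print("L1 projection energy (L1-PC):", np.abs(X.T @ q_l1).sum())
print("L1 projection energy (L2-PC):", np.abs(X.T @ q_l2).sum())
```

Printing both components makes it possible to see how far each one is pulled toward the corrupted column. By construction, the L1-PC attains an L1 projection energy at least as large as that of the L2 component.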

Joint Analysis and Segmentation of Time-Varying Data with Outliers / Colonnese, Stefania; Scarano, Gaetano; Marra, Marcello; Markopoulos, Panos P.; Pados, Dimitris A.. - In: DIGITAL SIGNAL PROCESSING. - ISSN 1051-2004. - 145:February 2024(2024), pp. 1-15. [10.1016/j.dsp.2023.104338]

Joint Analysis and Segmentation of Time-Varying Data with Outliers

Stefania Colonnese; Gaetano Scarano; Marcello Marra
2024

data segmentation; L1-norm principal-component analysis; time series; outliers; subspace clustering
01 Journal publication::01a Journal article
Files attached to this item

File: Colonnese_Joint-analysis_2024.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 3.76 MB
Format: Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1693586