An approach exploiting the principles of Receiver Operating Characteristic (ROC) curves for the simultaneous optimization of both the complexity and the decision threshold in Soft Independent Modeling of Class Analogy (SIMCA) classification models is here proposed. The outcomes resulting from the analysis of two simulated and four real case-studies highlight that, in the presence of strong overlap among various categories of samples, the implemented method can lead to better classification efficiency in external validation, compared to fixing such a threshold a priori. This guarantees a higher robustness toward class dispersion. On the other hand, in cases of clearer and more definite separation among the different groups of observations, their classification performance is equally satisfactory for test samples.

SIMCA Modeling for Overlapping Classes: Fixed or Optimized Decision Threshold? / Vitale, R.; Marini, F.; Ruckebusch, C.. - In: ANALYTICAL CHEMISTRY. - ISSN 0003-2700. - 90:18(2018), pp. 10738-10747. [10.1021/acs.analchem.8b01270]

SIMCA Modeling for Overlapping Classes: Fixed or Optimized Decision Threshold?

Marini F.;
2018

Abstract

An approach exploiting the principles of Receiver Operating Characteristic (ROC) curves for the simultaneous optimization of both the complexity and the decision threshold in Soft Independent Modeling of Class Analogy (SIMCA) classification models is here proposed. The outcomes resulting from the analysis of two simulated and four real case-studies highlight that, in the presence of strong overlap among various categories of samples, the implemented method can lead to better classification efficiency in external validation, compared to fixing such a threshold a priori. This guarantees a higher robustness toward class dispersion. On the other hand, in cases of clearer and more definite separation among the different groups of observations, their classification performance is equally satisfactory for test samples.
2018
classification efficiency; classification models; classification performance; decision threshold; receiver operating characteristic (roc) curves; simultaneous optimization; soft independent modeling of class analogies; test samples
01 Pubblicazione su rivista::01a Articolo in rivista
SIMCA Modeling for Overlapping Classes: Fixed or Optimized Decision Threshold? / Vitale, R.; Marini, F.; Ruckebusch, C.. - In: ANALYTICAL CHEMISTRY. - ISSN 0003-2700. - 90:18(2018), pp. 10738-10747. [10.1021/acs.analchem.8b01270]
File allegati a questo prodotto
File Dimensione Formato  
Vitale_SIMCA_2018.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 459.94 kB
Formato Adobe PDF
459.94 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1281628
Citazioni
  • ???jsp.display-item.citation.pmc??? 6
  • Scopus 33
  • ???jsp.display-item.citation.isi??? 31
social impact