The focus of the present paper is to propose and discuss different procedures for performing variable selection in a multi-block regression context. In particular, the focus is on two multi-block regression methods: Multi-Block Partial Least Squares (MB-PLS) and Sequential and Orthogonalized Partial Least Squares (SO-PLS) regression. A small simulation study for regular PLS regression was conducted in order to select the most promising methods to investigate further in the multi-block context. The combinations of three variable selection methods with MB-PLS and SO-PLS are examined in detail. These methods are Variable Importance in Projection (VIP) Selectivity Ratio (SR) and forward selection. In this paper we focus on both prediction ability and interpretation. The different approaches are tested on three types of data: one sensory data set, one spectroscopic (Raman) data set and a number of simulated multi-block data sets.

Variable selection in multi-block regression / Biancolillo, Alessandra; Liland, Kristian Hovde; Måge, Ingrid; Næs, Tormod; Bro, Rasmus. - In: CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS. - ISSN 0169-7439. - STAMPA. - 156:(2016), pp. 89-101. [10.1016/j.chemolab.2016.05.016]

Variable selection in multi-block regression

BIANCOLILLO, ALESSANDRA;
2016

Abstract

The focus of the present paper is to propose and discuss different procedures for performing variable selection in a multi-block regression context. In particular, the focus is on two multi-block regression methods: Multi-Block Partial Least Squares (MB-PLS) and Sequential and Orthogonalized Partial Least Squares (SO-PLS) regression. A small simulation study for regular PLS regression was conducted in order to select the most promising methods to investigate further in the multi-block context. The combinations of three variable selection methods with MB-PLS and SO-PLS are examined in detail. These methods are Variable Importance in Projection (VIP) Selectivity Ratio (SR) and forward selection. In this paper we focus on both prediction ability and interpretation. The different approaches are tested on three types of data: one sensory data set, one spectroscopic (Raman) data set and a number of simulated multi-block data sets.
2016
MB-PLS; multi-block; Raman; sensory; SO-PLS; variable selection; analytical chemistry; software; computer science applications; computer vision and pattern recognition; spectroscopy; process chemistry and technology
01 Pubblicazione su rivista::01a Articolo in rivista
Variable selection in multi-block regression / Biancolillo, Alessandra; Liland, Kristian Hovde; Måge, Ingrid; Næs, Tormod; Bro, Rasmus. - In: CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS. - ISSN 0169-7439. - STAMPA. - 156:(2016), pp. 89-101. [10.1016/j.chemolab.2016.05.016]
File allegati a questo prodotto
File Dimensione Formato  
Biancolillo_Variable-selection_2016.pdf

solo utenti autorizzati

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 767.94 kB
Formato Adobe PDF
767.94 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/974793
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 56
  • ???jsp.display-item.citation.isi??? 49
social impact