The identification and classification of bio-chemical substances are very important tasks in chemical, biological and forensic analysis. In this work we present a new strategy to improve the accuracy of the supervised classification of this type of data obtained from different analytical techniques that combine two processes: first, a dissimilarity representation of data and second, the selection of templates for the refinement of the representative samples in each class set. In order to evaluate the performance of our proposal, a comparative study between three approaches is presented. As a baseline, entropy template selection (ETS) is performed in the original feature space and selected templates are used for training. The underlying concept of the other two alternatives, is the combination of Dissimilarity Representations and ETS. The first alternative performs ETS in the original feature space and uses the selected templates as prototypes for the generation of the dissimilarity space and as training set. The second one represents the data in the dissimilarity space, and next ETS is performed. The experimental results showed that an adequate combination of the representation in the dissimilarity the space and the selection of templates based on entropy, outperformed the baseline in accuracy and/or efficiency for the majority of the problems studied.

Bio-chemical data classification by dissimilarity representation and template selection / Mendiola-Lau, Victor; Silva Mata, Francisco José; Plasencia Calaña, Yenisel; Talavera Bustamante, Isneri; de Marsico, Maria. - STAMPA. - 10657:(2018), pp. 374-381. (Intervento presentato al convegno 22nd Iberoamerican Congress on Pattern Recognition, CIARP 2017 tenutosi a Valparaiso, Chile nel 2017) [10.1007/978-3-319-75193-1_45].

Bio-chemical data classification by dissimilarity representation and template selection

de Marsico, Maria
2018

Abstract

The identification and classification of bio-chemical substances are very important tasks in chemical, biological and forensic analysis. In this work we present a new strategy to improve the accuracy of the supervised classification of this type of data obtained from different analytical techniques that combine two processes: first, a dissimilarity representation of data and second, the selection of templates for the refinement of the representative samples in each class set. In order to evaluate the performance of our proposal, a comparative study between three approaches is presented. As a baseline, entropy template selection (ETS) is performed in the original feature space and selected templates are used for training. The underlying concept of the other two alternatives, is the combination of Dissimilarity Representations and ETS. The first alternative performs ETS in the original feature space and uses the selected templates as prototypes for the generation of the dissimilarity space and as training set. The second one represents the data in the dissimilarity space, and next ETS is performed. The experimental results showed that an adequate combination of the representation in the dissimilarity the space and the selection of templates based on entropy, outperformed the baseline in accuracy and/or efficiency for the majority of the problems studied.
2018
22nd Iberoamerican Congress on Pattern Recognition, CIARP 2017
Bio-chemical data; Classification; Dissimilarity representation; Entropy; Template selection; Theoretical Computer Science
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Bio-chemical data classification by dissimilarity representation and template selection / Mendiola-Lau, Victor; Silva Mata, Francisco José; Plasencia Calaña, Yenisel; Talavera Bustamante, Isneri; de Marsico, Maria. - STAMPA. - 10657:(2018), pp. 374-381. (Intervento presentato al convegno 22nd Iberoamerican Congress on Pattern Recognition, CIARP 2017 tenutosi a Valparaiso, Chile nel 2017) [10.1007/978-3-319-75193-1_45].
File allegati a questo prodotto
File Dimensione Formato  
DeMarsico_Bio-Chemical_2018.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 506.03 kB
Formato Adobe PDF
506.03 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1074074
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact