Graph embedding is an established and popular approach when designing graph-based pattern recognition systems. Amongst the several strategies, in the last ten years, Granular Computing emerged as a promising framework for structural pattern recognition. In the late 2000’s, symbolic histograms have been proposed as the driving force in order to perform the graph embedding procedure by counting the number of times each granule of information appears in the graph to be embedded. Similarly to a bag-of-words representation of a text corpora, symbolic histograms have been originally conceived as integer-valued vectorial representation of the graphs. In this paper, we propose six ‘relaxed’ versions of symbolic histograms, where the proper dissimilarity values between the information granules and the constituent parts of the graph to be embedded are taken into account, information which is discarded in the original symbolic histogram formulation due to the hard-limited nature of the counting procedure. Experimental results on six open-access datasets of fully-labelled graphs show comparable performance in terms of classification accuracy with respect to the original symbolic histograms (average accuracy shift ranging from -7% to +2%), counterbalanced by a great improvement in terms of number of resulting information granules, hence number of features in the embedding space (up to 75% less features, on average).

Relaxed dissimilarity-based symbolic histogram variants for granular graph embedding / Baldini, Luca; Martino, Alessio; Rizzi, Antonello. - (2021), pp. 221-235. (Intervento presentato al convegno 13th International Joint Conference on Computational Intelligence tenutosi a Online streaming) [10.5220/0010652500003063].

Relaxed dissimilarity-based symbolic histogram variants for granular graph embedding

Luca Baldini
Primo
;
Antonello Rizzi
Ultimo
2021

Abstract

Graph embedding is an established and popular approach when designing graph-based pattern recognition systems. Amongst the several strategies, in the last ten years, Granular Computing emerged as a promising framework for structural pattern recognition. In the late 2000’s, symbolic histograms have been proposed as the driving force in order to perform the graph embedding procedure by counting the number of times each granule of information appears in the graph to be embedded. Similarly to a bag-of-words representation of a text corpora, symbolic histograms have been originally conceived as integer-valued vectorial representation of the graphs. In this paper, we propose six ‘relaxed’ versions of symbolic histograms, where the proper dissimilarity values between the information granules and the constituent parts of the graph to be embedded are taken into account, information which is discarded in the original symbolic histogram formulation due to the hard-limited nature of the counting procedure. Experimental results on six open-access datasets of fully-labelled graphs show comparable performance in terms of classification accuracy with respect to the original symbolic histograms (average accuracy shift ranging from -7% to +2%), counterbalanced by a great improvement in terms of number of resulting information granules, hence number of features in the embedding space (up to 75% less features, on average).
2021
13th International Joint Conference on Computational Intelligence
structural pattern recognition; supervised learning; embedding spaces; granular computing; graph edit distances; graph embedding
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Relaxed dissimilarity-based symbolic histogram variants for granular graph embedding / Baldini, Luca; Martino, Alessio; Rizzi, Antonello. - (2021), pp. 221-235. (Intervento presentato al convegno 13th International Joint Conference on Computational Intelligence tenutosi a Online streaming) [10.5220/0010652500003063].
File allegati a questo prodotto
File Dimensione Formato  
Baldini_Relaxed-dissimilarity_2021.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 529.39 kB
Formato Adobe PDF
529.39 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1584576
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 3
social impact