Relaxed dissimilarity-based symbolic histogram variants for granular graph embedding

Baldini, Luca; Martino, Alessio; Rizzi, Antonello

doi:10.5220/0010652500003063

Graph embedding is an established and popular approach to designing graph-based pattern recognition systems. Among the several strategies, Granular Computing has emerged over the last ten years as a promising framework for structural pattern recognition. In the late 2000s, symbolic histograms were proposed as the driving force of the graph embedding procedure: they count the number of times each granule of information appears in the graph to be embedded. Similarly to a bag-of-words representation of a text corpus, symbolic histograms were originally conceived as integer-valued vector representations of the graphs. In this paper, we propose six ‘relaxed’ versions of symbolic histograms that take into account the actual dissimilarity values between the information granules and the constituent parts of the graph to be embedded. The original symbolic histogram formulation discards this information because of the hard-limited nature of the counting procedure. Experimental results on six open-access datasets of fully-labelled graphs show comparable classification accuracy with respect to the original symbolic histograms (average accuracy shift ranging from -7% to +2%), counterbalanced by a large reduction in the number of resulting information granules, and hence in the number of features in the embedding space (up to 75% fewer features, on average).
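The difference between the hard count and a dissimilarity-aware embedding can be illustrated with a minimal sketch. It is not the authors' implementation and does not reproduce any of the six variants, which the abstract does not specify. The sketch assumes a precomputed dissimilarity matrix `D` with values normalized to [0, 1] (rows are substructures of the graph, columns are information granules) and a hypothetical matching threshold `tau`; the function names and the "sum of similarities" weighting are illustrative choices only.

```python
from typing import List


def hard_histogram(D: List[List[float]], tau: float) -> List[int]:
    """Original-style count: for each granule (column), count the
    substructures (rows) whose dissimilarity is within the threshold tau."""
    n_granules = len(D[0]) if D else 0
    return [sum(1 for row in D if row[j] <= tau) for j in range(n_granules)]


def relaxed_histogram(D: List[List[float]], tau: float) -> List[float]:
    """Illustrative relaxed variant (an assumption, not one of the paper's
    six): each match contributes its similarity 1 - d instead of a flat 1."""
    n_granules = len(D[0]) if D else 0
    return [
        sum(1.0 - row[j] for row in D if row[j] <= tau)
        for j in range(n_granules)
    ]


# Toy data: 3 substructures of one graph compared against 2 granules.
D = [
    [0.10, 0.80],
    [0.30, 0.20],
    [0.90, 0.25],
]

print(hard_histogram(D, tau=0.5))     # both granules get the same count
print(relaxed_histogram(D, tau=0.5))  # the real-valued entries differ
```

In this toy case both granules match two substructures, so the integer-valued histogram cannot tell them apart, while the real-valued entries retain how close the matches were.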

Relaxed dissimilarity-based symbolic histogram variants for granular graph embedding / Baldini, Luca; Martino, Alessio; Rizzi, Antonello. - (2021), pp. 221-235. (Paper presented at the 13th International Joint Conference on Computational Intelligence, held via online streaming) [10.5220/0010652500003063].

Luca Baldini (first author); Alessio Martino (second author); Antonello Rizzi (last author)

2021

Year of publication: 2021
Conference name: 13th International Joint Conference on Computational Intelligence
Keywords: structural pattern recognition; supervised learning; embedding spaces; granular computing; graph edit distances; graph embedding
Type: 04 Publication in conference proceedings::04b Conference paper in volume

Files attached to this record

File	Access	Type	License	Size	Format
Baldini_Relaxed-dissimilarity_2021.pdf	Archive managers only	Publisher's version (published version with the publisher's layout)	All rights reserved	529.39 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1584576
