Research products catalog

Automatic entity labeling through explanation techniques

Castano S.; Ferrara A.; Firmani D.; Mathew J. G.; Montanelli S.

2021

Abstract

Entity resolution (ER) aims at matching records that refer to the same real-world entity, e.g., the same product sold by different websites. Recent solutions to this problem have reached unprecedented accuracy. Nonetheless, due to intrinsic limitations of automatic testing methods, it is known among researchers and practitioners that a significant manual effort is still required in production environments for verification and cleaning of ER results. In order to facilitate such activity, we are developing the E2L methodology (Entity to Labels) for automatic computation of human-readable labels of identified entities. Given a selection of entities for which the user wants to compute labels, E2L first extracts relevant features by training a classifier on the ER results, then it leverages the notion of black-box model explanation to select the most important terms for the classifier, and finally it uses those terms to compute labels. In this paper we report our first experiences with E2L. Preliminary results on a real-world application scenario show that E2L labels can provide an accurate description of entities and a natural way for humans to assess the trustworthiness of ER results at a glance.
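The three steps named in the abstract (train a classifier on the ER results, explain it as a black box, turn the most important terms into labels) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the tiny naive Bayes model stands in for whatever classifier E2L actually uses, the occlusion-based `term_importance` stands in for its explanation technique, and the names `NaiveBayes`, `term_importance`, `label_entity` and the sample `clusters` are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    return text.lower().split()


class NaiveBayes:
    """Tiny multinomial naive Bayes; a stand-in for any black-box classifier."""

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.priors = Counter(labels)
        self.counts = {c: Counter() for c in self.classes}
        for doc, c in zip(docs, labels):
            self.counts[c].update(tokenize(doc))
        self.vocab = {t for c in self.classes for t in self.counts[c]}
        return self

    def proba(self, doc, target):
        """Probability that the record `doc` belongs to entity `target`."""
        scores = {}
        for c in self.classes:
            # Laplace smoothing so unseen terms do not zero out a class.
            total = sum(self.counts[c].values()) + len(self.vocab)
            s = math.log(self.priors[c])
            for t in tokenize(doc):
                s += math.log((self.counts[c][t] + 1) / total)
            scores[c] = s
        top = max(scores.values())
        norm = sum(math.exp(s - top) for s in scores.values())
        return math.exp(scores[target] - top) / norm


def term_importance(model, records, entity):
    """Occlusion-style explanation: mean drop in P(entity) when a term is removed."""
    importance = Counter()
    for rec in records:
        tokens = tokenize(rec)
        base = model.proba(rec, entity)
        for t in set(tokens):
            reduced = " ".join(x for x in tokens if x != t)
            importance[t] += (base - model.proba(reduced, entity)) / len(records)
    return importance


def label_entity(model, clusters, entity, k=3):
    """Build a label from the k terms the classifier relies on most."""
    imp = term_importance(model, clusters[entity], entity)
    ranked = sorted(imp.items(), key=lambda kv: (-kv[1], kv[0]))
    return " ".join(t for t, _ in ranked[:k])


# Hypothetical ER output: entity id -> records matched to that entity.
clusters = {
    "e1": ["canon eos 5d camera body", "canon eos 5d digital camera black"],
    "e2": ["nikon d750 camera body", "nikon d750 digital camera kit"],
}
docs = [r for recs in clusters.values() for r in recs]
labels = [e for e, recs in clusters.items() for _ in recs]
model = NaiveBayes().fit(docs, labels)

for entity in clusters:
    print(entity, "->", label_entity(model, clusters, entity))
```

In this toy setting, terms shared by both entities (such as "camera") get a low or negative importance because removing them does not hurt the classifier, so they are not selected for the label. A human reviewer can then compare the label with the records in the cluster to judge whether the match looks trustworthy.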

Full record

Publication year: 2021
Conference name: 29th Italian Symposium on Advanced Database Systems, SEBD 2021
Keywords: entity labeling; machine learning; explainable AI
Type: 04 Publication in conference proceedings::04b Conference paper in volume
Citation: Automatic entity labeling through explanation techniques / Castano, S.; Ferrara, A.; Firmani, D.; Mathew, J. G.; Montanelli, S. - 2994:(2021), pp. 299-306. (29th Italian Symposium on Advanced Database Systems, SEBD 2021, Pizzo Calabro, Italy).
Belongs to type: 04b Conference paper in volume

Files attached to this product

File	Access	Type	License	Size	Format
Castano_Automatic-entity-labeling_2021.pdf	open access	Publisher's version (published version with the publisher's layout)	Creative Commons	860.7 kB	Adobe PDF
Castano_Automatic-entity-labeling_indice_2021.pdf	open access	Other attached material	Creative Commons	99.69 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1643753
