We present ASPEN+, which extends an existing ASP-based system, ASPEN, for collective entity resolution with two important functionalities: support for local merges and new optimality criteria for preferred solutions. Indeed, ASPEN only supports so-called global merges of entity-referring constants (e.g. author ids), in which all occurrences of matched constants are treated as equivalent and merged accordingly. However, it has been argued that when resolving data values, local merges are often more appropriate, as e.g. some instances of ‘J. Lee’ may refer to ‘Joy Lee’, while others should be matched with ‘Jake Lee’. In addition to allowing such local merges, ASPEN+ offers new optimality criteria for selecting solutions, such as minimizing rule violations or maximizing the number of rules supporting a merge. Our main contributions are thus (1) the formalization and computational analysis of various notions of optimal solution, and (2) an extensive experimental evaluation on real-world datasets, demonstrating the effect of local merges and the new optimality criteria on both accuracy and runtime.

Advances in Logic-Based Entity Resolution: Enhancing ASPEN with Local Merges and Optimality Criteria / Xiang, Zhiliang; Bienvenu, Meghyn; Cima, Gianluca; Gutierrez-Basulto, Victor; Ibanez-Garcia, Yazmin. - (2025), pp. 659-669. ( 22nd International Conference on Principles of Knowledge Representation and Reasoning, KR 2025 Melbourne, Australia ) [10.24963/kr.2025/64].

Advances in Logic-Based Entity Resolution: Enhancing ASPEN with Local Merges and Optimality Criteria

Gianluca Cima;
2025

Abstract

We present ASPEN+, which extends an existing ASP-based system, ASPEN, for collective entity resolution with two important functionalities: support for local merges and new optimality criteria for preferred solutions. Indeed, ASPEN only supports so-called global merges of entity-referring constants (e.g. author ids), in which all occurrences of matched constants are treated as equivalent and merged accordingly. However, it has been argued that when resolving data values, local merges are often more appropriate, as e.g. some instances of ‘J. Lee’ may refer to ‘Joy Lee’, while others should be matched with ‘Jake Lee’. In addition to allowing such local merges, ASPEN+ offers new optimality criteria for selecting solutions, such as minimizing rule violations or maximizing the number of rules supporting a merge. Our main contributions are thus (1) the formalization and computational analysis of various notions of optimal solution, and (2) an extensive experimental evaluation on real-world datasets, demonstrating the effect of local merges and the new optimality criteria on both accuracy and runtime.
2025
22nd International Conference on Principles of Knowledge Representation and Reasoning, KR 2025
entity resolution; answer set programming
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Advances in Logic-Based Entity Resolution: Enhancing ASPEN with Local Merges and Optimality Criteria / Xiang, Zhiliang; Bienvenu, Meghyn; Cima, Gianluca; Gutierrez-Basulto, Victor; Ibanez-Garcia, Yazmin. - (2025), pp. 659-669. ( 22nd International Conference on Principles of Knowledge Representation and Reasoning, KR 2025 Melbourne, Australia ) [10.24963/kr.2025/64].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1768254
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact