In this paper, we revisit the problem of entity resolution and propose a novel, logical framework, LACE, which mixes declarative and procedural elements to achieve a number of desirable properties. Our approach is fundamentally declarative in nature: it utilizes hard and soft rules to specify conditions under which pairs of entity references must or may be merged, together with denial constraints that enforce consistency of the resulting instance. Importantly, however, rule bodies are evaluated on the instance resulting from applying the already 'derived' merges. It is the dynamic nature of our semantics that enables us to capture collective entity resolution scenarios, where merges can trigger further merges, while at the same time ensuring that every merge can be justified. As the denial constraints restrict which merges can be performed together, we obtain a space of (maximal) solutions, from which we can naturally define notions of certain and possible merges and query answers. We explore the computational properties of our framework and determine the precise computational complexity of the relevant decision problems. Furthermore, as a first step towards implementing our approach, we demonstrate how we can encode the various reasoning tasks using answer set programming.

LACE: A Logical Approach to Collective Entity Resolution / Bienvenu, M.; Cima, Gianluca; Gutierrez-Basulto, V.. - (2022), pp. 379-391. (Intervento presentato al convegno ACM SIGMOD-SIGACT-SIGART Conference on Principles of Database Systems tenutosi a usa) [10.1145/3517804.3526233].

LACE: A Logical Approach to Collective Entity Resolution

Cima Gianluca
;
2022

Abstract

In this paper, we revisit the problem of entity resolution and propose a novel, logical framework, LACE, which mixes declarative and procedural elements to achieve a number of desirable properties. Our approach is fundamentally declarative in nature: it utilizes hard and soft rules to specify conditions under which pairs of entity references must or may be merged, together with denial constraints that enforce consistency of the resulting instance. Importantly, however, rule bodies are evaluated on the instance resulting from applying the already 'derived' merges. It is the dynamic nature of our semantics that enables us to capture collective entity resolution scenarios, where merges can trigger further merges, while at the same time ensuring that every merge can be justified. As the denial constraints restrict which merges can be performed together, we obtain a space of (maximal) solutions, from which we can naturally define notions of certain and possible merges and query answers. We explore the computational properties of our framework and determine the precise computational complexity of the relevant decision problems. Furthermore, as a first step towards implementing our approach, we demonstrate how we can encode the various reasoning tasks using answer set programming.
2022
ACM SIGMOD-SIGACT-SIGART Conference on Principles of Database Systems
answer set programming; collective entity resolution; complexity analysis; declarative framework; logical constraints
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
LACE: A Logical Approach to Collective Entity Resolution / Bienvenu, M.; Cima, Gianluca; Gutierrez-Basulto, V.. - (2022), pp. 379-391. (Intervento presentato al convegno ACM SIGMOD-SIGACT-SIGART Conference on Principles of Database Systems tenutosi a usa) [10.1145/3517804.3526233].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1678182
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 0
social impact