Linkage of different data sources is an intermediate step in many statistical processes. When dealing with data resulting from a record linkage process, it should be considered that the linkage is affected by two types of errors: false links and missed matches. If the linkage errors are not properly taken into account, i.e. standard statistical procedures are applied to the linked data, biased estimates and mis-relationships between variables recorded in different sources may result. This paper provides a sensitivity analysis of the effect of linkage errors on the estimation of linear and logistic regressions. Different linkage scenarios are proposed, with various matching variables and accordingly different linkage error levels. The analysis confirms the importance of linkage errors and highlights the relevance of missed matches. The effectiveness of the proposed adjustment methods is demonstrated even when the conditions for their applicability are not fully satisfied, however a framework for taking into account the complexity of linkage procedures is needed

When adjusting for linkage errors: A sensitivity analysis / Tuoto, Tiziana; DI CONSIGLIO, Loredana. - In: STATISTICAL JOURNAL OF THE IAOS. - ISSN 1874-7655. - 34:(2018), pp. 589-597.

When adjusting for linkage errors: A sensitivity analysis

Tuoto Tiziana
;
Di Consiglio Loredana
2018

Abstract

Linkage of different data sources is an intermediate step in many statistical processes. When dealing with data resulting from a record linkage process, it should be considered that the linkage is affected by two types of errors: false links and missed matches. If the linkage errors are not properly taken into account, i.e. standard statistical procedures are applied to the linked data, biased estimates and mis-relationships between variables recorded in different sources may result. This paper provides a sensitivity analysis of the effect of linkage errors on the estimation of linear and logistic regressions. Different linkage scenarios are proposed, with various matching variables and accordingly different linkage error levels. The analysis confirms the importance of linkage errors and highlights the relevance of missed matches. The effectiveness of the proposed adjustment methods is demonstrated even when the conditions for their applicability are not fully satisfied, however a framework for taking into account the complexity of linkage procedures is needed
2018
Unbiased estimators; probabilistic record linkage; regression, data integration; linkage errors
01 Pubblicazione su rivista::01a Articolo in rivista
When adjusting for linkage errors: A sensitivity analysis / Tuoto, Tiziana; DI CONSIGLIO, Loredana. - In: STATISTICAL JOURNAL OF THE IAOS. - ISSN 1874-7655. - 34:(2018), pp. 589-597.
File allegati a questo prodotto
File Dimensione Formato  
Tuoto_When-adjusting_2018.pdf

accesso aperto

Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 169.67 kB
Formato Adobe PDF
169.67 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1608065
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? ND
social impact