Linkage of different data sources is an intermediate step in many statistical processes. When dealing with data resulting from a record linkage process, it should be considered that the linkage is affected by two types of errors: false links and missed matches. If the linkage errors are not properly taken into account, i.e. standard statistical procedures are applied to the linked data, biased estimates and mis-relationships between variables recorded in different sources may result. This paper provides a sensitivity analysis of the effect of linkage errors on the estimation of linear and logistic regressions. Different linkage scenarios are proposed, with various matching variables and accordingly different linkage error levels. The analysis confirms the importance of linkage errors and highlights the relevance of missed matches. The effectiveness of the proposed adjustment methods is demonstrated even when the conditions for their applicability are not fully satisfied, however a framework for taking into account the complexity of linkage procedures is needed
When adjusting for linkage errors: A sensitivity analysis / Tuoto, Tiziana; DI CONSIGLIO, Loredana. - In: STATISTICAL JOURNAL OF THE IAOS. - ISSN 1874-7655. - 34:(2018), pp. 589-597.
When adjusting for linkage errors: A sensitivity analysis
Tuoto Tiziana
;Di Consiglio Loredana
2018
Abstract
Linkage of different data sources is an intermediate step in many statistical processes. When dealing with data resulting from a record linkage process, it should be considered that the linkage is affected by two types of errors: false links and missed matches. If the linkage errors are not properly taken into account, i.e. standard statistical procedures are applied to the linked data, biased estimates and mis-relationships between variables recorded in different sources may result. This paper provides a sensitivity analysis of the effect of linkage errors on the estimation of linear and logistic regressions. Different linkage scenarios are proposed, with various matching variables and accordingly different linkage error levels. The analysis confirms the importance of linkage errors and highlights the relevance of missed matches. The effectiveness of the proposed adjustment methods is demonstrated even when the conditions for their applicability are not fully satisfied, however a framework for taking into account the complexity of linkage procedures is neededFile | Dimensione | Formato | |
---|---|---|---|
Tuoto_When-adjusting_2018.pdf
accesso aperto
Tipologia:
Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
169.67 kB
Formato
Adobe PDF
|
169.67 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.