Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage. 2011

Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
Department of Medical Informatics, Academic Medical Center, University of Amsterdam, 1100 DE Amsterdam, The Netherlands. m.tromp@amc.uva.nl

OBJECTIVE To gain insight into the performance of deterministic record linkage (DRL) vs. probabilistic record linkage (PRL) strategies under different conditions by varying the frequency of registration errors and the amount of discriminating power. METHODS A simulation study in which data characteristics were varied to create a range of realistic linkage scenarios. For each scenario, we compared the number of misclassifications (number of false nonlinks and false links) made by the different linking strategies: deterministic full, deterministic N-1, and probabilistic. RESULTS The full deterministic strategy produced the lowest number of false positive links but at the expense of missing considerable numbers of matches dependent on the error rate of the linking variables. The probabilistic strategy outperformed the deterministic strategy (full or N-1) across all scenarios. A deterministic strategy can match the performance of a probabilistic approach providing that the decision about which disagreements should be tolerated is made correctly. This requires a priori knowledge about the quality of all linking variables, whereas this information is inherently generated by a probabilistic strategy. CONCLUSIONS PRL is more flexible and provides data about the quality of the linkage process that in turn can minimize the degree of linking errors, given the data provided.

UI MeSH Term Description Entries
D008297 Male Males
D008498 Medical Record Linkage The creation and maintenance of medical and vital records in multiple institutions in a manner that will facilitate the combined use of the records of identified individuals. Record Linkage, Medical,Linkage, Medical Record,Linkages, Medical Record,Medical Record Linkages,Record Linkages, Medical
D012042 Registries The systems and processes involved in the establishment, support, management, and operation of registers, e.g., disease registers. Parish Registers,Population Register,Parish Register,Population Registers,Register, Parish,Register, Population,Registers, Parish,Registers, Population,Registry
D003627 Data Interpretation, Statistical Application of statistical procedures to analyze specific observed or assumed facts from a particular study. Data Analysis, Statistical,Data Interpretations, Statistical,Interpretation, Statistical Data,Statistical Data Analysis,Statistical Data Interpretation,Analyses, Statistical Data,Analysis, Statistical Data,Data Analyses, Statistical,Interpretations, Statistical Data,Statistical Data Analyses,Statistical Data Interpretations
D005260 Female Females
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D015233 Models, Statistical Statistical formulations or analyses which, when applied to data and found to fit the data, are then used to verify the assumptions and parameters used in the analysis. Examples of statistical models are the linear model, binomial model, polynomial model, two-parameter model, etc. Probabilistic Models,Statistical Models,Two-Parameter Models,Model, Statistical,Models, Binomial,Models, Polynomial,Statistical Model,Binomial Model,Binomial Models,Model, Binomial,Model, Polynomial,Model, Probabilistic,Model, Two-Parameter,Models, Probabilistic,Models, Two-Parameter,Polynomial Model,Polynomial Models,Probabilistic Model,Two Parameter Models,Two-Parameter Model
D015982 Bias Any deviation of results or inferences from the truth, or processes leading to such deviation. Bias can result from several sources: one-sided or systematic variations in measurement from the true value (systematic error); flaws in study design; deviation of inferences, interpretations, or analyses based on flawed data or data collection; etc. There is no sense of prejudice or subjectivity implied in the assessment of bias under these conditions. Aggregation Bias,Bias, Aggregation,Bias, Ecological,Bias, Statistical,Bias, Systematic,Ecological Bias,Outcome Measurement Errors,Statistical Bias,Systematic Bias,Bias, Epidemiologic,Biases,Biases, Ecological,Biases, Statistical,Ecological Biases,Ecological Fallacies,Ecological Fallacy,Epidemiologic Biases,Experimental Bias,Fallacies, Ecological,Fallacy, Ecological,Scientific Bias,Statistical Biases,Truncation Bias,Truncation Biases,Bias, Experimental,Bias, Scientific,Bias, Truncation,Biase, Epidemiologic,Biases, Epidemiologic,Biases, Truncation,Epidemiologic Biase,Error, Outcome Measurement,Errors, Outcome Measurement,Outcome Measurement Error

Related Publications

Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
March 2018, Journal of medical systems,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
August 2016, Revista de saude publica,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
June 2016, International journal of epidemiology,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
January 2016, Studies in health technology and informatics,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
June 2017, Journal of innovation in health informatics,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
April 2020, Journal of the American Medical Informatics Association : JAMIA,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
June 2018, Cadernos de saude publica,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
January 1995, Proceedings. Symposium on Computer Applications in Medical Care,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
August 2015, Journal of biomedical informatics,
Miranda Tromp, and Anita C Ravelli, and Gouke J Bonsel, and Arie Hasman, and Johannes B Reitsma
January 2013, Journal of public health dentistry,
Copied contents to your clipboard!