Skip to main content
Journal cover image

Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.

Publication ,  Journal Article
Guha, S; Reiter, JP
Published in: Journal of statistical planning and inference
March 2024

We consider causal inference for observational studies with data spread over two files. One file includes the treatment, outcome, and some covariates measured on a set of individuals, and the other file includes additional causally-relevant covariates measured on a partially overlapping set of individuals. By linking records in the two databases, the analyst can control for more covariates, thereby reducing the risk of bias compared to using only one file alone. When analysts do not have access to a unique identifier that enables perfect, error-free linkages, they typically rely on probabilistic record linkage to construct a single linked data set, and estimate causal effects using these linked data. This typical practice does not propagate uncertainty from imperfect linkages to the causal inferences. Further, it does not take advantage of relationships among the variables to improve the linkage quality. We address these shortcomings by fusing regression-assisted, Bayesian probabilistic record linkage with causal inference. The Markov chain Monte Carlo sampler generates multiple plausible linked data files as byproducts that analysts can use for multiple imputation inferences. Here, we show results for two causal estimators based on propensity score overlap weights. Using simulations and data from the Italy Survey on Household Income and Wealth, we show that our approach can improve the accuracy of estimated treatment effects.

Duke Scholars

Published In

Journal of statistical planning and inference

DOI

ISSN

0378-3758

Publication Date

March 2024

Volume

229

Start / End Page

106090

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Guha, S., & Reiter, J. P. (2024). Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files. Journal of Statistical Planning and Inference, 229, 106090. https://doi.org/10.1016/j.jspi.2023.07.004
Guha, Sharmistha, and Jerome P. Reiter. “Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.Journal of Statistical Planning and Inference 229 (March 2024): 106090. https://doi.org/10.1016/j.jspi.2023.07.004.
Guha S, Reiter JP. Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files. Journal of statistical planning and inference. 2024 Mar;229:106090.
Guha, Sharmistha, and Jerome P. Reiter. “Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.Journal of Statistical Planning and Inference, vol. 229, Mar. 2024, p. 106090. Epmc, doi:10.1016/j.jspi.2023.07.004.
Guha S, Reiter JP. Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files. Journal of statistical planning and inference. 2024 Mar;229:106090.
Journal cover image

Published In

Journal of statistical planning and inference

DOI

ISSN

0378-3758

Publication Date

March 2024

Volume

229

Start / End Page

106090

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 0104 Statistics