Methods for improving covariate balance in observational studies

Abstract: This thesis contributes to the field of causal inference, where the main interest is to estimate the effect of a treatment on some outcome. At its core, causal inference is an exercise in controlling for imbalance (differences) in covariate distributions between the treated and the controls, as such imbalances otherwise can bias estimates of causal effects. Imbalance on observed covariates can be handled through matching, where treated and controls with similar covariate distributions are extracted from a data set and then used to estimate the effect of a treatment.The first paper of this thesis describes and investigates a matching design, where a data-driven algorithm is used to discretise a covariate before matching. The paper also gives sufficient conditions for if, and how, a covariate can be discretised without introducing bias.Balance is needed for unobserved covariates too, but is more difficult to achieve and verify. Unobserved covariates are sometimes replaced with correlated counterparts, usually referred to as proxy variables. However, just replacing an unobserved covariate with a correlated one does not guarantee an elimination of, or even reduction of, bias. In the second paper we formalise proxy variables in a causal inference framework and give sufficient conditions for when they lead to nonparametric identification of causal effects.The third and fourth papers both concern estimating the effect an enhanced cooperation between the Swedish Social Insurance Agency and the Public Employment Service has on reducing sick leave. The third paper is a study protocol, where the matching design used to estimate this effect is described. The matching was then also carried out in the study protocol, before the outcome for the treated was available, ensuring that the matching design was not influenced by any estimated causal effects. The third paper also presents a potential proxy variable for unobserved covariates, that is used as part of the matching. The fourth paper then carries out the analysis described in the third paper, and uses an instrumental variable approach to test for unobserved confounding not captured by the supposed proxy variable.

  CLICK HERE TO DOWNLOAD THE WHOLE DISSERTATION. (in PDF format)