1 Introduction
Estimating causal exposure effects in observational studies demands a vast knowledge of the domain of application. For instance, to estimate the causal effect of an exposure on an outcome, the graphical framework to causality usually involves postulating a causal graph to identify an appropriate set of confounding variables [1]. Specifying such a graph can be difficult, especially in subject areas where prior knowledge is scarce or limited.
Investigators often bypass difficulties related to the identification and selection of confounders through the use of fully adjusted outcome regression models. Such models express the outcome variable as a function of the exposure variable and all available potential confounding variables. A fully adjusted outcome regression model is commonly assumed to yield an unbiased estimator of the true effect of the exposure. However, since such models likely contain more covariates than required, the variance of the regression coefficient for exposure may be unnecessarily large. Instead of using a fully adjusted model, model selection can be attempted.
Most classical statistical model selection approaches do not readily address causal effect estimation. One such approach is Bayesian model averaging (BMA) [2, 3]. BMA averages quantities of interest (e.g. a regression coefficient or the value of a future observation) over all possible models under consideration: in the average, each estimate is weighted by the posterior probability attributed to the corresponding model. When the goal is prediction, BMA accounts for the uncertainty associated with model choice and produces confidence intervals that have adequate coverage probabilities [4]. Unfortunately, BMA can perform poorly when used to estimate a causal effect of exposure [5, 6].
Wang et al. [6] suggested two novel approaches that modify BMA to specifically target causal effect estimation: Bayesian adjustment for confounding (BAC) and two-stage Bayesian adjustment for confounding (TBAC). Graph-based simulations presented in Wang et al. [6] show that the causal effect estimators of BAC and TBAC are unbiased in a variety of scenarios, hence supporting their adequacy for causal inference. A theoretical justification for the use of BAC for causal inference purposes is further discussed in Lefebvre, Atherton, and Talbot [7]. However, some simulations comparing BAC and TBAC to fully adjusted models show little difference in the variance of the causal effect estimators of each method [6, 8]. Moreover, the choice of BAC’s hyperparameter
In this paper we propose a new model averaging approach to causal inference: Bayesian causal effect estimation (BCEE). BCEE aims to unbiasedly estimate the causal effect of a continuous exposure on a continuous outcome, while being more efficient than a fully adjusted approach. With a sample of finite size, however, this is an ambitious objective. Hence, through a user-selected hyperparameter, BCEE enables an analyst to consider various degrees of trade-off between bias and variance for the estimator. While BCEE shares some similarities with TBAC, one distinctive feature of our approach is that its motivation lies in the graphical framework for causal inference (e.g. Pearl [1]).
The paper is structured as follows. In Section 2, we present the BCEE algorithm and discuss, in Section 3, a number of aspects of its practical implementation. We compare BCEE to some existing approaches for causal effect estimation in Section 4. In Section 5, we apply BCEE to a real dataset where we estimate the causal effect of mathematical perceived competence on the self-reported average in mathematics for highschool students in the province of Quebec. We conclude in Section 6 with a discussion of our results and provide suggestions for further research.
2 Bayesian causal effect estimation (BCEE)
Before presenting BCEE in Section 2.3, we first describe the modeling framework in Section 2.1 and provide a proposition and corollary concerning directed acyclic graphs (DAGs) in Section 2.2. The description of how the proposition and the corollary are used to develop BCEE is presented in Section 2.4. We conclude, in Section 2.5, with a toy example that sheds light on BCEE’s properties. Note that although we refer to BCEE as a Bayesian algorithm, strictly speaking, it is approximately Bayesian since it requires specifying prior distributions only for a subset of the parameters. To simplify the discussion, we motivate BCEE from a frequentist perspective.
2.1 Modeling framework
We consider estimating the causal effect of a continuous exposure on a continuous outcome. Let X be the random exposure variable, Y be the random outcome variable and
where
Consider G an assumed causal directed acyclic graph (DAG) compatible with the distribution of the observed covariates in G,
can also be interpreted as the average causal effect of X on Y. It can also be shown that outcome models adjusting for sets of pre-exposure covariates that at least include the direct causes of exposure are unbiased; BAC may be seen to be exploiting this feature [7]. Adjusting for the set of direct causes of X in the outcome model thus seems appealing since
BAC, TBAC and BCEE all rely on the fact that the set of direct causes of X is sufficient for estimating the causal effect and that this set of covariates can be identified from the data. A differentiating feature of BCEE is that it aims to disfavor outcome models that include one or more direct causes of X that are unnecessary to eliminate confounding. This is viewed as desirable since these variables generally increase the variance of
2.2 A motivation based on directed acyclic graphs
The results presented in this section are based on Pearl’s back-door criterion and are thus obtained from a graphical perspective to causality using directed acyclic graphs (DAGs). For a brief review of this framework, we refer the reader to the appendix of VanderWeele and Shpitser [10].
Proposition 2.1 presented below gives a sufficient condition to identify a set
Consider data compatible with a causal DAG G. Let
- 1)no descendants of X are in
and - 2)if for each
, either- (a)
or - (b)if
then Y and are d-separated by .
- (a)
Proof: see Appendix A.1.
Consider a
- 1)If
and Y are d-separated by then all back-door paths are blocked by . - 2)If in addition to 1.,
is sufficient to identify the average causal effect according to Proposition 2.1, then is also sufficient to identify the average causal effect of X on Y.
Proof: see Appendix A.2.
We now address how the proposition and the corollary are used in the linear regression setting presented in Section 2.1. First, Theorem 1.2.4 from Pearl [1] states the quasi-equivalence between d-separation and conditional independence. That is, unless a very precise tuning of parameters occurs, d-separation of Y and
2.3 The BCEE algorithm
BCEE is viewed as a BMA procedure where the prior distribution of the outcome model is informative and constructed by using estimates from earlier steps of the algorithm, including the exposure model. In this section, we introduce BCEE and define the aforementioned prior distribution. The connections between Proposition 2.1, Corollary 2.1 and BCEE’s prior distribution are discussed in Section 2.4.
We now define the outcome model using the same model averaging notation as in BAC and TBAC. Let
Given model (4) and a prior distribution
The first step in the construction of BCEE’s prior distribution
where
We are now ready to define
For vectors
where
With this additional notation, we define the hyperparameter
2.4 The rationale behind BCEE
In this section, we explain in detail how BCEE’s prior distribution
To begin, recall that the first step of BCEE serves to identify likely exposure models. Classical properties of Bayesian model selection ensure that the true (structural) exposure model, the one including only and all direct causes of X (
The algorithm BCEE aims to give the bulk of the posterior weight to outcome models in which
To illustrate how Proposition 2.1 and Corollary 2.1 motivate the formulation of
Suppose
Magnitudes of
| Situation | ||||||
| (1) | Excl. | Large | Not likely | Large | Close to 0 | Close to 0 |
| (2) | Incl. | Close to 0 | Likely | Close to 0 | Close to 0 | Close to 0 |
| (3) | Incl. | Large | Not likely | Large | Close to 1 | Depends |
| (4) | Excl. | Close to 0 | Likely | Close to 0 | Close to 1 | Depends |
Note:
Remark in Table 1 that in situations 3 and 4, where
2.5 A toy example
We consider a toy example to gain preliminary insights on the finite sample properties of BCEE. We generated a sample of size
with
The first step of BCEE is to calculate the posterior distribution of the exposure model
We approximate
Next, we compute the posterior distribution of the outcome model using
We get
Following the same process for the three other outcome models, we calculate the prior probabilities. From there, we calculate the posterior distribution of the outcome model using the relationship
Calculation of the BCEE outcome model posterior distribution with intermediate steps.
| Model | BIC | BMA | |||
| 0.0230 | 0.0229 | 1,435.82 | 0.4602 | 0.0254 | |
| 0.8658 | 0.8618 | 1,435.81 | 0.4629 | 0.9625 | |
| 0.0749 | 0.0746 | 1,440.04 | 0.0560 | 0.0101 | |
| 0.0409 | 0.0407 | 1,442.00 | 0.0209 | 0.0021 |
Note:
We see from these results how BCEE, as compared to BMA, shifts the posterior weight toward models that identify the causal effect of exposure. In fact, in this toy example, BCEE puts almost all the posterior weight on the true outcome model. BCEE accomplishes this by using an informative prior distribution for the outcome model that borrows information both from the exposure selection step and from neighboring regression coefficient estimates in the outcome models.
3 Practical considerations regarding BCEE
In this section we discuss practical considerations regarding the usage of the BCEE algorithm. First, we discuss the choice of the hyperparameter
3.1 Choice of
Recall that BCEE’s prior distribution
Assume that the true outcome model is a normal linear model of the form (1) and first consider the case
Consider the case
If
Recall that
By taking
3.2 Implementing BCEE
In this section, we first consider a naive implementation of BCEE that closely follows our presentation of the algorithm in Section 2.3. Then we describe a modified implementation that accounts for using the MLE
We perform three steps to sample one draw from the posterior distribution of the average causal exposure effect
- S1.Draw
from the posterior distribution of the exposure model , using to approximate ; - S2.Draw
from the conditional posterior distribution , where the regression coefficients are estimated by their MLEs and is approximated by ; - S3.Draw
from the conditional posterior distribution , which we approximate by its limit normal distribution [17, 18], where is the maximum likelihood estimator of and is its estimated standard error.
Because N-BCEE does not take into account the uncertainty related to the estimation of the regression coefficients
A-BCEE is the same as N-BCEE except for step S2. Recall that this step is directed at sampling from the conditional posterior distribution
Using simplified RP (10), it becomes an easy task to incorporate the variability associated with the estimation of the
The finite sample properties of N-BCEE and A-BCEE are studied and compared in some simulation scenarios presented in the next section. We also consider nonparametric bootstrap [22] in a few simple and small scale simulations as an alternative to A-BCEE to correct confidence intervals. Note that, due to computing time, this bootstrapped BCEE (B-BCEE) approach is considerably less practical than A-BCEE to evaluate in simulations and to apply to real data sets of moderate to large sizes.
4 Simulation studies
In this section, we study the finite sample properties of BCEE in various simulation scenarios. The first primary objective of the simulations is to compare BCEE to standard or related methods that are used to estimate total average causal effects of exposure. The second primary objective is to study the sensitivity of BCEE to the choice of its user-selected hyperparameter
To achieve the two main objectives, we examine 24 different simulation scenarios obtained by considering three factors: data-generating process (DGP1, DGP2, DGP3 and DGP4), sample size (200, 600 and 1,000) and true causal effect of exposure (
The first data-generating process (DGP1) satisfies the following relationships:
The second data-generating process (DGP2) involves a larger number of covariates than DGP1 and features an indirect effect of X on Y:
The third data-generating process (DGP3) is similar to the first simulation example in Wang et al. [6] but includes only 18 additional (noise) covariates (instead of 49):
The fourth data-generating process (DGP4) is inspired by a DAG presented in Morgan and Winship [23], Figure 1.1, page 25:
For each of the 24 simulation scenarios, we randomly generated 500 datasets. We estimated the average causal effect of exposure using 8 different procedures: (1) the true outcome model, (2) the fully adjusted model, (3) Bayesian model averaging (BMA) with a uniform prior distribution on the outcome model, (4) Bayesian adjustment for confounding (BAC) with
Comparison of estimates of
| n | Method | Mean | SDE | CP | ||
| 200 | True model | 0.100 | 0.045 | 0.047 | 0.047 | 94 |
| 200 | Fully adjusted model | 0.098 | 0.072 | 0.074 | 0.074 | 94 |
| 200 | BMA | 0.113 | 0.047 | 0.047 | 0.048 | 95 |
| 200 | BAC ( | 0.104 | 0.055 | 0.064 | 0.064 | 92 |
| 200 | BAC ( | 0.098 | 0.072 | 0.074 | 0.074 | 94 |
| 200 | TBAC ( | 0.098 | 0.072 | 0.074 | 0.074 | 94 |
| 200 | N-BCEE (c=100) | 0.108 | 0.051 | 0.055 | 0.056 | 93 |
| 200 | N-BCEE (c=500) | 0.104 | 0.055 | 0.062 | 0.062 | 92 |
| 200 | N-BCEE (c=1000) | 0.102 | 0.057 | 0.065 | 0.065 | 93 |
| 200 | A-BCEE (c=100) | 0.107 | 0.055 | 0.054 | 0.054 | 95 |
| 200 | A-BCEE (c=500) | 0.104 | 0.061 | 0.060 | 0.060 | 96 |
| 200 | A-BCEE (c=1000) | 0.103 | 0.063 | 0.063 | 0.063 | 96 |
| 600 | True model | 0.100 | 0.026 | 0.025 | 0.025 | 96 |
| 600 | Fully adjusted model | 0.100 | 0.041 | 0.039 | 0.039 | 97 |
| 600 | BMA | 0.111 | 0.027 | 0.027 | 0.029 | 94 |
| 600 | BAC ( | 0.105 | 0.031 | 0.035 | 0.035 | 95 |
| 600 | BAC ( | 0.100 | 0.041 | 0.039 | 0.039 | 97 |
| 600 | TBAC ( | 0.100 | 0.041 | 0.039 | 0.039 | 96 |
| 600 | N-BCEE (c=100) | 0.108 | 0.029 | 0.031 | 0.031 | 93 |
| 600 | N-BCEE (c=500) | 0.106 | 0.030 | 0.033 | 0.034 | 93 |
| 600 | N-BCEE (c=1000) | 0.105 | 0.031 | 0.034 | 0.034 | 93 |
| 600 | A-BCEE (c=100) | 0.108 | 0.030 | 0.030 | 0.031 | 95 |
| 600 | A-BCEE (c=500) | 0.105 | 0.033 | 0.032 | 0.032 | 97 |
| 600 | A-BCEE (c=1000) | 0.105 | 0.035 | 0.033 | 0.033 | 97 |
| 1,000 | True model | 0.101 | 0.020 | 0.020 | 0.020 | 95 |
| 1,000 | Fully adjusted model | 0.100 | 0.032 | 0.033 | 0.033 | 94 |
| 1,000 | BMA | 0.111 | 0.021 | 0.022 | 0.025 | 92 |
| 1,000 | BAC ( | 0.102 | 0.026 | 0.030 | 0.030 | 93 |
| 1,000 | BAC ( | 0.100 | 0.032 | 0.033 | 0.033 | 94 |
| 1,000 | TBAC ( | 0.100 | 0.032 | 0.033 | 0.033 | 94 |
| 1,000 | N-BCEE (c=100) | 0.107 | 0.022 | 0.024 | 0.025 | 94 |
| 1,000 | N-BCEE (c=500) | 0.105 | 0.023 | 0.026 | 0.026 | 94 |
| 1,000 | N-BCEE (c=1000) | 0.104 | 0.024 | 0.026 | 0.027 | 94 |
| 1,000 | A-BCEE (c=100) | 0.107 | 0.023 | 0.024 | 0.025 | 95 |
| 1,000 | A-BCEE (c=500) | 0.105 | 0.026 | 0.026 | 0.026 | 96 |
| 1,000 | A-BCEE (c=1000) | 0.104 | 0.027 | 0.027 | 0.027 | 96 |
Note: Mean is the mean estimated value of
Comparison of estimates of
| n | Method | Mean | SDE | CP | ||
| 200 | True model | 0.102 | 0.046 | 0.045 | 0.045 | 96 |
| 200 | Fully adjusted model | 0.104 | 0.075 | 0.078 | 0.078 | 94 |
| 200 | BMA | 0.148 | 0.044 | 0.046 | 0.067 | 68 |
| 200 | BAC ( | 0.118 | 0.052 | 0.075 | 0.077 | 76 |
| 200 | BAC ( | 0.103 | 0.073 | 0.077 | 0.077 | 95 |
| 200 | TBAC ( | 0.103 | 0.073 | 0.076 | 0.076 | 95 |
| 200 | N-BCEE (c=100) | 0.120 | 0.053 | 0.068 | 0.071 | 83 |
| 200 | N-BCEE (c=500) | 0.110 | 0.058 | 0.073 | 0.074 | 86 |
| 200 | N-BCEE (c=1000) | 0.107 | 0.060 | 0.074 | 0.074 | 88 |
| 200 | A-BCEE (c=100) | 0.120 | 0.062 | 0.066 | 0.069 | 92 |
| 200 | A-BCEE (c=500) | 0.112 | 0.067 | 0.071 | 0.072 | 95 |
| 200 | A-BCEE (c=1000) | 0.110 | 0.068 | 0.072 | 0.073 | 95 |
| 600 | True model | 0.100 | 0.026 | 0.026 | 0.026 | 96 |
| 600 | Fully adjusted model | 0.102 | 0.042 | 0.041 | 0.041 | 95 |
| 600 | BMA | 0.133 | 0.030 | 0.032 | 0.046 | 70 |
| 600 | BAC ( | 0.106 | 0.036 | 0.042 | 0.042 | 85 |
| 600 | BAC ( | 0.102 | 0.041 | 0.041 | 0.041 | 96 |
| 600 | TBAC ( | 0.102 | 0.041 | 0.041 | 0.041 | 96 |
| 600 | N-BCEE (c=100) | 0.114 | 0.032 | 0.037 | 0.040 | 86 |
| 600 | N-BCEE (c=500) | 0.108 | 0.034 | 0.039 | 0.039 | 91 |
| 600 | N-BCEE (c=1000) | 0.106 | 0.035 | 0.039 | 0.040 | 91 |
| 600 | A-BCEE (c=100) | 0.114 | 0.036 | 0.037 | 0.039 | 92 |
| 600 | A-BCEE (c=500) | 0.109 | 0.038 | 0.038 | 0.039 | 94 |
| 600 | A-BCEE (c=1000) | 0.107 | 0.039 | 0.039 | 0.039 | 94 |
| 1,000 | True model | 0.100 | 0.020 | 0.021 | 0.021 | 95 |
| 1,000 | Fully adjusted model | 0.100 | 0.032 | 0.032 | 0.032 | 95 |
| 1,000 | BMA | 0.121 | 0.024 | 0.027 | 0.034 | 80 |
| 1,000 | BAC ( | 0.100 | 0.029 | 0.031 | 0.031 | 92 |
| 1,000 | BAC ( | 0.099 | 0.032 | 0.032 | 0.031 | 95 |
| 1,000 | TBAC ( | 0.099 | 0.032 | 0.032 | 0.031 | 95 |
| 1,000 | N-BCEE (c=100) | 0.107 | 0.025 | 0.029 | 0.029 | 90 |
| 1,000 | N-BCEE (c=500) | 0.103 | 0.026 | 0.029 | 0.029 | 90 |
| 1,000 | N-BCEE (c=1000) | 0.102 | 0.026 | 0.030 | 0.030 | 91 |
| 1,000 | A-BCEE (c=100) | 0.108 | 0.028 | 0.028 | 0.029 | 93 |
| 1,000 | A-BCEE (c=500) | 0.104 | 0.029 | 0.029 | 0.029 | 94 |
| 1,000 | A-BCEE (c=1000) | 0.103 | 0.030 | 0.029 | 0.030 | 95 |
Note: Mean is the mean estimated value of
Comparison of estimates of
| n | Method | Mean | SDE | CP | ||
| 200 | True model | 0.103 | 0.100 | 0.100 | 0.100 | 95 |
| 200 | Fully adjusted model | 0.101 | 0.105 | 0.104 | 0.104 | 96 |
| 200 | BMA | 0.149 | 0.085 | 0.086 | 0.099 | 89 |
| 200 | BAC ( | 0.116 | 0.087 | 0.103 | 0.104 | 90 |
| 200 | BAC ( | 0.101 | 0.100 | 0.101 | 0.101 | 95 |
| 200 | TBAC ( | 0.102 | 0.101 | 0.101 | 0.101 | 95 |
| 200 | N-BCEE (c=100) | 0.113 | 0.093 | 0.100 | 0.101 | 93 |
| 200 | N-BCEE (c=500) | 0.106 | 0.096 | 0.101 | 0.101 | 94 |
| 200 | N-BCEE (c=1000) | 0.104 | 0.097 | 0.101 | 0.101 | 94 |
| 200 | A-BCEE (c=100) | 0.116 | 0.096 | 0.098 | 0.099 | 95 |
| 200 | A-BCEE (c=500) | 0.109 | 0.098 | 0.099 | 0.100 | 95 |
| 200 | A-BCEE (c=1000) | 0.108 | 0.099 | 0.100 | 0.100 | 95 |
| 600 | True model | 0.098 | 0.057 | 0.060 | 0.060 | 96 |
| 600 | Fully adjusted model | 0.098 | 0.058 | 0.061 | 0.061 | 96 |
| 600 | BMA | 0.138 | 0.054 | 0.061 | 0.072 | 80 |
| 600 | BAC ( | 0.104 | 0.054 | 0.065 | 0.065 | 87 |
| 600 | BAC ( | 0.097 | 0.058 | 0.060 | 0.060 | 96 |
| 600 | TBAC ( | 0.097 | 0.057 | 0.060 | 0.060 | 95 |
| 600 | N-BCEE (c=100) | 0.108 | 0.056 | 0.064 | 0.064 | 88 |
| 600 | N-BCEE (c=500) | 0.101 | 0.056 | 0.062 | 0.062 | 92 |
| 600 | N-BCEE (c=1000) | 0.100 | 0.056 | 0.061 | 0.061 | 92 |
| 600 | A-BCEE (c=100) | 0.111 | 0.057 | 0.063 | 0.064 | 90 |
| 600 | A-BCEE (c=500) | 0.104 | 0.057 | 0.062 | 0.062 | 92 |
| 600 | A-BCEE (c=1000) | 0.103 | 0.057 | 0.062 | 0.062 | 94 |
| 1,000 | True model | 0.098 | 0.044 | 0.043 | 0.043 | 96 |
| 1,000 | Fully adjusted model | 0.098 | 0.045 | 0.043 | 0.043 | 95 |
| 1,000 | BMA | 0.130 | 0.045 | 0.050 | 0.058 | 79 |
| 1,000 | BAC ( | 0.102 | 0.043 | 0.046 | 0.046 | 91 |
| 1,000 | BAC ( | 0.098 | 0.045 | 0.043 | 0.043 | 96 |
| 1,000 | TBAC ( | 0.098 | 0.044 | 0.043 | 0.043 | 96 |
| 1,000 | N-BCEE (c=100) | 0.106 | 0.044 | 0.048 | 0.048 | 91 |
| 1,000 | N-BCEE (c=500) | 0.101 | 0.044 | 0.045 | 0.045 | 93 |
| 1,000 | N-BCEE (c=1000) | 0.100 | 0.044 | 0.044 | 0.044 | 94 |
| 1,000 | A-BCEE (c=100) | 0.108 | 0.045 | 0.048 | 0.048 | 92 |
| 1,000 | A-BCEE (c=500) | 0.103 | 0.045 | 0.046 | 0.046 | 94 |
| 1,000 | A-BCEE (c=1000) | 0.102 | 0.045 | 0.045 | 0.045 | 94 |
Note: Mean is the mean estimated value of
Comparison of estimates of
| n | Method | Mean | SDE | CP | ||
| 200 | True model | 0.103 | 0.054 | 0.052 | 0.052 | 96 |
| 200 | Fully adjusted model | 0.105 | 0.072 | 0.068 | 0.068 | 95 |
| 200 | BMA | 0.119 | 0.060 | 0.054 | 0.057 | 96 |
| 200 | BAC ( | 0.110 | 0.061 | 0.062 | 0.063 | 95 |
| 200 | BAC ( | 0.103 | 0.072 | 0.068 | 0.068 | 95 |
| 200 | TBAC ( | 0.105 | 0.071 | 0.067 | 0.067 | 96 |
| 200 | N-BCEE (c=100) | 0.108 | 0.061 | 0.063 | 0.064 | 93 |
| 200 | N-BCEE (c=500) | 0.106 | 0.064 | 0.066 | 0.066 | 94 |
| 200 | N-BCEE (c=1000) | 0.105 | 0.065 | 0.066 | 0.066 | 95 |
| 200 | A-BCEE (c=100) | 0.110 | 0.066 | 0.062 | 0.062 | 96 |
| 200 | A-BCEE (c=500) | 0.107 | 0.068 | 0.064 | 0.064 | 96 |
| 200 | A-BCEE (c=1000) | 0.107 | 0.068 | 0.065 | 0.065 | 96 |
| 600 | True model | 0.099 | 0.031 | 0.031 | 0.031 | 95 |
| 600 | Fully adjusted model | 0.097 | 0.041 | 0.043 | 0.043 | 95 |
| 600 | BMA | 0.110 | 0.036 | 0.036 | 0.038 | 92 |
| 600 | BAC ( | 0.100 | 0.037 | 0.042 | 0.042 | 92 |
| 600 | BAC ( | 0.096 | 0.041 | 0.043 | 0.043 | 95 |
| 600 | TBAC ( | 0.096 | 0.041 | 0.042 | 0.043 | 94 |
| 600 | N-BCEE (c=100) | 0.102 | 0.036 | 0.040 | 0.040 | 92 |
| 600 | N-BCEE (c=500) | 0.099 | 0.037 | 0.041 | 0.041 | 92 |
| 600 | N-BCEE (c=1000) | 0.098 | 0.037 | 0.042 | 0.042 | 92 |
| 600 | A-BCEE (c=100) | 0.102 | 0.038 | 0.040 | 0.040 | 94 |
| 600 | A-BCEE (c=500) | 0.100 | 0.039 | 0.041 | 0.041 | 94 |
| 600 | A-BCEE (c=1000) | 0.099 | 0.040 | 0.041 | 0.041 | 94 |
| 1,000 | True model | 0.099 | 0.024 | 0.024 | 0.024 | 96 |
| 1,000 | Fully adjusted model | 0.099 | 0.032 | 0.032 | 0.032 | 95 |
| 1,000 | BMA | 0.107 | 0.028 | 0.029 | 0.030 | 92 |
| 1,000 | BAC ( | 0.100 | 0.029 | 0.032 | 0.032 | 91 |
| 1,000 | BAC ( | 0.098 | 0.032 | 0.032 | 0.032 | 94 |
| 1,000 | TBAC ( | 0.098 | 0.032 | 0.032 | 0.032 | 93 |
| 1,000 | N-BCEE (c=100) | 0.102 | 0.028 | 0.030 | 0.030 | 92 |
| 1,000 | N-BCEE (c=500) | 0.100 | 0.028 | 0.031 | 0.031 | 92 |
| 1,000 | N-BCEE (c=1000) | 0.100 | 0.029 | 0.032 | 0.031 | 92 |
| 1,000 | A-BCEE (c=100) | 0.102 | 0.030 | 0.031 | 0.031 | 94 |
| 1,000 | A-BCEE (c=500) | 0.101 | 0.030 | 0.031 | 0.031 | 94 |
| 1,000 | A-BCEE (c=1000) | 0.100 | 0.031 | 0.032 | 0.032 | 94 |
Note: Mean is the mean estimated value of
We start by discussing the results pertaining to non-BCEE methods for estimating the average causal effect of exposure. Then, we discuss the results for BCEE and contrast them to the former results.
As expected, Bayesian model averaging (BMA) can perform very poorly to estimate the average causal effect. More precisely, the simulation results show that the bias can be substantial when the most important confounding covariates are only slightly associated with the outcome (DGP2 and DGP3). For instance, in DGP2,
The simulation results also support the claim that BAC and TBAC with
The simulation results show that the choice of using
Despite sometimes producing slightly biased estimates, A-BCEE performs at least as well as BAC and TBAC with

Comparison of the distribution of
Citation: Journal of Causal Inference 3, 2; 10.1515/jci-2014-0035
On the basis of these results, we hypothesized that BCEE would perform best when (1) there are some direct causes of the exposure that are strongly associated with the exposure, and (2) there exists variables that can d-separate those direct causes from the outcome. In such situations, we expect BCEE to favor models excluding those direct causes and including the d-separating variables. To verify this, we simulated data according to a fifth data-generating scenario (DGP5) which meets these two conditions. The equations for DGP5 are:
Comparison of estimates of
| n | Method | Estimate | SDE | CP | ||
| 200 | True model | 0.103 | 0.053 | 0.054 | 0.054 | 92 |
| 200 | Fully adjusted model | 0.102 | 0.072 | 0.076 | 0.076 | 94 |
| 200 | BMA | 0.103 | 0.054 | 0.055 | 0.055 | 93 |
| 200 | BAC ( | 0.102 | 0.059 | 0.066 | 0.066 | 92 |
| 200 | BAC ( | 0.102 | 0.072 | 0.076 | 0.076 | 94 |
| 200 | TBAC ( | 0.102 | 0.072 | 0.076 | 0.076 | 95 |
| 200 | A-BCEE (c=100) | 0.103 | 0.055 | 0.056 | 0.056 | 93 |
| 200 | A-BCEE (c=500) | 0.103 | 0.059 | 0.059 | 0.059 | 94 |
| 200 | A-BCEE (c=1000) | 0.102 | 0.061 | 0.061 | 0.061 | 95 |
| 600 | True model | 0.099 | 0.031 | 0.029 | 0.029 | 96 |
| 600 | Fully adjusted model | 0.097 | 0.041 | 0.040 | 0.040 | 96 |
| 600 | BMA | 0.099 | 0.031 | 0.029 | 0.029 | 96 |
| 600 | BAC ( | 0.097 | 0.034 | 0.036 | 0.036 | 95 |
| 600 | BAC ( | 0.097 | 0.041 | 0.040 | 0.040 | 96 |
| 600 | TBAC ( | 0.097 | 0.041 | 0.040 | 0.040 | 96 |
| 600 | A-BCEE (c=100) | 0.098 | 0.031 | 0.030 | 0.030 | 96 |
| 600 | A-BCEE (c=500) | 0.098 | 0.033 | 0.030 | 0.030 | 97 |
| 600 | A-BCEE (c=1000) | 0.098 | 0.034 | 0.031 | 0.031 | 97 |
| 1,000 | True model | 0.100 | 0.024 | 0.023 | 0.023 | 95 |
| 1,000 | Fully adjusted model | 0.100 | 0.032 | 0.031 | 0.031 | 94 |
| 1,000 | BMA | 0.100 | 0.024 | 0.023 | 0.023 | 96 |
| 1,000 | BAC ( | 0.101 | 0.027 | 0.027 | 0.027 | 95 |
| 1,000 | BAC ( | 0.100 | 0.032 | 0.031 | 0.031 | 94 |
| 1,000 | TBAC ( | 0.100 | 0.032 | 0.031 | 0.031 | 95 |
| 1,000 | A-BCEE (c=100) | 0.100 | 0.024 | 0.023 | 0.023 | 96 |
| 1,000 | A-BCEE (c=500) | 0.100 | 0.025 | 0.023 | 0.023 | 96 |
| 1,000 | A-BCEE (c=1000) | 0.100 | 0.025 | 0.023 | 0.023 | 96 |
Note: Mean is the mean estimated value of
5 Application: estimation of the causal effect of perceived mathematical competence on grades in mathematics
In this section we use A-BCEE to estimate the causal effect of perceived competence in mathematics (measured on a scale from 1 to 7) on self-reported grades (in %) in mathematics. We consider longitudinal data obtained from 1,430 students during their first three years of highschool. Participants lived in various regions throughout Quebec, Canada. The data were collected by postal questionnaires every year for a period of three years (time 1, time 2 and time 3). Further details can be found in Guay et al. [24].
We used measures of perceived competence in mathematics at time 2 as the exposure and grades in mathematics at time 3 as the outcome to estimate the causal effect of interest. Recall that A-BCEE requires specifying a set of potential confounding covariates that includes all direct causes of the exposure and none of its descendants. Moreover, it is beneficial that this set also includes strong predictors of the outcome. We took advantage of the longitudinal feature of the data to build the set of potential confounding covariates. Because a cause always precedes its effect in time, we constructed the set of potential confounding covariates by including variables at time 1 that were potential direct causes of perceived-competence at time 2. We also included variables at time 2 that were thought to be strong predictors of grades in mathematics at time 3.
We selected the following 26 covariates: gender, highest level of education reached by the mother, highest level of education reached by the father, perceived competence in mathematics (at time 1), perceived autonomy support from the mother, perceived autonomy support from the father, perceived autonomy support from the mathematics teacher, perceived autonomy support from friends at school, self-reported mathematics’ grades, intrinsic motivation in mathematics, identified motivation in mathematics, introjected motivation in mathematics, externally regulated motivation in mathematics, victimization and sense of belonging to school. All variables except the first four were considered both at times 1 and 2.
Before applying A-BCEE on these data, we obtained some descriptive statistics. We drew scatter plots of the outcome versus the exposure and versus each potential confounding covariate to roughly verify the linearity assumption and to check for outliers. For the same reasons, we drew scatter plots of the exposure versus each potential confounding covariate. We also noticed that only 46.5% of the participants have complete information for all the selected covariates. The variables measured at time 1 have generally few missing cases (between 1.8% and 8.3%), but the variables measured at times 2 and 3 have a larger degree of missingness (between 26.4% and 36.4%). We performed multiple imputation [25] to account for the missing data, using 50 imputed datasets to ensure the power falloff is negligible [26].
We estimated the causal effect of perceived competence on grades in mathematics using the fully adjusted outcome model, A-BCEE with
Because Step S1 of A-BCEE aims to find the direct causes of the exposure, it is reasonable to only allow covariates measured before the exposure to be selected in this step. Hence, we ran the A-BCEE algorithm a second time, but this time excluding the possibility that covariates measured at time 2 enter the exposure model. We denote this implementation of A-BCEE as A-BCEE* in Table 8.
Comparison of the estimated causal effect of perceived mathematical competence in mathematics on self-reported mathematics’ grades.
| Method | Estimate | SEE | CI |
| Fully adjusted model | 0.693 | 0.460 | (–0.208, 1.594) |
| BAC ( | 0.729 | 0.462 | (–0.178, 1.635) |
| TBAC ( | 0.778 | 0.465 | (–0.133, 1.690) |
| A-BCEE (c=100) | 0.807 | 0.451 | (–0.076, 1.691) |
| A-BCEE (c=500) | 0.790 | 0.456 | (–0.105, 1.685) |
| A-BCEE (c=1000) | 0.786 | 0.459 | (–0.113, 1.685) |
| A-BCEE* (c=100) | 0.823 | 0.445 | (–0.049, 1.696) |
| A-BCEE* (c=500) | 0.808 | 0.444 | (–0.062, 1.679) |
| A-BCEE* (c=1000) | 0.803 | 0.444 | (–0.066, 1.673) |
Note: Estimate is the estimated causal effect, SEE is the standard error estimate, CI is a 95% confidence interval for the causal effect.
Table 8 shows that the results from A-BCEE and A-BCEE* are very similar. This is not surprising since the marginal posterior probability of inclusion of covariates do not differ much between A-BCEE and A-BCEE* (not shown). Using A-BCEE instead of the fully adjusted model slightly decreases the standard error of estimate, between 0.3% and 3.5%, which translates in a small decrease of the 95% confidence intervals’ width. Moreover the standard errors of estimate for BAC and TBAC are slightly larger than the one for the fully adjusted model in this illustration. Although the point estimates appear to vary substantially between methods, the differences are small relative to the magnitude of the estimated standard errors. We conclude that perceived competence in mathematics at one point in time likely has little or no causal effect on self-reported grades in mathematics a year later.
6 Discussion
We have introduced the Bayesian causal effect estimation (BCEE) algorithm to estimate causal exposure effects in observational studies. This novel data-driven approach avoids the need to rely on the specification of a causal graph and aims to control the variability of the estimator of the exposure effect. BCEE employs a prior distribution that is motivated by a theoretical proposition embedded in the graphical framework to causal inference. We also proposed a practical implementation of BCEE, A-BCEE, that accounts for the fact that this prior distribution uses information from the data. Using simulation studies, we found that A-BCEE generally achieves at least some reduction of the MSE of the causal effect estimator as compared to the MSE generated by a fully-adjusted model approach or by other data-driven approaches to causal inference, such as BAC and TBAC, thus resulting in estimates that are overall closer to the true value. In some circumstances, the reduction of the MSE can be substantial. Moreover, confidence intervals with appropriate coverage probabilities were obtained. Hence, we believe that BCEE is a promising algorithm to perform causal inference.
Some current limitations of BCEE could be addressed in future research. The generalization to non continuous exposure variable (e.g. binary) is straightforward. Recall that the first step of BCEE aims at identifying the direct causes of the exposure. As in the normal case we have considered, classical Bayesian procedures asymptotically select the true exposure model with probability 1 when assuming X belongs to an exponential family (e.g. Bernoulli) and that an adequate parametric model is considered [12]. The generalization of BCEE to other types of outcome variables is less straightforward. One could specify a generalized linear model for the outcome of the form
We think that BCEE can be particularly helpful to those working in fields where current subject-matter knowledge is sparse. To facilitate usage of the BCEE algorithm, we provide an R package named BCEE (available at
Dr. Talbot has been supported by doctoral scholarships from the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Fonds de recherche du Québec: Nature et Technologies. Dr. Lefebvre’s is a Chercheur-Boursier from the Fonds de recherche du Québec – Santé and is also supported by NSERC. Dr. Atherton’s research is supported by NSERC.
A Proofs
A.1 Proof of Proposition 2.1
Proof. First, we know from Pearl [1] section 3.3.1 that a set
Suppose that a direct cause
It remains to show that all back-door paths for which the second variable in the path is not a direct cause included in
The proof is complete by the back-door criterion as we realize that all back-door paths, whether their
A.2 Proof of Corollary 2.1
Proof. 1. Suppose that G admits some back-door paths of the form
2. To prove that
First, we consider the back-door paths that admit
Next, we divide the back-door paths that do not admit
The case where the second variable is a
Hence, all back-door paths between X and Y in G are blocked by
B General conditions for the equivalence of zero regression coefficient and conditional independence
We show that the independence of Y and
Consider the same normal linear model as in eq. (1)
Thus if
C The behavior of
In Figure 2 we examine how the term

Citation: Journal of Causal Inference 3, 2; 10.1515/jci-2014-0035
In Figure 2(A), we see that, for all sample sizes considered,
D Additional simulations
We now address the secondary objectives with additional simulations. The first secondary goal is to study the large, whilst finite, sample properties of BCEE. To do this, we examine four different simulation scenarios obtained by considering the four data-generating processes (DGP1, DGP2, DGP3 and DGP4) with a sample size of 10,000. Once again, for each scenario, we randomly generated 500 datasets. We estimated the average causal effect of exposure using A-BCEE and N-BCEE with
Estimates of
| DGP | Method | Mean | SDE | CP | ||
| 1 | N-BCEE (c=500) | 0.100 | 0.0068 | 0.0067 | 0.0067 | 97 |
| 1 | A-BCEE (c=500) | 0.100 | 0.0069 | 0.0066 | 0.0066 | 97 |
| 2 | N-BCEE (c=500) | 0.100 | 0.0074 | 0.0075 | 0.0075 | 96 |
| 2 | A-BCEE (c=500) | 0.100 | 0.0079 | 0.0075 | 0.0075 | 98 |
| 3 | N-BCEE (c=500) | 0.100 | 0.0140 | 0.0141 | 0.0141 | 95 |
| 3 | A-BCEE (c=500) | 0.100 | 0.0140 | 0.0141 | 0.0141 | 95 |
| 4 | N-BCEE (c=500) | 0.099 | 0.0083 | 0.0084 | 0.0084 | 96 |
| 4 | A-BCEE (c=500) | 0.099 | 0.0089 | 0.0086 | 0.0086 | 97 |
Note: DGP is the data-generating process, Mean is the mean estimated value of
The second secondary objective is to study the performance of B-BCEE to correct the confidence intervals of N-BCEE. Since this bootstrapped implementation is very computationally intensive, we only considered two simulation scenarios: DGP1 and DGP4 with a sample size of 200. In this case, only 100 datasets were generated for each scenario. We estimated the causal effect of exposure using the fully adjusted model, the true outcome model, BMA, BAC, TBAC, A-BCEE, N-BCEE and B-BCEE. For A-BCEE, N-BCEE and B-BCEE we took
Comparison of estimates of
| DGP | Method | Mean | SDE | CP | ||
| 1 | True model | 0.105 | 0.045 | 0.053 | 0.053 | 93 |
| 1 | Fully adjusted model | 0.104 | 0.072 | 0.075 | 0.075 | 94 |
| 1 | BMA | 0.121 | 0.048 | 0.050 | 0.054 | 93 |
| 1 | BAC ( | 0.104 | 0.072 | 0.075 | 0.075 | 94 |
| 1 | TBAC ( | 0.104 | 0.072 | 0.075 | 0.074 | 93 |
| 1 | N-BCEE (c=500) | 0.112 | 0.056 | 0.063 | 0.064 | 92 |
| 1 | A-BCEE (c=500) | 0.111 | 0.062 | 0.061 | 0.062 | 96 |
| 1 | B-BCEE (c=500, no bias corr.) | 0.112 | 0.067 | 0.063 | 0.064 | 96 |
| 1 | B-BCEE (c=500, w/bias corr.) | 0.107 | 0.067 | 0.066 | 0.066 | 96 |
| 4 | True model | 0.111 | 0.063 | 0.063 | 0.063 | 96 |
| 4 | Fully adjusted model | 0.106 | 0.072 | 0.064 | 0.064 | 96 |
| 4 | BMA | 0.120 | 0.060 | 0.051 | 0.055 | 96 |
| 4 | BAC ( | 0.105 | 0.072 | 0.064 | 0.064 | 96 |
| 4 | TBAC ( | 0.106 | 0.071 | 0.064 | 0.063 | 96 |
| 4 | N-BCEE (c=500) | 0.108 | 0.064 | 0.061 | 0.061 | 97 |
| 4 | A-BCEE (c=500) | 0.109 | 0.068 | 0.060 | 0.060 | 96 |
| 4 | B-BCEE (c=500, no bias corr.) | 0.108 | 0.071 | 0.061 | 0.061 | 97 |
| 4 | B-BCEE (c=500, w/bias corr.) | 0.107 | 0.071 | 0.062 | 0.062 | 98 |
Note: DGP is the data-generating process, Mean is the mean estimated value of
E Marginal posterior probabilities of inclusion of potential confounding covariates
Marginal posterior probability of inclusion of potential confounding covariate
| n | Method | |||||
| 200 | BMA | 0.11 | 0.11 | 1.00 | 0.18 | 1.00 |
| 200 | BAC ( | 0.35 | 0.35 | 1.00 | 0.41 | 1.00 |
| 200 | BAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 200 | TBAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 200 | N-BCEE (c=100) | 0.19 | 0.24 | 1.00 | 0.37 | 1.00 |
| 200 | N-BCEE (c=500) | 0.36 | 0.41 | 1.00 | 0.54 | 1.00 |
| 200 | N-BCEE (c=1000) | 0.44 | 0.49 | 1.00 | 0.61 | 1.00 |
| 200 | A-BCEE (c=100) | 0.29 | 0.35 | 1.00 | 0.44 | 1.00 |
| 200 | A-BCEE (c=500) | 0.51 | 0.56 | 1.00 | 0.63 | 1.00 |
| 200 | A-BCEE (c=1000) | 0.60 | 0.64 | 1.00 | 0.70 | 1.00 |
| 600 | BMA | 0.06 | 0.06 | 1.00 | 0.24 | 1.00 |
| 600 | BAC ( | 0.32 | 0.33 | 1.00 | 0.47 | 1.00 |
| 600 | BAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 600 | TBAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 600 | N-BCEE (c=100) | 0.11 | 0.15 | 1.00 | 0.44 | 1.00 |
| 600 | N-BCEE (c=500) | 0.22 | 0.30 | 1.00 | 0.60 | 1.00 |
| 600 | N-BCEE (c=1000) | 0.28 | 0.37 | 1.00 | 0.66 | 1.00 |
| 600 | A-BCEE (c=100) | 0.15 | 0.21 | 1.00 | 0.45 | 1.00 |
| 600 | A-BCEE (c=500) | 0.34 | 0.42 | 1.00 | 0.63 | 1.00 |
| 600 | A-BCEE (c=1000) | 0.44 | 0.51 | 1.00 | 0.70 | 1.00 |
| 1,000 | BMA | 0.05 | 0.04 | 1.00 | 0.33 | 1.00 |
| 1,000 | BAC ( | 0.38 | 0.37 | 1.00 | 0.61 | 1.00 |
| 1,000 | BAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 1,000 | TBAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 1,000 | N-BCEE (c=100) | 0.09 | 0.11 | 1.00 | 0.55 | 1.00 |
| 1,000 | N-BCEE (c=500) | 0.19 | 0.22 | 1.00 | 0.69 | 1.00 |
| 1,000 | N-BCEE (c=1000) | 0.25 | 0.28 | 1.00 | 0.74 | 1.00 |
| 1,000 | A-BCEE (c=100) | 0.12 | 0.15 | 1.00 | 0.54 | 1.00 |
| 1,000 | A-BCEE (c=500) | 0.30 | 0.34 | 1.00 | 0.70 | 1.00 |
| 1,000 | A-BCEE (c=1000) | 0.39 | 0.44 | 1.00 | 0.75 | 1.00 |
Marginal posterior probability of inclusion of potential confounding covariate
| n | Method | |||||||
| 200 | BMA | 0.14 | 0.12 | 0.20 | 0.18 | 0.31 | 0.10 | 0.10 |
| 200 | BAC ( | 0.44 | 0.41 | 0.48 | 0.18 | 0.32 | 0.12 | 0.11 |
| 200 | BAC ( | 1.00 | 1.00 | 1.00 | 0.22 | 0.40 | 0.15 | 0.15 |
| 200 | TBAC ( | 1.00 | 1.00 | 1.00 | 0.18 | 0.29 | 0.14 | 0.14 |
| 200 | N-BCEE (c=100) | 0.45 | 0.39 | 0.55 | 0.26 | 0.34 | 0.14 | 0.14 |
| 200 | N-BCEE (c=500) | 0.64 | 0.58 | 0.72 | 0.28 | 0.35 | 0.17 | 0.17 |
| 200 | N-BCEE (c=1000) | 0.71 | 0.66 | 0.78 | 0.29 | 0.36 | 0.18 | 0.18 |
| 200 | A-BCEE (c=100) | 0.64 | 0.56 | 0.66 | 0.19 | 0.30 | 0.13 | 0.13 |
| 200 | A-BCEE (c=500) | 0.79 | 0.73 | 0.81 | 0.19 | 0.30 | 0.14 | 0.14 |
| 200 | A-BCEE (c=1000) | 0.84 | 0.79 | 0.85 | 0.19 | 0.30 | 0.14 | 0.14 |
| 600 | BMA | 0.12 | 0.09 | 0.39 | 0.25 | 0.64 | 0.08 | 0.07 |
| 600 | BAC ( | 0.63 | 0.59 | 0.75 | 0.22 | 0.62 | 0.09 | 0.07 |
| 600 | BAC ( | 1.00 | 1.00 | 1.00 | 0.21 | 0.57 | 0.07 | 0.06 |
| 600 | TBAC ( | 1.00 | 1.00 | 1.00 | 0.19 | 0.53 | 0.09 | 0.08 |
| 600 | N-BCEE (c=100) | 0.37 | 0.26 | 0.73 | 0.28 | 0.65 | 0.09 | 0.08 |
| 600 | N-BCEE (c=500) | 0.56 | 0.45 | 0.84 | 0.29 | 0.63 | 0.11 | 0.09 |
| 600 | N-BCEE (c=1000) | 0.64 | 0.54 | 0.88 | 0.29 | 0.61 | 0.12 | 0.10 |
| 600 | A-BCEE (c=100) | 0.56 | 0.43 | 0.76 | 0.22 | 0.59 | 0.08 | 0.07 |
| 600 | A-BCEE (c=500) | 0.74 | 0.63 | 0.86 | 0.21 | 0.56 | 0.09 | 0.08 |
| 600 | A-BCEE (c=1000) | 0.80 | 0.70 | 0.90 | 0.20 | 0.56 | 0.09 | 0.08 |
| 1,000 | BMA | 0.12 | 0.08 | 0.55 | 0.33 | 0.82 | 0.06 | 0.06 |
| 1,000 | BAC ( | 0.69 | 0.66 | 0.86 | 0.28 | 0.75 | 0.06 | 0.06 |
| 1,000 | BAC ( | 1.00 | 1.00 | 1.00 | 0.25 | 0.71 | 0.05 | 0.05 |
| 1,000 | TBAC ( | 1.00 | 1.00 | 1.00 | 0.25 | 0.69 | 0.07 | 0.07 |
| 1,000 | N-BCEE (c=100) | 0.34 | 0.23 | 0.83 | 0.34 | 0.78 | 0.07 | 0.07 |
| 1,000 | N-BCEE (c=500) | 0.50 | 0.39 | 0.90 | 0.34 | 0.76 | 0.08 | 0.08 |
| 1,000 | N-BCEE (c=1000) | 0.58 | 0.47 | 0.92 | 0.34 | 0.75 | 0.08 | 0.08 |
| 1,000 | A-BCEE (c=100) | 0.50 | 0.37 | 0.84 | 0.28 | 0.75 | 0.06 | 0.07 |
| 1,000 | A-BCEE (c=500) | 0.69 | 0.57 | 0.91 | 0.27 | 0.72 | 0.07 | 0.07 |
| 1,000 | A-BCEE (c=1000) | 0.75 | 0.65 | 0.93 | 0.26 | 0.71 | 0.07 | 0.07 |
Marginal posterior probability of inclusion of potential confounding covariate
| n | Method | ||||
| 200 | BMA | 0.17 | 0.26 | 0.08 | 0.09 |
| 200 | BAC ( | 0.48 | 0.29 | 0.09 | 0.10 |
| 200 | BAC ( | 1.00 | 0.28 | 0.08 | 0.10 |
| 200 | TBAC ( | 1.00 | 0.30 | 0.14 | 0.15 |
| 200 | N-BCEE (c=100) | 0.57 | 0.32 | 0.14 | 0.16 |
| 200 | N-BCEE (c=500) | 0.75 | 0.34 | 0.17 | 0.19 |
| 200 | N-BCEE (c=1000) | 0.80 | 0.35 | 0.18 | 0.20 |
| 200 | A-BCEE (c=100) | 0.62 | 0.30 | 0.13 | 0.14 |
| 200 | A-BCEE (c=500) | 0.77 | 0.30 | 0.13 | 0.14 |
| 200 | A-BCEE (c=1000) | 0.83 | 0.30 | 0.13 | 0.15 |
| 600 | BMA | 0.29 | 0.50 | 0.07 | 0.05 |
| 600 | BAC ( | 0.75 | 0.53 | 0.07 | 0.06 |
| 600 | BAC ( | 1.00 | 0.52 | 0.07 | 0.05 |
| 600 | TBAC ( | 1.00 | 0.51 | 0.09 | 0.08 |
| 600 | N-BCEE (c=100) | 0.68 | 0.52 | 0.10 | 0.08 |
| 600 | N-BCEE (c=500) | 0.82 | 0.53 | 0.11 | 0.10 |
| 600 | N-BCEE (c=1000) | 0.87 | 0.53 | 0.12 | 0.10 |
| 600 | A-BCEE (c=100) | 0.68 | 0.51 | 0.09 | 0.07 |
| 600 | A-BCEE (c=500) | 0.81 | 0.51 | 0.09 | 0.08 |
| 600 | A-BCEE (c=1000) | 0.85 | 0.51 | 0.09 | 0.08 |
| 1,000 | BMA | 0.41 | 0.68 | 0.05 | 0.05 |
| 1,000 | BAC ( | 0.86 | 0.70 | 0.06 | 0.05 |
| 1,000 | BAC ( | 0.99 | 0.69 | 0.06 | 0.05 |
| 1,000 | TBAC ( | 1.00 | 0.68 | 0.07 | 0.07 |
| 1,000 | N-BCEE (c=100) | 0.76 | 0.69 | 0.07 | 0.07 |
| 1,000 | N-BCEE (c=500) | 0.88 | 0.69 | 0.08 | 0.08 |
| 1,000 | N-BCEE (c=1000) | 0.91 | 0.69 | 0.09 | 0.09 |
| 1,000 | A-BCEE (c=100) | 0.75 | 0.68 | 0.07 | 0.07 |
| 1,000 | A-BCEE (c=500) | 0.85 | 0.68 | 0.07 | 0.07 |
| 1,000 | A-BCEE (c=1000) | 0.89 | 0.68 | 0.07 | 0.07 |
Marginal posterior probability of inclusion of potential confounding covariate
| n | Method | ||||||
| 200 | BMA | 0.15 | 0.14 | 0.13 | 0.22 | 1.00 | 1.00 |
| 200 | BAC ( | 0.32 | 0.15 | 0.31 | 0.22 | 1.00 | 1.00 |
| 200 | BAC ( | 0.97 | 0.23 | 0.98 | 0.24 | 1.00 | 1.00 |
| 200 | TBAC ( | 0.87 | 0.23 | 0.99 | 0.25 | 1.00 | 1.00 |
| 200 | N-BCEE (c=100) | 0.36 | 0.17 | 0.36 | 0.29 | 1.00 | 1.00 |
| 200 | N-BCEE (c=500) | 0.53 | 0.22 | 0.57 | 0.32 | 1.00 | 1.00 |
| 200 | N-BCEE (c=1000) | 0.60 | 0.25 | 0.66 | 0.33 | 1.00 | 1.00 |
| 200 | A-BCEE (c=100) | 0.50 | 0.19 | 0.48 | 0.23 | 1.00 | 1.00 |
| 200 | A-BCEE (c=500) | 0.65 | 0.21 | 0.66 | 0.24 | 1.00 | 1.00 |
| 200 | A-BCEE (c=1000) | 0.70 | 0.21 | 0.73 | 0.24 | 1.00 | 1.00 |
| 600 | BMA | 0.14 | 0.10 | 0.08 | 0.35 | 1.00 | 1.00 |
| 600 | BAC ( | 0.44 | 0.16 | 0.41 | 0.29 | 1.00 | 1.00 |
| 600 | BAC ( | 0.99 | 0.24 | 1.00 | 0.27 | 1.00 | 1.00 |
| 600 | TBAC ( | 0.96 | 0.23 | 1.00 | 0.26 | 1.00 | 1.00 |
| 600 | N-BCEE (c=100) | 0.33 | 0.12 | 0.22 | 0.41 | 1.00 | 1.00 |
| 600 | N-BCEE (c=500) | 0.50 | 0.18 | 0.41 | 0.41 | 1.00 | 1.00 |
| 600 | N-BCEE (c=1000) | 0.58 | 0.21 | 0.50 | 0.41 | 1.00 | 1.00 |
| 600 | A-BCEE (c=100) | 0.48 | 0.16 | 0.32 | 0.31 | 1.00 | 1.00 |
| 600 | A-BCEE (c=500) | 0.67 | 0.19 | 0.53 | 0.29 | 1.00 | 1.00 |
| 600 | A-BCEE (c=1000) | 0.73 | 0.20 | 0.61 | 0.28 | 1.00 | 1.00 |
| 1,000 | BMA | 0.13 | 0.09 | 0.07 | 0.52 | 1.00 | 1.00 |
| 1,000 | BAC ( | 0.48 | 0.18 | 0.42 | 0.42 | 1.00 | 1.00 |
| 1,000 | BAC ( | 0.99 | 0.30 | 1.00 | 0.35 | 1.00 | 1.00 |
| 1,000 | TBAC ( | 0.99 | 0.30 | 1.00 | 0.34 | 1.00 | 1.00 |
| 1,000 | N-BCEE (c=100) | 0.30 | 0.12 | 0.19 | 0.57 | 1.00 | 1.00 |
| 1,000 | N-BCEE (c=500) | 0.45 | 0.19 | 0.36 | 0.56 | 1.00 | 1.00 |
| 1,000 | N-BCEE (c=1000) | 0.53 | 0.22 | 0.44 | 0.54 | 1.00 | 1.00 |
| 1,000 | A-BCEE (c=100) | 0.47 | 0.18 | 0.26 | 0.44 | 1.00 | 1.00 |
| 1,000 | A-BCEE (c=500) | 0.67 | 0.23 | 0.47 | 0.40 | 1.00 | 1.00 |
| 1,000 | A-BCEE (c=1000) | 0.73 | 0.25 | 0.56 | 0.38 | 1.00 | 1.00 |
Marginal posterior probability of inclusion of potential confounding covariate
| n | Method | |||||
| 200 | BMA | 0.11 | 0.12 | 0.12 | 0.11 | 1.00 |
| 200 | BAC ( | 0.42 | 0.42 | 0.42 | 0.42 | 1.00 |
| 200 | BAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 200 | TBAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 200 | A-BCEE (c=100) | 0.21 | 0.22 | 0.22 | 0.21 | 1.00 |
| 200 | A-BCEE (c=500) | 0.45 | 0.45 | 0.46 | 0.45 | 1.00 |
| 200 | A-BCEE (c=1000) | 0.55 | 0.55 | 0.56 | 0.55 | 1.00 |
| 600 | BMA | 0.08 | 0.07 | 0.08 | 0.08 | 1.00 |
| 600 | BAC ( | 0.42 | 0.41 | 0.42 | 0.42 | 1.00 |
| 600 | BAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 600 | TBAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 600 | A-BCEE (c=100) | 0.12 | 0.11 | 0.12 | 0.12 | 1.00 |
| 600 | A-BCEE (c=500) | 0.30 | 0.29 | 0.31 | 0.31 | 1.00 |
| 600 | A-BCEE (c=1000) | 0.40 | 0.40 | 0.41 | 0.41 | 1.00 |
| 1,000 | BMA | 0.06 | 0.07 | 0.06 | 0.06 | 1.00 |
| 1,000 | BAC ( | 0.41 | 0.41 | 0.41 | 0.40 | 1.00 |
| 1,000 | BAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 1,000 | TBAC ( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| 1,000 | A-BCEE (c=100) | 0.08 | 0.09 | 0.09 | 0.08 | 1.00 |
| 1,000 | A-BCEE (c=500) | 0.22 | 0.23 | 0.23 | 0.22 | 1.00 |
| 1,000 | A-BCEE (c=1000) | 0.32 | 0.33 | 0.33 | 0.32 | 1.00 |
F Comparison of the distribution of obtained from A-BCEE and BAC

Comparison of the distribution of
Citation: Journal of Causal Inference 3, 2; 10.1515/jci-2014-0035

Comparison of the distribution of
Citation: Journal of Causal Inference 3, 2; 10.1515/jci-2014-0035
References
- 1.↑
Pearl J. Causality: models, reasoning, and inference, 2nd ed. New York: Cambridge University Press, 2009.
- 2.↑
Hoeting JA, Madigan D, Raftery AE, Volinsky CT. Bayesian model averaging: a tutorial. Stat Sci 1999;14:382–401.
- 3.↑
Raftery AE, Madigan D, Hoeting JA. Bayesian model averaging for linear regression models. J Am Stat Assoc 1997;92:179–91.
- 4.↑
Raftery AE, Zheng Y. Discussion: performance of Bayesian model averaging. J Am Stat Assoc 2003;98:931–8.
- 5.↑
Crainiceanu CM, Dominici F, Parmigiani G. Adjustment uncertainty in effect estimation. Biometrika 2008;95:635–51.
- 6.↑
Wang C, Parmigiani G, Dominici F. Bayesian effect estimation accounting for adjustment uncertainty. Biometrics 2012;68:661–71.
- 7.↑
Lefebvre G, Atherton J, Talbot D. The effect of the prior distribution in the Bayesian adjustment for confounding algorithm. Comput Stat Data Anal 2014;70:227–40.
- 9.↑
Wang C, Parmigiani G, Dominici F. Rejoinder: Bayesian effect estimation accounting for adjustment uncertainty. Biometrics 2012;68:680–6.
- 10.↑
VanderWeele TJ, Shpitser I. A new criterion for confounder selection. Biometrics 2011;67:1406–13.
- 11.↑
Baba K, Shibata R, Sibuya M. Partial correlation and conditional correlation as measures of conditional independence. Aust NZ J Stat 2004;46:657–64.
- 12.↑
Haughton DMA. On the choice of a model to fit data from an exponential family. Ann Stat 1988;16:342–55.
- 14.↑
Clyde M. Subjective and objective Bayesian statistics, 2nd ed. S. James Press, New Jersey: Wiley-Interscience, 2003
- 15.↑
Yuan K-H, Chan W. Biases and standard errors of standardized regression coefficients. Psychometrika 2011;76:670–90.
- 17.↑
Dawid AP. On the limiting normality of posterior distributions. Proc Cambridge Philos Soc 1970;67:625–33.
- 18.↑
Walker AM. On the asymptotic behavior of posterior distributions. J R Stat Soc Ser B 1969;31:80–8.
- 19.↑
Madigan D, York J, Allard D. Bayesian graphical models for discrete data. Int Stat Rev 1995;63:215–32.
- 20.↑
Carlin BP, Gelfand AE. Approaches for empirical Bayes confidence intervals. J Am Stat Assoc 1990;85:105–14.
- 21.↑
Carlin BP, Louis TA. Bayes and empirical Bayes methods for data analysis, 2nd ed. Boca Raton: Chapman & Hall/CRC, 2000.
- 22.↑
Laird NM, Louis TA. Empirical Bayes confidence intervals based on bootstrap samples. J Am Stat Assoc 1987;82:739–50.
- 23.↑
Morgan SL, Winship C. Counterfactuals and causal inference: methods and principles for social research. New York: Cambridge University Press, 2007
- 24.↑
Guay F, Larose S, Ratelle C, Sénécal C, Vallerand RJ, Vitaro F (2011): Mes amis, mes parents et mes professeurs: Une analyses comparée de leurs effets respectifs sur la motivation, la réussite, l‘orientation et la persévérance scolaires. Technical Report 2007-PE-118485, Fonds de recherche du Québec - Société et culture, Québec.
- 25.↑
Little RJ, Rubin DB. Statistical analysis with missing data, 2nd ed. New Jersey: John Wiley & Sons, 2002.
- 26.↑
Graham JW, Olchowski AE, Gilreath TD. How many imputations are really needed? Some practical clarifications of multiple imputation theory. Prev Sci 2007;8:206–13.
- 27.↑
Talbot D, Lefebvre G, Atherton J (2014): BACprior: Choice of the Hyperparameter Omega in the Bayesian Adjustment for Confounding (BAC) Algorithm. Available at: http://CRAN.R-project.org/package=BACprior, R package version 2.0.
- 28.↑
Greenland S, Robins JM, Pearl J. Confounding and collapsibility in causal inference. Stat Sci 1999;14:29–46.




