Abstract
While child and adolescent obesity is a serious public health concern, few studies have utilized parameters based on the causal inference literature to examine the potential impacts of early intervention. The purpose of this analysis was to estimate the causal effects of early interventions to improve physical activity and diet during adolescence on body mass index (BMI), a measure of adiposity, using improved techniques. The most widespread statistical method in studies of child and adolescent obesity is multivariable regression, with the parameter of interest being the coefficient on the variable of interest. This approach does not appropriately adjust for time-dependent confounding, and the modeling assumptions may not always be met. An alternative parameter to estimate is one motivated by the causal inference literature, which can be interpreted as the mean change in the outcome under interventions to set the exposure of interest. The underlying data-generating distribution, upon which the estimator is based, can be estimated via a parametric or semi-parametric approach. Using data from the National Heart, Lung, and Blood Institute Growth and Health Study, a 10-year prospective cohort study of adolescent girls, we estimated the longitudinal impact of physical activity and diet interventions on 10-year BMI z-scores via a parameter motivated by the causal inference literature, using both parametric and semi-parametric estimation approaches. The parameters of interest were estimated with a recently released R package, ltmle, for estimating means based upon general longitudinal treatment regimes. We found that early, sustained intervention on total calories had a greater impact than a physical activity intervention or non-sustained interventions. Multivariable linear regression yielded inflated effect estimates compared to estimates based on targeted maximum-likelihood estimation and data-adaptive super learning. Our analysis demonstrates that sophisticated, optimal semiparametric estimation of longitudinal treatment-specific means via ltmle provides an incredibly powerful, yet easy-to-use tool, removing impediments for putting theory into practice.
Introduction
Adolescent obesity has become a major concern in public health due to its increasing prevalence [1, 2]. Overweight and obese adolescents often become overweight and obese adults, and are at increased risk for chronic diseases such as heart disease, diabetes and cancer [3–6]. Physical activity and diet are established determinants of obesity development [7–11]. At a physiological level, obesity occurs when there is energy imbalance, specifically when energy intake exceeds energy expenditure. Energy needs vary considerably for the growing child/adolescent, and there is considerable variability in energy intake and expenditure patterns throughout childhood and adolescence.
Multivariable linear or logistic regression has often been used to model the effect of energy intake and expenditure patterns on body mass index (BMI) or other indicators of obesity using observational data, and the parameter of interest is the coefficient attached to certain variables [8]. An alternative approach, which can utilize data adaptive estimation, is based upon the so-called G-computation formula, from which a substitution-type estimator can be derived [12, 13]. In the longitudinal settings, G-computation is an identifiability result derived from the sequential randomization assumption implied by a causal graph or the nonparametric structural equation (NPSEM) model of the data-generating mechanism [12, 14]. The parameters of interest returned by these substitution estimators are chosen so that they can be interpreted, under assumptions, of the treatment (or exposure) effects one typically estimates in a randomized controlled trial appropriately adjusted for time-dependent confounders. Since these estimators are functions of portions of the data-generating distribution (such as conditional means), rather than byproducts of a parametric regression model, they put little constraints on the form of the data-generating distribution. Thus, one can use data-adaptive (machine learning), flexible algorithms in order to search among a large class of models to find the one that fits the model optimally. If this fit, according to some loss function, is estimated in an unbiased way (using cross-validation), then one can derive a better fit to the data than say an arbitrary linear regression model, and so derive estimates of the parameter that both have less bias, and lower variance. In addition, because the model selection is done to maximize the fit (minimize the cross-validated risk) of the prediction models, and not targeted towards the parameter of interest, significant reductions in bias are possible via an augmentation to the estimated statistical model based on targeted maximum likelihood estimation (TMLE) [14].
The purpose of this article was to quantify the longitudinal effects of energy intake and expenditure patterns on BMI in adolescence using data from the National Heart, Lung, and Blood Institute Growth and Health Study (NGHS), a 10-year prospective cohort study of preadolescent white and African American girls designed to assess factors associated with the onset and development of obesity [15]. In particular, we assessed the impact of early interventions on physical activity level and caloric intake on BMI by estimating parameters defined by the G-computation formula. Our objectives were to compare, in this setting of longitudinal studies of obesity, the use of standard regression models for estimating the relevant components of the data-generating distribution with semi-parametric, data-adaptive methods and discuss practical implementation of latter methods using the ltmle R package [42].
Methods
Data structure
For this analysis we used a subset of the NGHS data consisting of the participants recruited by the University of California at Berkeley center, one of three recruitment sites for NGHS (n = 530). The participants were 9–10 years of age at study entry and followed for 10 years; 887 girls enrolled at baseline. The study collected anthropometric measurements annually and an extensive set of variables potentially relevant to weight gain, including physical, behavioral, socioeconomic and mental health factors such as pubertal maturation stage, diet, physical activity, parental education, and perceived self-worth [10, 15–20].
The time-dependent outcome variable of interest was BMI-for-age z-score, which indicates BMI relative to other girls of the same age on a standard deviation scale [21]. We focused on physical activity and total calories as the exposures/potential interventions of interest, with the hypotheses that increasing physical activity would result in a lower 10-year BMI and that increasing total calories would have the opposite effect. Physical activity was measured using a Habitual Activity Questionnaire that was adapted from a questionnaire developed by Ku et al. (1981) [22], and compared against two other assessment methods [18]. Total calories were estimated from 3-day food diaries [7]. Potential confounding variables were selected based primarily upon previously reported associations with BMI [10, 16–24]. These included baseline race (white or African American), and the time-dependent variables pubertal maturation stage (four levels: prepuberty, early maturity, midpuberty, maturity), number of hours of television watched per week, perceived stress scale [23], global self-worth score (an indicator of self-esteem measured using a Harter’s Self Perception Profile for Children) [18, 24], as well as the outcome, BMI, at previous measurements.
We focused on three time points after enrollment: years 0 (age 9–10 years), 5 and 10 (19–20 years); selection of these time points was based on preliminary work indicating that these were the most relevant for capturing BMI trajectories. We restricted the sample to the subset of participants who had total calories, physical activity, and BMI z-score measured at years 0, 5, and 10 (n=530) since this is the level of resolution such that there would be relatively little missing data. However, even at this resolution, there were still missing data for some of the time-varying covariates (confounders), and we imputed missing values by using a local average of the years around the time point of interest within the subject, rather than omitting the girls with missing values completely. This did not appreciably change the overall summary statistics for the covariates when compared to a complete case analysis.
Parameter of interest
The research question we endeavored to answer concerned the potential causal effect of intervening to set longitudinal profiles of physical activity level or caloric intake amount on BMI z-score at year 10 of the study, when the girls were aged 19–20 years. An ideal experiment that would answer this question would be to randomize a cohort of girls at baseline to each potential longitudinal pattern of physical activity and diet, then follow up with the girls, ensuring perfect adherence and no attrition. The causal inference estimation framework makes transparent the identifiability assumptions necessary to estimate such parameters from observational data. The structural causal model (SCM) reflecting our belief about the time-ordering and relationships between the exposure, covariates, and outcome of interest was
where the subscripts denote the three time points of interest (years 0, 5 and 10), Aj, denotes the exposure of interest (physical activity or total calories) within time interval j, Lj are the vector of time-varying confounders (variables that affect both future exposure and the outcome of interest, including intervening measurements of BMI) within interval j,
Intervening on the exposure of interest at a certain time point corresponds to deterministically setting
where the (1) notation represents counterfactuals indexed by specific treatment regimes, a. Once the SCM is specified, the next step is to specify the counterfactuals indexed by interventions on
Identifiability assumptions – G-computation
In order to estimate the marginal distribution of different counterfactuals from observed data, identifiability assumptions must be asserted. In longitudinal data, one such assumption, the sequential randomization assumption [14, 27] corresponds to
,
where Parents indicates all preceding measured variables, and in this case
This result is the longitudinal G-computation formula, which allows specification of parameters of the observed data distribution. The parameters we specified depended on the different intervention patterns contained in
where
In order for one to be able to estimate these causal parameters, there needs to be sufficient natural experimentation of the variable of interest (the Aj) within groups defined by the history of covariates,
The substitution estimator is then given by
This estimator requires the estimates of the relevant regressions and joint distribution of intermediates, given the past. Typically, one derives the mean via time-sequential simulation of the L-process, setting the treatment history to the desired rule. Thus, the standard formulation of this substitution estimator requires often challenging estimation, since it requires estimation of joint densities.
Van der Laan and Gruber [28] present a different representation of eq. (3) that avoids having to estimate the joint conditional density of intermediate variables, e.g. the
,
where
The substitution estimator, following eq. (6), starts with estimating the innermost conditional expectation,
Steps of defining parameter and estimate in the ICE formulation
Step | Definition of parameter in ICE formulation | Estimation step in the obesity study |
1 | Define the conditional expectation of the outcome with covariate history set to observed, and the treatment history to the desired intervention, or: | Regress the final BMI z-score, |
2 | Take the expectation of | Regress |
3 | Repeat to define | Regress |
4 | The parameter of interest is: | Derive the estimate of the treatment-specific mean by taking an average of the |
Because the
Estimation via SuperLearner
Given little theory to guide one on the functional form of the regressions that comprise our ICE estimator, and the impossibility of nonparametric estimation because of the high dimension of the predictors, a potentially consistent approach requires a data-adaptive, semiparametric modeling method. Traditional approaches would fit the regressions based on a potential simplification (dimension reduction) of the predictors (e.g. the conditional means given the entire history are only functions of the most recent history), and/or the assumption of a simple parametric forms (e.g. regression with only main effects). Because doing so will almost certainly result in a misspecified model, such arbitrary (non-targeted) simplifications will result in an arbitrarily biased estimator. Thus, we used the data-adaptive, ensemble machine learning algorithm known as the SuperLearner (SL) [29]. The SL takes a library of algorithms and uses cross-validation to create a convex combination of the algorithms with weights based on the ability of each algorithm to minimize the cross-validated risk (expected value of the user-supplied loss function) in the cross-validation procedure. The optimality properties of cross-validation and super learning are detailed in van der Laan and Polley [29] and van der Laan et al. [30]. If k algorithms are competitors, the Oracle Selector is the one that chooses the algorithm that minimizes the true risk as if the true underlying distribution of the data was known. The Oracle Inequality suggests that the number of candidates algorithms considered can be polynomial in size without hindering the performance of SL. Thus, many algorithms can be considered and the combination will perform asymptotically as well as the Oracle selector. Therefore, it is a good choice for estimating the required models since one can include both more agnostic algorithms, and particular models favored by the researchers. Although sample size might indicate a variance-bias trade-off towards a smaller model (and thus more biased model), theory indicates that this trade-off will be in an optimal direction.
Since the SL is an ensemble learner, the candidate algorithms from which it chooses and utilizes in the combination are not indicated. We used the SL package available in R [31] and our library of candidate learners included generalized linear models (GLM), generalized additive models (GAM) [32], Bayesian generalized linear models (bayesGLM) [33], generalized linear models using coordinate-wise descent (glmnet) [34, 35], and the mean function. In addition, one must specify the number of splits (folds); 10-fold cross-validation was used to estimate the final model.
Targeted maximum likelihood estimation
The data-adaptive fitting of the models that make up Step 1 above are targeted to optimizing the ability of models to predict the outcome, and not towards estimating the parameter. One can improve the estimator by “targeting” these fits towards the parameter of interest. Specifically, we used Targeted Maximum-Likelihood Estimation (TMLE), a two-stage estimator that augments the estimated models that comprise the ICE-inspired algorithm discussed in Table 1. This results in a bias-reduction step, which will remove residual bias relative to the substitution estimator if the treatment mechanism can be estimated consistently [14]. The TMLE augmentation [28] involves adding a so-called clever covariates to each of these regressions, which requires estimation of the treatment mechanism. Specifically, for each time point, it requires estimates of the probability of being in the treatment group (e.g. high physical activity) given the past (that is, all past covariates, treatments, and outcomes) [36].
Specifically, we use the same formulation as discussed above, but now based on
where
As discussed above, one of the virtues of TMLE over the initial substitution estimator is that it reduces bias due to the fact that the original statistical models are chosen based on minimizing expected loss with regards to prediction of the outcomes (intermediate and final), and not with regards to minimizing the mean-squared-error of the estimate of the parameter of interest. Relatedly, the TMLE estimate is consistent if either the original models or the treatment/censoring models are consistently estimated (doubly-robust). If they both are consistently estimated, the estimator is semi-parametrically (locally) efficient. Finally, the TMLE can be thought of a smoothing of the original substitution estimator, and thus can have more predictable sampling distributional properties; it is an asymptotically linear estimator with a known influence function, and this can be used to derive robust asymptotic inference [14]. Thus, one can derive an approximate SE by simply taking the sample standard deviation of the estimated IC of a subject divided by the sample size, or
The required calculations would seem a daunting task to code, and thus the effort required could inhibit taking such an approach. However, an R package, ltmle, provides a one-stop, user-friendly implementation of the ICE, TMLE estimator. One can either use the defaults available for estimating the prediction and treatment models, or one can specify particular algorithms, including SL with an associated library of learners. Though not discussed here, the package incorporates the possibility of missing data and not just fixed treatments of interest, but treatment rules. The package also returns, in addition to the TMLE estimate, the ICE based on non-augmented model, the so-called inverse probability of treatment weighted (IPTW) estimators, as well as a “naïve” estimate based on no dependent confounding. Finally, the package will also return estimates of marginal structural models if one wants to model the treatment-specific mean as a potentially smooth function of the history of intervention (e.g. the total number of time intervals with treatment). Thus, the powerful ltmle packages makes estimating a relatively complex suite of estimators amazingly straightforward, which opens the door to estimation of targeted casual parameters based on potentially complex interventions to a much wider audience.
Data analysis results
Characteristics of the sample of NGHS participants at the three time points in the analysis are provided in Table 2. All participants were aged 9 or 10 at Year 0. There was a noticeable decrease in physical activity at Year 10. Additionally, by the end of the study, all participants had reached pubertal maturation. Table 3 contains the numbers and percent of individuals who followed the relevant patterns of interest in total calories and physical activity.
Characteristics of the sample of NGHS participants (N = 530) at three time points
Variable | Year 0 | Year 5 | Year 10 |
mean (SD1) | mean (SD) | mean (SD) | |
BMI z-score | 0.41 (1.1) | 0.72 (1.0) | 0.53 (1.1) |
Total calories (kcal) | 1,953 (607) | 1,894 (699) | 1,972 (717) |
Physical activity (METS-times/wk) | 29.1 (17.9) | 19.7 (14.1) | 11.4 (17.2) |
Hours of TV/video per week | 33.0 (18.4) | 35.5 (20.6) | 30.6 (24.4) |
Perceived stress2 | 25.3 (6.8) | 24.0 (6.4) | 23.4 (8.0) |
Global self-worth score3 | 3.1 (0.6) | 3.1 (0.7) | 3.2 (0.6) |
N (%) | N (%) | N (%) | |
Race | |||
White | 270 (51%) | ||
Black | 262 (49%) | – | – |
Pubertal stage | |||
Prepuberty | 334 (63%) | 0 (0%) | 0 (0%) |
Early maturity | 172 (32%) | 32 (6%) | 0 (0%) |
Midpuberty | 24 (5%) | 400 (75%) | 0 (0%) |
Maturity | 0 (0%) | 98 (18%) | 530 (100%) |
The results of estimation of the marginal means of interest by four estimators are in: a “naïve” estimator, and a series of ICE substitution estimators based on different models for the relevant regressions: (a) based on multivariate regressions with only main effect terms, (b) on SL fits, and (c) based on augmented SL fits, or TMLE. We fit these to compare, in order, an estimators that (1) simply assumed no confounding at all, but in that context makes no modeling assumptions (naïve), (2) adjusts for confounding, but in a parametric model (MTLR for multivariable linear regression), (3) adjusts for confounding in a much bigger (and therefore less biased) statistical model, but where the estimator is not targeted towards the particular parameter of interest (and thus, contains no “smoothing” adjustment of the estimate of the data-generating distribution towards that parameter – SL), and (4) is targeted for estimation in a large (semiparametric) model and, but specifically for the parameter of interest (TMLE). Statistical inference was based on the influence curve for the TMLE estimators, but on the nonparametric bootstrap for all others (we also re-did the inference for one of the TMLE estimators using the bootstrap as a check on the IC-based inference).
Number and percent following the early interventions on physical activity and total calories
1,1,1 | 1, 0, 0 | 0, 0, 1 | 0, 0, 0 | |
Physical activity | 102 (19%) | 132 (25%) | 10 (2%) | 45 (8%) |
Energy intake | 130 (25%) | 63 (12%) | 35 (7%) | 54 (10%) |
There are often substantial differences between the adjusted estimates based on the SL (TMLE and SL) and the, naïve, suggesting that these are confounded by the included covariates. For example, the naive point estimate of E(Y(0,0,1)) is very different from the estimates generated from the SL and TMLE estimators (Table 4). The point estimates serve as the inputs for the various intervention comparisons (4) shown in Table 5. One can see some significant differences between the simple substitution estimator based on parametric regression and that based on SL (the equivalent substitution estimator and TMLE). For instance, for physical activity, the estimate of

Nonparametric bootstrap distribution of TMLE estimator of
Comparison of the point estimates for each intervention using 4 different estimators
Intervention | Estimation method | Intervention | |
Physical activity | Energy intake | ||
E(Y(0,0,0)) | NAÏVE | 0.747 | 0.526 |
MTLR | 0.636 | 0.662 | |
SL | 0.590 | 0.599 | |
TMLE | 0.548 | 0.695 | |
E(Y(0,0,1)) | NAÏVE | 0.400 | 0.638 |
MTLR | 0.729 | 0.582 | |
SL | 0.636 | 0.561 | |
TMLE | 0.641 | 0.508 | |
E(Y(1,0,0)) | NAÏVE | 0.526 | 0.433 |
MTLR | 0.490 | 0.549 | |
SL | 0.505 | 0.535 | |
TMLE | 0.528 | 0.597 | |
E(Y(1,1,1)) | NAÏVE | 0.399 | 0.555 |
MTLR | 0.522 | 0.434 | |
SL | 0.531 | 0.476 | |
TMLE | 0.597 | 0.503 | |
E(Ybar) | NAÏVE | 0.533 | 0.533 |
MTLR | 0.533 | 0.533 | |
SL | 0.531 | 0.531 | |
TMLE | 0.540 | 0.528 |
Estimates for specific intervention comparisons with different estimation methods
Method | Intervention | ||
Physical activity (95% CI) | Energy intake (95% CI) | ||
Ψ1 | NAÏVE | −0.134 (–0.429, 0.100) | −0.056 (–0.254, 0.212) |
MTLR | −0.010 (–0.122, 0.103) | −0.099 (–0.166, –0.027) | |
SL | 0.0001 (–0.115, 0.086) | −0.055 (–0.167, –0.047)* | |
TMLE | 0.057 (–0.520, 0.634) | −0.026 (–0.458, 0.406) | |
Ψ2 | NAÏVE | −0.002 (0.617, 0.497) | 0.161 (–0.471, 0.237) |
MTLR | −0.206 (–0.382, –0.032)* | −0.148 (–0.298, 0.019) | |
SL | −0.106 (–0.345, 0.043) | −0.085 (–0.306, –0.018)* | |
TMLE | −0.044 (–0.304, 0.216) | −0.005 (–0.227, 0.217) | |
Ψ3 | NAÏVE | 0.125 (–0.481, 0.582) | −0.103 (–0.628, 0.163) |
MTLR | −0.239 (–0.436, –0.057) | −0.032 (–0.206, 0.145) | |
SL | −0.131 (–0.363, –0.064)* | −0.0261 (–0.200, 0.100) | |
TMLE | −0.114 (–0.330, 0.103) | 0.089 (–0.157, 0.335) | |
Ψ4 | NAÏVE | −0.007 (–0.226, 0.149) | 0.002 (–0.426, 0.174) |
MTLR | −0.437 (–0.110, 0.022) | 0.017 (–0.087, 0.114) | |
SL | −0.025 (–0.093, 0.020) | 0.004 (–0.080, 0.095) | |
TMLE | −0.012 (–0.620, 0.596) | 0.068 (–0.354, 0.491) | |
Ψ5 | NAÏVE | −0.348 (–0.646, –0.095)* | −0.056 (–0.325, 0.307) |
MTLR | −0.114 (–0.312, 0.083) | −0.228 (–0.388, –0.065)* | |
SL | −0.059 (–0.271, 0.076) | −0.123 (–0.400, –0.108)* | |
TMLE | 0.049 (–0.170, 0.269) | −0.192 (–0.374, –0.010)* |
Discussion
One should always estimate the data-generating distribution within a statistical model that only is constrained by what one actually knows about the distribution. In most circumstances, such as longitudinal studies of obesity, little if anything is known about the form of the data-generating distribution, so the true statistical model is semiparametric. In addition, the parameters of interest should be based upon the specific scientific/policy questions of interest, and not as a convenient byproduct of a parametric statistical model. In this article, we have applied advances in data-adaptive estimation (SuperLearning) combined with a targeted bias reduction (TMLE) to estimate parameters targeting the impact of longitudinal interventions in both caloric intake and physical activity on BMI in the study of adolescent obesity.
In our case, we defined the parameters of interest as marginal means of BMI at different fixed longitudinal intervention profiles, and differences of those means related to specific questions regarding how a potential pattern of intervention could affect BMI at age 19–20 years old. Then, we compared the use of data-adaptive super learning and a targeted (TMLE) augmentation to this initial fit to estimates of the same parameter using both parametric linear regression analyses, as well as a simple naïve (unadjusted) analysis.
The estimated impacts of physical activity and total calories on year 10 (ages 19–20) mean BMI z-score in this population were not significant, with total calories having a larger effect regardless of the estimation method used. Specifically, looking at the TMLE estimates, only the most extreme comparison (that is
To suggest that we can identify the causal effect of the proposed interventions from the data requires strong assumptions, and we acknowledge that many of them will not be precisely true. We also acknowledge that examining the longitudinal effects of total caloric intake and physical activity on BMI is a complex problem and that other variables could be considered in future studies. For instance, accurate measurement of diet and physical activity, especially in children, is challenging, and measurement error could have been a major issue in this application. Additionally, deterministically setting diet and physical activity to certain levels may not be realistic since, for example, girls who are very physically active may consume more calories. Dynamic regimes assign the exposure value based on the value of covariates. In this case, for example, physical activity could be set to a certain level depending on the total calories consumed. Rather than forcing everyone in the study population to the same exposure value, this allows for more realistic interventions. Finally, we limited our analysis to a relatively small set of potential confounders. Some other potential confounders of interest that warrant future investigation with regard to the development of adolescent obesity include genetic factors, self-perception, and exposures to mental health stressors, endocrine disruptors, and community-level variables such as aspects of the built environment [38–40]. Though there is clearly a benefit to reducing the bias of residual confounding, new challenges arise, such as greater potential for the violating positivity assumption, which requires that there be variation of the exposure variable within strata of the covariates; this assumption generally becomes harder to meet as more covariates are included in the analysis [41]. In our application, even with a relatively small set of covariates, some dimension reduction measures, such as dichotomizing continuous variables, was needed. Of course, that is one of the challenges of estimating ambitious parameters with relatively small sample sizes.
Recognizing the limitations of this analysis, we have provided a framework for estimation of longitudinal interventions related to childhood obesity from observational data. This analysis has demonstrated how a very flexible, and yet user-friendly implementation of estimation of intervention rules via longitudinal TMLE (ltmle) [28, 42] provides a powerful way of estimating parameters with direct public health relevance within a large (honest) statistical model.
Acknowledgment
This research was supported by National Institutes of Health (NIH) grant 1RC1DK086038
References
1. WangY, LobsteinT. Worldwide trends in childhood overweight and obesity. Int J Pediatr Obes2006;1:11–25.10.1080/17477160600586747Search in Google Scholar
2. SkeltonJA, CookSR, AuingerP, KleinJD, BarlowSE. Prevalence and trends of severe obesity among US children and adolescents. Acad Pediatr2009;9:322–9.10.1016/j.acap.2009.04.005Search in Google Scholar
3. ReillyJJ, MethvenE, McDowellZC, HackingB, AlexanderD, StewartL, et al. Health consequences of obesity. Arch Dis Child2003;88:748–52.10.1136/adc.88.9.748Search in Google Scholar
4. GuoSS, WuW, ChumleaWC, RocheAF. Predicting overweight and obesity in adulthood from body mass index values in childhood and adolescence. Am J Clin Nutr2002;76:653–8.10.1093/ajcn/76.3.653Search in Google Scholar
5. BjørgeT, EngelandA, TverdalA, SmithGD. Body mass index in adolescence in relation to cause-specific mortality: a follow-up of 230,000 Norwegian adolescents. Am J Epidemiol2008;168:30–7.10.1093/aje/kwn096Search in Google Scholar
6. ReillyJ, KellyJ. Long-term impact of overweight and obesity in childhood and adolescence on morbidity and premature mortality in adulthood: systematic review. Int J Obes2010;35:891–8.10.1038/ijo.2010.222Search in Google Scholar
7. CrawfordPB, ObarzanekE, MorrisonJ, SabryZ. Comparative advantage of 3-day food records over 24-hour recall and 5-day food frequency validated by observation of 9-and 10-year-old girls. J Am Diet Assoc1994;94:626–30.10.1016/0002-8223(94)90158-9Search in Google Scholar
8. MustA, TyborD. Physical activity and sedentary behavior: a review of longitudinal studies of weight and adiposity in youth. Int J Obes2005;29:S84–96.10.1038/sj.ijo.0803064Search in Google Scholar PubMed
9. NaderPR, O’BrienM, HoutsR, BradleyR, BelskyJ, CrosnoeR, et al. Identifying risk for obesity in early childhood. Pediatrics2006;118:e594–601.10.1542/peds.2005-2801Search in Google Scholar PubMed
10. PatrickK, NormanGJ, CalfasKJ, SallisJF, ZabinskiMF, RuppJ, et al. Diet, physical activity, and sedentary behaviors as risk factors for overweight in adolescence. Arch Pediatr Adolesc Med2004;158:385.10.1001/archpedi.158.4.385Search in Google Scholar PubMed
11. BerkeyCS, RockettHR, FieldAE, GillmanMW, FrazierAL, CamargoCA, et al. Activity, dietary intake, and weight changes in a longitudinal study of preadolescent and adolescent boys and girls. Pediatrics2000;105:e56.10.1542/peds.105.4.e56Search in Google Scholar
12. RobinsJ. A new approach to causal inference in mortality studies with a sustained exposure period – application to control of the healthy worker survivor effect. Math Model1986;7:1393–512.10.1016/0270-0255(86)90088-6Search in Google Scholar
13. TaubmanSL, RobinsJM, MittlemanMA, HernánMA. Intervening on risk factors for coronary heart disease: an application of the parametric g-formula. Int J Epidemiol2009;38:1599–611.10.1093/ije/dyp192Search in Google Scholar
14. Van der LaanMJ, RoseS. Targeted learning: causal inference for observational and experimental data. New York: Springer, 2011.10.1007/978-1-4419-9782-1Search in Google Scholar
15. MorrisonJ, BiroF, CampaigneB, BartonB, ShumannB, CrawfordP, et al. Obesity and cardiovascular-disease-risk-factors in black-and-white girls-the NHLBI Growth and Health Study. Am J Public Health1992;82:1613–20.10.2105/AJPH.82.12.1613Search in Google Scholar
16. SchwimmerJB, BurwinkleTM, VarniJW. Health-related quality of life of severely obese children and adolescents. JAMA J Am Med Assoc2003;289:1813–19.10.1001/jama.289.14.1813Search in Google Scholar
17. WangM-C, CrawfordPB, HudesM, Van LoanM, SiemeringK, BachrachLK. Diet in midpuberty and sedentary activity in prepuberty predict peak bone mass. Am J Clin Nutr2003;77:495–503.10.1093/ajcn/77.2.495Search in Google Scholar
18. KimmS, BartonBA, BerhaneK, RossJW, PayneGH, SchreiberGB. Self-esteem and adiposity in black and white girls: the NHLBI growth and health study. Ann Epidemiol1997;7:550–60.10.1016/S1047-2797(97)00124-5Search in Google Scholar
19. Striegel-MooreRH, ThompsonDR, AffenitoSG, FrankoDL, BartonBA, SchreiberGB, et al. Fruit and vegetable intake: few adolescent girls meet national guidelines. Prev Med2006;42:223–8.10.1016/j.ypmed.2005.11.018Search in Google Scholar PubMed
20. DannerFW. A national longitudinal study of the association between hours of TV viewing and the trajectory of BMI growth among US children. J Pediatr Psychol2008;33:1100–07.10.1093/jpepsy/jsn034Search in Google Scholar PubMed
21. KuczmarskiRJ, OgdenCL, GuoSS, Grummer-StrawnLM, FlegalKM, MeiZ, et al. 2000 CDC growth charts for the united states: methods and development. Vital Health Stat2002;11:1–190.Search in Google Scholar
22. KuLC, ShapiroLR, CrawfordPB, and HuenemannRL. Body composition and physical activity in 8-year-old children. Am J Clin Nutr1981;34(12):2770–2775.10.1093/ajcn/34.12.2770Search in Google Scholar PubMed
23. CohenS, KamarckT, MermelsteinR. A global measure of perceived stress. J Health Soc Behav1983;4(24):385–96.10.2307/2136404Search in Google Scholar
24. HarterS. The perceived competence scale for children. Child Dev1982;53(1):87–97.10.2307/1129640Search in Google Scholar
25. D. G. A. Committee and others. Report of the dietary guidelines advisory committee on the dietary guidelines for Americans, 2010, to the secretary of agriculture and the secretary of health and human services. Agric. Res. Serv2010.Search in Google Scholar
26. KimmSY, GlynnNW, KriskaAM, BartonBA, KronsbergSS, DanielsSR, et al. Decline in physical activity in black girls and white girls during adolescence. N Engl J Med2002;347:709–15.10.1056/NEJMoa003277Search in Google Scholar PubMed
27. Van der LaanMJ, RobinsJM. Unified methods for censored longitudinal data and causality. New York: Springer, 2003.10.1007/978-0-387-21700-0Search in Google Scholar
28. van der LaanMJ, GruberS. Targeted minimum loss based estimation of an intervention specific mean outcome. Int J Biostat2012;8; PetersenML, SchwabJ, GruberS, BlaserN, SchomakerM, van der LaanMJ. Targeted maximum likelihood estimation for dynamic and static longitudinal marginal structural working models. U.C. Berkeley Division of Biostatistics Working Paper Series. Working Paper 312, May 2013. Available at: http://biostats.bepress.com/ucbbiostat/paper312.Search in Google Scholar
29. van der LaanMJ, PolleyEC, HubbardAE. Super learner. Stat Appl Genet Mol Biol2007;6:1–21.10.2202/1544-6115.1309Search in Google Scholar PubMed
30. van der LaanMJ, DudoitS, KelesS. Asymptotic optimality of likelihood based cross-validation. Stat Appl Genet Mol Biol2003;3(4).10.2202/1544-6115.1036Search in Google Scholar PubMed
31. PolleyE, van der LaanM. SuperLearner in prediction. 2010. UC Berkeley Division of Biostatistics Working Paper Series.Search in Google Scholar
32. HastieT, TibshiraniR. Generalized additive models. Stat Sci1986;1(3):297–310.Search in Google Scholar
33. GelmanA, JakulinA, PittauMG, SuY-S.A default prior distribution for logistic and other regression models. Unpublished Manuscript. See Www Stat Columbia Edugelman, 2006.Search in Google Scholar
34. FriedmanJ, HastieT, TibshiraniR. Regularization paths for generalized linear models via coordinate descent. J Stat Softw2010;33:1.10.18637/jss.v033.i01Search in Google Scholar
35. FriedmanJ, HastieT, HöflingH, TibshiraniR. Pathwise coordinate optimization. Ann Appl Stat2007;1:302–32.10.1214/07-AOAS131Search in Google Scholar
36. BangH, RobinsJM. Doubly robust estimation in missing data and causal inference models. Biometrics2005;61:962–73.10.1111/j.1541-0420.2005.00377.xSearch in Google Scholar PubMed
37. RobinsJM, RotnitzkyA, ZhaoLP. Estimation of regression coefficients when some regressors are not always observed. J Am Stat Assoc1994;89:846–66.10.1080/01621459.1994.10476818Search in Google Scholar
38. BlackJL, MacinkoJ. Neighborhoods and obesity. Nutr Rev2008;66:2–20.10.1111/j.1753-4887.2007.00001.xSearch in Google Scholar PubMed
39. LovasiGS, HutsonMA, GuerraM, NeckermanKM. Built environments and obesity in disadvantaged populations. Epidemiol Rev2009;31:7–20.10.1093/epirev/mxp005Search in Google Scholar PubMed
40. ThorntonLE, PearceJR, KavanaghAM. Using geographic information systems (GIS) to assess the role of the built environment in influencing obesity: a glossary. Int J Behav Nutr Phys Act2011;8:71.10.1186/1479-5868-8-71Search in Google Scholar PubMed PubMed Central
41. PetersenML, PorterK, GruberS, WangY, van der LaanMJ.Diagnosing and responding to violations in the positivity assumption. Stat Methods Med Res2012;21(1):31–54.10.1177/0962280210386207Search in Google Scholar PubMed PubMed Central
42. SchwabJ, LendleS, PetersenM, van der LaanM. Longitudinal targeted maximum liklelihood estimation. R Package version 0.9.3.2013, 2012.Search in Google Scholar
©2014 by Walter de Gruyter Berlin / Boston