Jump to ContentJump to Main Navigation

Statistical Applications in Genetics and Molecular Biology

Editor-in-Chief: Stumpf, Michael P.H.

6 Issues per year

Increased IMPACT FACTOR 2012: 1.717
Rank 18 out of 117 in category Statistics & Probability in the 2012 Thomson Reuters Journal Citation Report/Science Edition
Mathematical Citation Quotient 2012: 0.07

VolumeIssuePage

A Simple Loglinear Model for Haplotype Effects in a Case-Control Study Involving Two Unphased Genotypes

Stuart G. Baker1

1National Cancer Institute

Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 4, Issue 1, ISSN (Online) 1544-6115, DOI: 10.2202/1544-6115.1113, June 2005

Publication History

Published Online:
2005-06-02

Because haplotypes may parsimoniously summarize the effect of genes on disease, there is great interest in using haplotypes in case-control studies of unphased genotype data. Previous methods for investigating haplotypes effects in case-control studies have not allowed for both of the following two scenarios that could have a large impact on results (i) departures from Hardy-Weinberg equilibrium in controls as well as cases, and (ii) an interactive effect of haplotypes and environmental covariates on the probability of disease. A new method is proposed that generalizes the model of Epstein and Satten to incorporate both (i) and (ii). Computations are relatively simple involving a single loglinear design matrix for parameters modeling the distribution of haplotype frequencies in controls, parameters modeling the effect of haplotypes and covariate-haplotype interactions on disease, and nuisance parameters required for correct inference. Based on simulations with realistic sample sizes, the method is recommended with data from two genotypes, a recessive or dominant model linking haplotypes to disease, and estimates of haplotype effects among haplotypes with a frequency greater than 10%. The methodology is most useful with candidate genotype pairs or for searching through pairs of genotypes when scenarios (i) and (ii) are likely. An example without a covariate illustrates the importance of modeling a departure from Hardy-Weinberg equilibrium in controls.

Keywords: Composite Linear Model; EM algorithm; Gene-environment interaction; Genetics; Multinomial-Poisson transformation; Newton-Raphson

Comments (0)

Please log in or register to comment.
Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.