Statistical Applications in Genetics and Molecular Biology
Editor-in-Chief: Stumpf, Michael P.H.
IMPACT FACTOR increased in 2015: 1.265
5-year IMPACT FACTOR: 1.423
Rank 42 out of 123 in category Statistics & Probability in the 2015 Thomson Reuters Journal Citation Report/Science Edition
SCImago Journal Rank (SJR) 2015: 0.954
Source Normalized Impact per Paper (SNIP) 2015: 0.554
Impact per Publication (IPP) 2015: 1.061
Mathematical Citation Quotient (MCQ) 2015: 0.06
A Simple Loglinear Model for Haplotype Effects in a Case-Control Study Involving Two Unphased Genotypes
1National Cancer Institute
Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 4, Issue 1, ISSN (Online) 1544-6115, ISSN (Print) 2194-6302, DOI: 10.2202/1544-6115.1113, June 2005
- Published Online:
Because haplotypes may parsimoniously summarize the effect of genes on disease, there is great interest in using haplotypes in case-control studies of unphased genotype data. Previous methods for investigating haplotypes effects in case-control studies have not allowed for both of the following two scenarios that could have a large impact on results (i) departures from Hardy-Weinberg equilibrium in controls as well as cases, and (ii) an interactive effect of haplotypes and environmental covariates on the probability of disease. A new method is proposed that generalizes the model of Epstein and Satten to incorporate both (i) and (ii). Computations are relatively simple involving a single loglinear design matrix for parameters modeling the distribution of haplotype frequencies in controls, parameters modeling the effect of haplotypes and covariate-haplotype interactions on disease, and nuisance parameters required for correct inference. Based on simulations with realistic sample sizes, the method is recommended with data from two genotypes, a recessive or dominant model linking haplotypes to disease, and estimates of haplotype effects among haplotypes with a frequency greater than 10%. The methodology is most useful with candidate genotype pairs or for searching through pairs of genotypes when scenarios (i) and (ii) are likely. An example without a covariate illustrates the importance of modeling a departure from Hardy-Weinberg equilibrium in controls.