Statistical Applications in Genetics and Molecular Biology
Editor-in-Chief: Stumpf, Michael P.H.
IMPACT FACTOR increased in 2015: 1.265
5-year IMPACT FACTOR: 1.423
Rank 42 out of 123 in category Statistics & Probability in the 2015 Thomson Reuters Journal Citation Report/Science Edition
SCImago Journal Rank (SJR) 2015: 0.954
Source Normalized Impact per Paper (SNIP) 2015: 0.554
Impact per Publication (IPP) 2015: 1.061
Mathematical Citation Quotient (MCQ) 2015: 0.06
GENOVA: Gene Overlap Analysis of GWAS Results
1Queensland Institute of Medical Research
2Queensland Institute of Medical Research
Citation Information: Statistical Applications in Genetics and Molecular Biology. Volume 11, Issue 3, ISSN (Online) 1544-6115, DOI: 10.1515/1544-6115.1784, February 2012
- Published Online:
In many published genome-wide association studies (GWAS), the top few strongly associated variants are often located in or near known genes. This observation raises the more general hypothesis that variants nominally associated with a phenotype are more likely to overlap genes than those not associated with a phenotype. We developed a simple approach – named GENe OVerlap Analysis (GENOVA) – to formally test this hypothesis. This approach includes two steps. First, we define largely independent groups of highly correlated SNPs (or “clumps”) and classify each clump as intersecting a gene or not. Second, we determine how strongly associated each clump is with the phenotype and use logistic regression to formally test the hypothesis that clumps associated with the phenotype are more likely to intersect genes. Simulations suggest that the power of GENOVA is affected by at least three factors: GWAS sample size, the gene boundaries used to define gene-intersecting clumps and the P-value threshold used to define phenotype-associated clumps. We applied GENOVA to results from three recent GWAS meta-analyses of height, body mass index (BMI) and waist-hip ratio (WHR) conducted by the GIANT consortium. SNPs associated with variation in height were 1.44-fold more likely to be in or near genes than SNPs not associated with height (P = 5x10-28). A weaker association was observed for BMI (1.09-fold, P = 0.008) and WHR (1.09-fold, P = 0.014). GENOVA is implemented in C++ and is freely available at https://genepi.qimr.edu.au/staff/manuelF/genova/main.html.