MultiPhen: Joint model of multiple phenotypes can increase discovery in GWAS

276Citations
Citations of this article
352Readers
Mendeley users who have this article in their library.

Abstract

The genome-wide association study (GWAS) approach has discovered hundreds of genetic variants associated with diseases and quantitative traits. However, despite clinical overlap and statistical correlation between many phenotypes, GWAS are generally performed one-phenotype-at-a-time. Here we compare the performance of modelling multiple phenotypes jointly with that of the standard univariate approach. We introduce a new method and software, MultiPhen, that models multiple phenotypes simultaneously in a fast and interpretable way. By performing ordinal regression, MultiPhen tests the linear combination of phenotypes most associated with the genotypes at each SNP, and thus potentially captures effects hidden to single phenotype GWAS. We demonstrate via simulation that this approach provides a dramatic increase in power in many scenarios. There is a boost in power for variants that affect multiple phenotypes and for those that affect only one phenotype. While other multivariate methods have similar power gains, we describe several benefits of MultiPhen over these. In particular, we demonstrate that other multivariate methods that assume the genotypes are normally distributed, such as canonical correlation analysis (CCA) and MANOVA, can have highly inflated type-1 error rates when testing case-control or non-normal continuous phenotypes, while MultiPhen produces no such inflation. To test the performance of MultiPhen on real data we applied it to lipid traits in the Northern Finland Birth Cohort 1966 (NFBC1966). In these data MultiPhen discovers 21% more independent SNPs with known associations than the standard univariate GWAS approach, while applying MultiPhen in addition to the standard approach provides 37% increased discovery. The most associated linear combinations of the lipids estimated by MultiPhen at the leading SNPs accurately reflect the Friedewald Formula, suggesting that MultiPhen could be used to refine the definition of existing phenotypes or uncover novel heritable phenotypes. © 2012 O'Reilly et al.

References Powered by Scopus

Estimation of the concentration of low-density lipoprotein cholesterol in plasma, without use of the preparative ultracentrifuge.

27848Citations
N/AReaders
Get full text

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls

8180Citations
N/AReaders
Get full text

Potential etiologic and functional implications of genome-wide association loci for human diseases and traits

3250Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Pleiotropy in complex traits: Challenges and strategies

772Citations
N/AReaders
Get full text

Efficient multivariate linear mixed model algorithms for genome-wide association studies

556Citations
N/AReaders
Get full text

Analytical methods in untargeted metabolomics: State of the art in 2015

539Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

O’Reilly, P. F., Hoggart, C. J., Pomyen, Y., Calboli, F. C. F., Elliott, P., Jarvelin, M. R., & Coin, L. J. M. (2012). MultiPhen: Joint model of multiple phenotypes can increase discovery in GWAS. PLoS ONE, 7(5). https://doi.org/10.1371/journal.pone.0034861

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 143

53%

Researcher 96

36%

Professor / Associate Prof. 25

9%

Lecturer / Post doc 5

2%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 119

54%

Biochemistry, Genetics and Molecular Bi... 58

26%

Medicine and Dentistry 25

11%

Computer Science 20

9%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free