A powerful and efficient set test for genetic markers that handles confounders - PubMed
- ️Tue Jan 01 2013
A powerful and efficient set test for genetic markers that handles confounders
Jennifer Listgarten et al. Bioinformatics. 2013.
Abstract
Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power.
Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects-one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn's disease case-control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis.
Availability: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com.
Figures
Similar articles
-
Lippert C, Xiang J, Horta D, Widmer C, Kadie C, Heckerman D, Listgarten J. Lippert C, et al. Bioinformatics. 2014 Nov 15;30(22):3206-14. doi: 10.1093/bioinformatics/btu504. Epub 2014 Jul 29. Bioinformatics. 2014. PMID: 25075117 Free PMC article.
-
Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning.
Padhukasahasram B, Reddy CK, Levin AM, Burchard EG, Williams LK. Padhukasahasram B, et al. PLoS One. 2015 Nov 30;10(11):e0143489. doi: 10.1371/journal.pone.0143489. eCollection 2015. PLoS One. 2015. PMID: 26619286 Free PMC article.
-
RL-SKAT: An Exact and Efficient Score Test for Heritability and Set Tests.
Schweiger R, Weissbrod O, Rahmani E, Müller-Nurasyid M, Kunze S, Gieger C, Waldenberger M, Rosset S, Halperin E. Schweiger R, et al. Genetics. 2017 Dec;207(4):1275-1283. doi: 10.1534/genetics.117.300395. Epub 2017 Oct 12. Genetics. 2017. PMID: 29025915 Free PMC article.
-
Guo B, Wu B. Guo B, et al. Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961. Bioinformatics. 2019. PMID: 30476000 Free PMC article.
-
Population structure in genetic studies: Confounding factors and mixed models.
Sul JH, Martin LS, Eskin E. Sul JH, et al. PLoS Genet. 2018 Dec 27;14(12):e1007309. doi: 10.1371/journal.pgen.1007309. eCollection 2018 Dec. PLoS Genet. 2018. PMID: 30589851 Free PMC article. Review.
Cited by
-
Lai E, Danner AL, Famula TR, Oberbauer AM. Lai E, et al. Front Genet. 2021 May 28;12:657375. doi: 10.3389/fgene.2021.657375. eCollection 2021. Front Genet. 2021. PMID: 34122511 Free PMC article.
-
Sørensen IF, Edwards SM, Rohde PD, Sørensen P. Sørensen IF, et al. Sci Rep. 2017 May 25;7(1):2413. doi: 10.1038/s41598-017-02281-3. Sci Rep. 2017. PMID: 28546557 Free PMC article.
-
Monti R, Rautenstrauch P, Ghanbari M, James AR, Kirchler M, Ohler U, Konigorski S, Lippert C. Monti R, et al. Nat Commun. 2022 Sep 10;13(1):5332. doi: 10.1038/s41467-022-32864-2. Nat Commun. 2022. PMID: 36088354 Free PMC article.
-
Listgarten J, Stegle O, Morris Q, Brenner SE, Parts L. Listgarten J, et al. Pac Symp Biocomput. 2014;19:224-8. doi: 10.1142/9789814583220_0022. Pac Symp Biocomput. 2014. PMID: 24297549 Free PMC article. No abstract available.
-
Using controls to limit false discovery in the era of big data.
Parks MM, Raphael BJ, Lawrence CE. Parks MM, et al. BMC Bioinformatics. 2018 Sep 14;19(1):323. doi: 10.1186/s12859-018-2356-2. BMC Bioinformatics. 2018. PMID: 30217148 Free PMC article.
References
-
- Astle W, Balding DJ. Population structure and cryptic relatedness in genetic association studies. Stat. Sci. 2009;24:451–471.
-
- Balding DJ. A tutorial on statistical methods for population association studies. Nat. Rev. Genet. 2006;7:781–791. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources