nature.com

FaST linear mixed models for genome-wide association studies - Nature Methods

  • ️Heckerman, David
  • ️Sun Sep 04 2011
  • Brief Communication
  • Published: 04 September 2011

Nature Methods volume 8pages 833–835 (2011)Cite this article

Subjects

Abstract

We describe factored spectrally transformed linear mixed models (FaST-LMM), an algorithm for genome-wide association studies (GWAS) that scales linearly with cohort size in both run time and memory use. On Wellcome Trust data for 15,000 individuals, FaST-LMM ran an order of magnitude faster than current efficient algorithms. Our algorithm can analyze data for 120,000 individuals in just a few hours, whereas current algorithms fail on data for even 20,000 individuals (http://mscompbio.codeplex.com/).

This is a preview of subscription content, access via your institution

Access options

Subscribe to this journal

Receive 12 print issues and online access

$259.00 per year

only $21.58 per issue

Buy this article

  • Purchase on SpringerLink
  • Instant access to full article PDF

Prices may be subject to local taxes which are calculated during checkout

Additional access options:

Similar content being viewed by others

References

  1. Balding, D.J. Nat. Rev. Genet. 7, 781–791 (2006).

    Article  CAS  Google Scholar 

  2. Yu, J. et al. Nat. Genet. 38, 203–208 (2006).

    Article  CAS  Google Scholar 

  3. Kang, H.M. et al. Genetics 107, 1709–1723 (2008).

  4. Zhang, Z. et al. Nat. Genet. 42, 355–360 (2010).

    Article  CAS  Google Scholar 

  5. Kang, H.M. et al. Nat. Genet. 42, 348–354 (2010).

    Article  CAS  Google Scholar 

  6. Zhao, K. et al. PLoS Genet. 3, e4 (2007).

    Article  Google Scholar 

  7. Price, A.L., Zaitlen, N.A., Reich, D. & Patterson, N. Nat. Rev. Genet. 11, 459–463 (2010).

    Article  CAS  Google Scholar 

  8. Henderson, C.R. Applications of Linear Models in Animal Breeding (University of Guelph, Guelph, Ontario, Canada, 1984).

  9. Goddard, M.E., Wray, N., Verbyla, K. & Visscher, P.M. Stat. Sci. 24, 517–529 (2009).

    Article  Google Scholar 

  10. Hayes, B.J., Visscher, P.M. & Goddard, M.E. Genet. Res. 91, 47–60 (2009).

    Article  CAS  Google Scholar 

  11. Fisher, R. Trans. R. Soc. Edinb. 52, 399–433 (1918).

    Article  Google Scholar 

  12. Yang, J. et al. Nat. Genet. 42, 565–569 (2010).

    Article  CAS  Google Scholar 

  13. Welham, S. & Thompson, R. J. R. Stat. Soc. B 59, 701–714 (1997).

    Article  Google Scholar 

  14. Demidenko, E. Mixed Models Theory and Applications (Wiley, Hoboken, New Jersey, USA, 2004).

  15. Listgarten, J., Kadie, C., Schadt, E.E. & Heckerman, D. Proc. Natl. Acad. Sci. USA 107, 16465–16470 (2010).

    Article  CAS  Google Scholar 

  16. Devlin, B. & Roeder, K. Biometrics 55, 997–1004 (1999).

    Article  CAS  Google Scholar 

  17. Edenberg, H.J. et al. BMC Genet. 6 (suppl. 1), S2 (2005).

    Article  Google Scholar 

  18. Wellcome Trust Case Control Consortium. Nature 447, 661–678 (2007).

Download references

Acknowledgements

We thank E. Renshaw for help with implementation of Brent's method and the χ2 distribution function, J. Carlson for help with tools used to manage the data and deploy runs on our computer cluster, and N. Pfeifer for an implementation of the ATT. A full list of the investigators who contributed to the generation of the Wellcome Trust Case-Control Consortium data we used in this study is available from http://www.wtccc.org.uk/. Funding for the project was provided by the Wellcome Trust (076113 and 085475). The GAW14 data were provided by the members of the Collaborative Study on the Genetics of Alcoholism (US National Institutes of Health grant U10 AA008401).

Author information

Author notes

  1. Christoph Lippert, Jennifer Listgarten and David Heckerman: These authors contributed equally to this work.

Authors and Affiliations

  1. Microsoft Research, Los Angeles, California, USA

    Christoph Lippert, Jennifer Listgarten, Ying Liu, Carl M Kadie, Robert I Davidson & David Heckerman

  2. Max Planck Institutes Tübingen, Tübingen, Germany

    Christoph Lippert

Authors

  1. Christoph Lippert

    You can also search for this author in PubMed Google Scholar

  2. Jennifer Listgarten

    You can also search for this author in PubMed Google Scholar

  3. Ying Liu

    You can also search for this author in PubMed Google Scholar

  4. Carl M Kadie

    You can also search for this author in PubMed Google Scholar

  5. Robert I Davidson

    You can also search for this author in PubMed Google Scholar

  6. David Heckerman

    You can also search for this author in PubMed Google Scholar

Contributions

C.L., J.L. and D.H. designed and performed research, contributed analytic tools, analyzed data and wrote the paper. Y.L. designed and performed research. C.M.K. and R.I.D contributed analytic tools.

Corresponding authors

Correspondence to Christoph Lippert, Jennifer Listgarten or David Heckerman.

Ethics declarations

Competing interests

C.L., J.L., C.M.K., R.I.D. and D.H. are employees of Microsoft. Y.L. was employed by Microsoft while performing this research.

Supplementary information

About this article

Cite this article

Lippert, C., Listgarten, J., Liu, Y. et al. FaST linear mixed models for genome-wide association studies. Nat Methods 8, 833–835 (2011). https://doi.org/10.1038/nmeth.1681

Download citation

  • Received: 05 April 2011

  • Accepted: 02 August 2011

  • Published: 04 September 2011

  • Issue Date: October 2011

  • DOI: https://doi.org/10.1038/nmeth.1681