Mixed linear model approach adapted for genome-wide association studies

Zhiwu Zhang, Elhan Ersoz, Chao Qiang Lai, Rory J. Todhunter, Hemant K. Tiwari, Michael A. Gore, Peter J. Bradbury, Jianming Yu, Donna K. Arnett, Jose M. Ordovas, Edward S. Buckler

Research output: Contribution to journalArticlepeer-review

Abstract

Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called 'compressed MLM', that decreases the effective sample size of such datasets by clustering individuals into groups. We also present a complementary approach, 'population parameters previously determined' (P3D), that eliminates the need to re-compute variance components. We applied these two methods both independently and combined in selected genetic association datasets from human, dog and maize. The joint implementation of these two methods markedly reduced computing time and either maintained or improved statistical power. We used simulations to demonstrate the usefulness in controlling for substructure in genetic association datasets for a range of species and genetic architectures. We have made these methods available within an implementation of the software program TASSEL.

Original languageEnglish (US)
Pages (from-to)355-360
Number of pages6
JournalNature Genetics
Volume42
Issue number4
DOIs
StatePublished - Apr 2010
Externally publishedYes

ASJC Scopus subject areas

  • Genetics

Fingerprint

Dive into the research topics of 'Mixed linear model approach adapted for genome-wide association studies'. Together they form a unique fingerprint.

Cite this