首页> 外文期刊>BMC Bioinformatics >Stochastic Lanczos estimation of genomic variance components for linear mixed-effects models
【24h】

Stochastic Lanczos estimation of genomic variance components for linear mixed-effects models

机译:线性混合效应模型的基因组方差成分的随机Lanczos估计

获取原文
           

摘要

Linear mixed-effects models (LMM) are a leading method in conducting genome-wide association studies (GWAS) but require residual maximum likelihood (REML) estimation of variance components, which is computationally demanding. Previous work has reduced the computational burden of variance component estimation by replacing direct matrix operations with iterative and stochastic methods and by employing loose tolerances to limit the number of iterations in the REML optimization procedure. Here, we introduce two novel algorithms, stochastic Lanczos derivative-free REML (SLDF_REML) and Lanczos first-order Monte Carlo REML (L_FOMC_REML), that exploit problem structure via the principle of Krylov subspace shift-invariance to speed computation beyond existing methods. Both novel algorithms only require a single round of computation involving iterative matrix operations, after which their respective objectives can be repeatedly evaluated using vector operations. Further, in contrast to existing stochastic methods, SLDF_REML can exploit precomputed genomic relatedness matrices (GRMs), when available, to further speed computation. Results of numerical experiments are congruent with theory and demonstrate that interpreted-language implementations of both algorithms match or exceed existing compiled-language software packages in speed, accuracy, and flexibility. Both the SLDF_REML and L_FOMC_REML algorithms outperform existing methods for REML estimation of variance components for LMM and are suitable for incorporation into existing GWAS LMM software implementations.
机译:线性混合效应模型(LMM)是进行全基因组关联研究(GWAS)的主要方法,但需要方差成分的残差最大似然(REML)估计,这在计算上是有要求的。以前的工作通过用迭代和随机方法代替直接矩阵运算,以及通过采用宽松的公差来限制REML优化过程中的迭代次数,从而减轻了方差分量估计的计算负担。在这里,我们介绍两种新颖的算法,随机Lanczos无导数REML(SLDF_REML)和Lanczos一阶蒙特卡洛REML(L_FOMC_REML),它们利用Krylov子空间不变性原理利用问题结构来加快计算速度,超越了现有方法。两种新颖的算法仅需进行涉及迭代矩阵运算的单轮计算,然后可以使用矢量运算重复评估其各自的目标。此外,与现有的随机方法相比,SLDF_REML可以在可用时利用预先计算的基因组相关性矩阵(GRM)来进一步加快计算速度。数值实验的结果与理论相吻合,并证明两种算法的解释语言实现在速度,准确性和灵活性上均匹配或超过了现有的编译语言软件包。 SLDF_REML和L_FOMC_REML算法均优于现有的用于LMM方差分量的REML估计的方法,适用于并入现有GWAS LMM软件实现中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号