...
首页> 外文期刊>BMC Medical Genomics >Identifying disease genes by integrating multiple data sources
【24h】

Identifying disease genes by integrating multiple data sources

机译:通过整合多个数据源识别疾病基因

获取原文
           

摘要

Background Now multiple types of data are available for identifying disease genes. Those data include gene-disease associations, disease phenotype similarities, protein-protein interactions, pathways, gene expression profiles, etc .. It is believed that integrating different kinds of biological data is an effective method to identify disease genes. Results In this paper, we propose a multiple data integration method based on the theory of Markov random field (MRF) and the method of Bayesian analysis for identifying human disease genes. The proposed method is not only flexible in easily incorporating different kinds of data, but also reliable in predicting candidate disease genes. Conclusions Numerical experiments are carried out by integrating known gene-disease associations, protein complexes, protein-protein interactions, pathways and gene expression profiles. Predictions are evaluated by the leave-one-out method. The proposed method achieves an AUC score of 0.743 when integrating all those biological data in our experiments.
机译:背景技术现在有多种类型的数据可用于识别疾病基因。这些数据包括基因-疾病关联,疾病表型相似性,蛋白质-蛋白质相互作用,途径,基因表达谱等。据信,整合不同种类的生物学数据是鉴定疾病基因的有效方法。结果在本文中,我们提出了一种基于马尔可夫随机场(MRF)理论和贝叶斯分析方法的多重数据集成方法,用于识别人类疾病基因。所提出的方法不仅灵活方便地合并不同种类的数据,而且在预测候选疾病基因方面也很可靠。结论通过整合已知的基因-疾病关联,蛋白质复合物,蛋白质-蛋白质相互作用,途径和基因表达谱进行了数值实验。预测通过留一法进行评估。当将所有这些生物学数据整合到我们的实验中时,所提出的方法获得的AUC评分为0.743。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号