...
首页> 外文期刊>BMC Veterinary Research >Constructing a gene semantic similarity network for the inference of disease genes
【24h】

Constructing a gene semantic similarity network for the inference of disease genes

机译:构建疾病语义推断的基因语义相似网络

获取原文
           

摘要

MotivationThe inference of genes that are truly associated with inherited human diseases from a set of candidates resulting from genetic linkage studies has been one of the most challenging tasks in human genetics. Although several computational approaches have been proposed to prioritize candidate genes relying on protein-protein interaction (PPI) networks, these methods can usually cover less than half of known human genes.ResultsWe propose to rely on the biological process domain of the gene ontology to construct a gene semantic similarity network and then use the network to infer disease genes. We show that the constructed network covers about 50% more genes than a typical PPI network. By analyzing the gene semantic similarity network with the PPI network, we show that gene pairs tend to have higher semantic similarity scores if the corresponding proteins are closer to each other in the PPI network. By analyzing the gene semantic similarity network with a phenotype similarity network, we show that semantic similarity scores of genes associated with similar diseases are significantly different from those of genes selected at random, and that genes with higher semantic similarity scores tend to be associated with diseases with higher phenotype similarity scores. We further use the gene semantic similarity network with a random walk with restart model to infer disease genes. Through a series of large-scale leave-one-out cross-validation experiments, we show that the gene semantic similarity network can achieve not only higher coverage but also higher accuracy than the PPI network in the inference of disease genes.Contactruijiang@tsinghua.edu.cn
机译:动机从遗传连锁研究得出的一组候选物中推断与遗传性人类疾病真正相关的基因一直是人类遗传学中最具挑战性的任务之一。尽管已经提出了几种计算方法来依靠蛋白质-蛋白质相互作用(PPI)网络对候选基因进行优先级排序,但是这些方法通常可以覆盖不到一半的已知人类基因。结果我们建议依靠基因本体的生物学过程域来构建基因语义相似性网络,然后使用该网络推断疾病基因。我们显示,构建的网络比典型的PPI网络覆盖的基因多50%。通过与PPI网络分析基因语义相似性网络,我们发现如果相应的蛋白质在PPI网络中彼此更接近,则基因对往往具有较高的语义相似性评分。通过用表型相似性网络分析基因语义相似性网络,我们发现与相似疾病相关的基因的语义相似性得分与随机选择的基因显着不同,并且具有较高语义相似性得分的基因倾向于与疾病相关具有较高的表型相似性评分。我们进一步使用带有重新启动模型的随机游走的基因语义相似性网络来推断疾病基因。通过一系列大规模的留一法交叉验证实验,我们证明基因语义相似性网络在疾病基因的推断上不仅可以实现比PPI网络更高的覆盖率,而且可以达到更高的准确性.Contactruijiang @ tsinghua。 edu.cn

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号