首页> 外文期刊>BMC Medical Informatics and Decision Making >A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
【24h】

A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets

机译:不同生物医学数据集的疾病,基因和药物中的语义关系挖掘方法

获取原文
           

摘要

Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example, precision medicine and drug repositioning. However, most of the existing studies focused on a single dataset. It is not easy to find the most current relationships among disorder-gene-drug relationships since the relationships are distributed in heterogeneous datasets. How to mine their semantic relationships from different biomedical datasets is an important issue. First, a variety of biomedical datasets were converted into RDF triple data; then, multisource biomedical datasets were integrated into a storage system using a data integration algorithm. Second, nine query patterns among genes, disorders, and drugs from different biomedical datasets were designed. Third, the gene-disorder-drug semantic relationship mining algorithm is presented. This algorithm can query the relationships among various entities from different datasets. We focused on mining the putative and the most current disorder-gene-drug relationships about Parkinson’s disease (PD). The results demonstrate that our method has significant advantages in mining and integrating multisource heterogeneous biomedical datasets. Twenty-five new relationships among the genes, disorders, and drugs were mined from four different datasets. The query results showed that most of them came from different datasets. The precision of the method increased by 2.51% compared to that of the multisource linked open data fusion method presented in the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019). Moreover, the number of query results increased by 7.7%, and the number of correct queries increased by 9.5%.
机译:语义Web技术已广泛应用于生物医学信息管理领域。大量的生物医学数据集可在资源描述框架(RDF)格式中在线提供。基因,疾病和药物中的语义关系开采,广泛用于例如精确药物和药物重新定位。然而,大多数现有研究都集中在一个数据集上。由于关系分布在异构数据集中,因此不容易找到紊乱基因 - 药物关系中的最新关系。如何从不同的生物医学数据集中挖掘他们的语义关系是一个重要问题。首先,将各种生物医学数据集转换为RDF三重数据;然后,使用数据集成算法将多源生物医学数据集集成到存储系统中。其次,设计了基因,疾病和来自不同生物医学数据集的药物中的九种查询模式。第三,提出了基因障碍 - 药物语义关系挖掘算法。该算法可以查询来自不同数据集的各种实体之间的关系。我们专注于开采帕金森病(PD)的推定和最目前的疾病基因 - 药物关系。结果表明,我们的方法在挖掘和集成多源异质生物医学数据集中具有显着的优势。从四个不同的数据集中开采了基因,疾病和药物中的二十五个新关系。查询结果表明,大多数来自不同的数据集。与第四届国际研讨会上的MultiSource联系开放数据融合方法相比,该方法的精度增加了2.51%(S​​epda 2019)。此外,查询结果的数量增加了7.7%,正确查询的数量增加了9.5%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号