首页> 外文会议>International symposium on integrative bioinformatics >IDPredictor: Predict Database Links in Biomedical Database
【24h】

IDPredictor: Predict Database Links in Biomedical Database

机译:iDpredictor:预测生物医学数据库中的数据库链接

获取原文

摘要

Knowledge found in biomedical databases, in particular in Web information systems, is a major bioinformatics resource. In general, this biological knowledge is worldwide rep-resented in a network of databases. These data is spread among thousands of databases, which overlap in content, but differ substantially with respect to content detail, interface, formats and data structure. To support a functional annotation of lab data, such as protein sequences, metabolites or DNA sequences as well as a semi-automated data exploration in information retrieval en-vironments, an integrated view to databases is essential. Search engines have the potential of assisting in data retrieval from these structured sources, but fall short of providing a comprehensive knowledge excerpt out of the interlinked databases. A prerequisit of sup-porting the concept of an integrated data view is to acquire insights into cross-references among database entities. This issue is being hampered by the fact, that only a fraction of all possible cross-references are explicitely tagged in the particular biomedical informations systems. In this work, we investigate to what extend an automated construction of an integrated data network is possible. We propose a method that predicts and extracts cross-references from multiple life science databases and possible referenced data targets. We study the retrieval quality of our method and report on first, promising results. The method is implemented as the tool IDPredictor, which is published under the DOI 10.5447/IPK/2012/4 and is freely available using the URL: http://dx.doi.org/10.5447/IPK72 012/4.
机译:知识在生物医学数据库中发现,特别是在网络信息系统,是一个重大的生物信息资源。在一般情况下,这种生物知识是全球REP-怨恨数据库的网络。这些数据是在成千上万的数据库,这在内容重叠,但基本上相对于内容的细节,接口,格式和数据结构不同传播。为了支持实验室数据,如蛋白质序列,代谢物或DNA序列的功能注释以及在信息检索烯vironments半自动化数据探索,综合视图,以数据库是必不可少的。搜索引擎在数据协助的潜力,从这些结构化的来源获取,但功亏一篑提供全面的知识摘录出来的相互关联的数据库中。 SUP-移植集成数据视图的概念的prerequisit是获取见解数据库实体之间的交叉引用。这个问题是由事实阻碍,只有所有可能的交叉引用的一小部分被明确地标记在特定的生物医学信息系统。在这项工作中,我们调查到什么扩展的综合数据网络的自动化建设是可行的。我们建议,预测和提取物的交叉引用来自多个生命科学数据库和可能引用的数据对象的方法。我们研究了第一,可喜的成果我们的方法和报告的检索质量。该方法被实现为工具IDPredictor,其下的DOI 10.5447 / IPK /四分之二千零十二出版,免费提供使用URL:http://dx.doi.org/10.5447/IPK72 012/4。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号