International Conference on Knowledge Management; October 27-28, 2005; North Carolina (US)

AUTOMATIC UNSUPERVISED KEYPHRASE-BASED QUERY EXPANSION FOR BIOMEDICAL DOMAIN

Abstract

This paper introduces DocSpotter, an automatic querying technique, and shows that it is an efficient tool for identifying the set of documents from which a pre-defined relation can be extracted. DocSpotter is designed to retrieve documents that match the initial query more precisely by expanding the query each time it repeats the keyphrase extraction process over a given database. For query expansion, it combines keyphrase extraction with references to ontologies. We report two sets of experiments evaluating DocSpotter on the task of protein-protein interaction extraction. In the first, DocSpotter retrieved progressively more MEDLINE documents containing protein-protein pairs as it repeated the keyphrase extraction process. In the second, DocSpotter was compared with SLIPPER, a supervised rule-based query expansion technique; DocSpotter outperformed SLIPPER in accuracy in every iteration, by margins ranging from 17.90% to 29.98%.
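The iterative loop described in the abstract (retrieve, extract keyphrases, expand the query, repeat) can be illustrated with a minimal sketch. This is not DocSpotter's implementation: the abstract does not specify the keyphrase extractor or how ontologies are consulted, so the sketch assumes a simple document-frequency ranking as a stand-in for keyphrase extraction and omits the ontology step entirely. The corpus, the stopword list, and all function names (`search`, `top_keyphrases`, `expand_query`) are hypothetical.

```python
import re
from collections import Counter

# Toy stand-in for a literature database; the real system queries MEDLINE.
CORPUS = {
    "d1": "protein kinase binds receptor",
    "d2": "kinase interacts with receptor complex",
    "d3": "receptor complex phosphorylates substrate",
    "d4": "weather report for tuesday",
}

# Assumed stopword list, for illustration only.
STOPWORDS = {"with", "for", "the", "a", "of"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def search(corpus, query_terms):
    """Return ids of documents that contain at least one query term."""
    return [
        doc_id
        for doc_id, text in corpus.items()
        if query_terms & set(tokenize(text))
    ]


def top_keyphrases(corpus, doc_ids, exclude, k):
    """Rank candidate terms by how many retrieved documents contain them.

    Assumption: document frequency stands in for a real keyphrase extractor.
    Ties are broken alphabetically so the result is deterministic.
    """
    counts = Counter()
    for doc_id in doc_ids:
        for term in set(tokenize(corpus[doc_id])):
            if term not in STOPWORDS and term not in exclude:
                counts[term] += 1
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:k]]


def expand_query(corpus, seed_terms, iterations=3, k=2):
    """Repeat retrieve -> extract -> expand, recording the hits per round."""
    query = set(seed_terms)
    history = []
    for _ in range(iterations):
        hits = search(corpus, query)
        history.append(sorted(hits))
        query |= set(top_keyphrases(corpus, hits, query, k))
    return query, history


final_query, history = expand_query(CORPUS, {"protein"}, iterations=3, k=2)
for round_number, hits in enumerate(history, start=1):
    print(round_number, hits)
```

On this toy corpus the set of retrieved documents grows with each round while the unrelated document is never matched, which mirrors the behaviour the abstract reports only qualitatively. A realistic version would need a proper keyphrase extractor and a precision check, since naive expansion can also pull in irrelevant documents.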