...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >RECOGNISING DISCOURSE CAUSALITY TRIGGERS IN THE BIOMEDICAL DOMAIN
【24h】

RECOGNISING DISCOURSE CAUSALITY TRIGGERS IN THE BIOMEDICAL DOMAIN

机译:识别生物医学领域的因果关系诱因

获取原文
获取原文并翻译 | 示例
           

摘要

Current domain-specific information extraction systems represent an important resource for biomedical researchers, who need to process vast amounts of knowledge in a short time. Automatic discourse causality recognition can further reduce their workload by suggesting possible causal connections and aiding in the curation of pathway models. We describe here an approach to the automatic identification of discourse causality triggers in the biomedical domain using machine learning. We create several baselines and experiment with and compare various parameter settings for three algorithms, i.e. Conditional Random Fields (CRF), Support Vector Machines (SVM) and Random Forests (RF). We also evaluate the impact of lexical, syntactic, and semantic features on each of the algorithms, showing that semantics improves the performance in all cases. We test our comprehensive feature set on two corpora containing gold standard annotations of causal relations, and demonstrate the need for more gold standard data. The best performance of 79.35% F-score is achieved by CRFs when using all three feature types.
机译:当前的特定领域信息提取系统为生物医学研究人员提供了重要资源,他们需要在短时间内处理大量知识。自动话语因果关系识别可以通过建议可能的因果关系并协助策划路径模型来进一步减轻其工作量。我们在这里描述一种使用机器学习在生物医学领域自动识别话语因果关系触发因素的方法。我们创建了多个基准并针对三种算法(即条件随机字段(CRF),支持向量机(SVM)和随机森林(RF))对各种参数设置进行了实验和比较。我们还评估了词法,句法和语义特征对每种算法的影响,表明语义在所有情况下都能提高性能。我们在两个包含因果关系的黄金标准注释的语料库上测试我们的综合功能集,并证明需要更多的黄金标准数据。使用所有三种功能类型时,CRF可获得79.35%F分数的最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号