首页> 美国卫生研究院文献>Database: The Journal of Biological Databases and Curation >Wide-scope biomedical named entity recognition and normalization with CRFs fuzzy matching and character level modeling
【2h】

Wide-scope biomedical named entity recognition and normalization with CRFs fuzzy matching and character level modeling

机译:具有CRF模糊匹配和字符级建模的宽范围生物医学命名实体识别和归一化

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present a system for automatically identifying a multitude of biomedical entities from the literature. This work is based on our previous efforts in the BioCreative VI: Interactive Bio-ID Assignment shared task in which our system demonstrated state-of-the-art performance with the highest achieved results in named entity recognition. In this paper we describe the original conditional random field-based system used in the shared task as well as experiments conducted since, including better hyperparameter tuning and character level modeling, which led to further performance improvements. For normalizing the mentions into unique identifiers we use fuzzy character n-gram matching. The normalization approach has also been improved with a better abbreviation resolution method and stricter guideline compliance resulting in vastly improved results for various entity types. All tools and models used for both named entity recognition and normalization are publicly available under open license.
机译:我们提出了一种从文献中自动识别多种生物医学实体的系统。这项工作是基于我们先前在BioCreative VI:交互式Bio-ID分配共享任务中所做的工作,在该任务中,我们的系统展示了最先进的性能,并在命名实体识别中取得了最高的成果。在本文中,我们描述了共享任务中使用的原始的基于条件随机场的系统以及此后进行的实验,包括更好的超参数调整和字符级建模,从而进一步提高了性能。为了将提及归一化为唯一标识符,我们使用模糊字符n元语法匹配。还使用更好的缩写解析方法和更严格的准则合规性对归一化方法进行了改进,从而大大改善了各种实体类型的结果。所有用于命名实体识别和规范化的工具和模型都可以在公开许可下公开获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号