首页> 外文期刊>BioMedical Engineering OnLine >Comparison of named entity recognition methodologies in biomedical documents
【24h】

Comparison of named entity recognition methodologies in biomedical documents

机译:生物医学文献中命名实体识别方法的比较

获取原文
           

摘要

Biomedical named entity recognition (Bio-NER) is a fundamental task in handling biomedical text terms, such as RNA, protein, cell type, cell line, and DNA. Bio-NER is one of the most elementary and core tasks in biomedical knowledge discovery from texts. The system described here is developed by using the BioNLP/NLPBA 2004 shared task. Experiments are conducted on a training and evaluation set provided by the task organizers. Our results show that, compared with a baseline having a 70.09% F1 score, the RNN Jordan- and Elman-type algorithms have F1 scores of approximately 60.53% and 58.80%, respectively. When we use CRF as a machine learning algorithm, CCA, GloVe, and Word2Vec have F1 scores of 72.73%, 72.74%, and 72.82%, respectively. By using the word embedding constructed through the unsupervised learning, the time and cost required to construct the learning data can be saved.
机译:生物医学命名实体识别(Bio-NER)是处理生物医学文本术语(如RNA,蛋白质,细胞类型,细胞系和DNA)的一项基本任务。从文本中发现生物医学知识是生物医学知识中最基本和最核心的任务之一。此处描述的系统是通过使用BioNLP / NLPBA 2004共享任务开发的。在任务组织者提供的培训和评估集上进行实验。我们的结果表明,与F1得分为70.09%的基线相比,RNN Jordan和Elman型算法的F1得分分别约为60.53%和58.80%。当我们使用CRF作为机器学习算法时,CCA,GloVe和Word2Vec的F1分数分别为72.73%,72.74%和72.82%。通过使用通过无监督学习构建的单词嵌入,可以节省构建学习数据所需的时间和成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号