首页> 外文期刊>Journal of Bioinformatics and Computational Biology >IMPROVING THE INTER-CORPORA COMPATIBILITY FOR PROTEIN ANNOTATIONS
【24h】

IMPROVING THE INTER-CORPORA COMPATIBILITY FOR PROTEIN ANNOTATIONS

机译:改善公司间蛋白注释的兼容性

获取原文
获取原文并翻译 | 示例
           

摘要

Although there are several corpora with protein annotation, incompatibility between the annotations in different corpora remains a problem that hinders the progress of automatic recognition of protein names in biomedical literature. Here, we report on our efforts to find a solution to the incompatibility issue, and to improve the compatibility between two representative protein-annotated corpora: the GENIA corpus and the GENETAG corpus. In a comparative study, we improve our insight into the two corpora, and a series of experimental results show that most of the incompatibility can be removed.
机译:尽管有多个带有蛋白质注释的语料库,但是不同语料库中注释之间的不兼容性仍然是一个问题,阻碍了生物医学文献中蛋白质名称自动识别的进展。在这里,我们报告了我们的工作,以寻找解决不兼容问题的方法,并改善两个具有代表性的带有蛋白质注释的语料库:GENIA语料库和GENETAG语料库。在一项比较研究中,我们提高了对这两个语料库的了解,一系列实验结果表明可以消除大多数不兼容的情况。

著录项

  • 来源
  • 作者单位

    YUE WANG Corresponding author.Department of Computer Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-0033, Japanwangyue@is.s.u-tokyo.ac.jp JIN-DONG KIM Database Center for Life Science, Research Organization of Information and Systems, 2-11-16 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japanjdkim@dbcls.rois.ac.jp RUNE SÆTRE Department of Computer Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-0033, Japanrune.saetre@is.s.u-tokyo.ac.jp SAMPO PYYSALO Department of Computer Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-0033, Japansmp@is.s.u-tokyo.ac.jp TOMOKO OHTA Department of Computer Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-0033, Japanokap@is.s.u-tokyo.ac.jp JUN'ICHI TSUJII Department of Computer Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 113-0033, JapanSchool of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, United KingdomNational Center for Text Mining, Manchester Interdisciplinary Biocentre, 131 Princess Street, Manchester, M1 7DN, United Kingdomtsujii@is.s.u-tokyo.ac.jp;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Corpus; named entity recognition; protein annotation.;

    机译:语料库;命名实体识别;蛋白质注释。;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号