首页> 外文会议>IEEE International Conference on Healthcare Informatics >Identifying Disease Definitions with a Correlation Kernel for Symptom Extractions from Text
【24h】

Identifying Disease Definitions with a Correlation Kernel for Symptom Extractions from Text

机译:用相关核来鉴定疾病定义,用于从文本中提取症状提取

获取原文

摘要

Since most health-related knowledge is created by experts, it is not easy for general public to access, understand, and utilize such knowledge in daily living. It would be most convenient and useful to a healthcare knowledge base that a user can easily start exploring from symptoms and arrive at candidate diseases and eventually obtain knowledge for treatment and prevention. We have embarked on a project whose goal is to build such a healthcare knowledge base from text by using natural language processing and text mining techniques. This paper focuses on how definition sentences can be detected and describes a method of ranking sentences based on the degree to which they contain definitions of diseases, which should contain symptom information. While our work is basically to build a classifier that identifies definition sentences, the main contribution lies in the development of a new kernel method that utilizes correlations among different types of tokens. We evaluated our method to arrive at a conclusion that the proposed method can be very effective with a training data that is almost an order of magnitude smaller than the method of using dependency parser.
机译:由于大多数健康相关知识由专家创建,因此公众不容易访问,理解和利用日常生活中的这些知识。对于医疗保健知识库来说,用户可以轻松开始从症状探索并到达候选疾病并最终获得治疗和预防知识是最方便的。我们已经开始了一个项目,其目标是通过使用自然语言处理和文本挖掘技术从文本建立这样的医疗保健知识库。本文重点介绍如何检测句子,并根据它们包含疾病定义的程度来描述句子的方法,这应该包含症状信息。虽然我们的工作基本上建立了一个识别定义句子的分类器,但主要贡献在于开发利用不同类型令牌之间的相关性的新内核方法。我们评估了我们的方法,得出结论,即所提出的方法可以非常有效地对几乎比使用依赖解析器的方法小的数量级的训练数据非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号