首页> 外文会议>IEEE Symposium Series on Computational Intelligence >PHIs (Protected Health Information) identification from free text clinical records based on machine learning
【24h】

PHIs (Protected Health Information) identification from free text clinical records based on machine learning

机译:基于机器学习的自由文本临床记录中的PHI(受保护的健康信息)识别

获取原文

摘要

To preserve patient confidentiality, there is a need to identify PHIs (Protected Health Information) from free text text clinical records, and such sensitive information must either be removed or replaced. Identification of the PHI's are normally performed manually on large sets of structured EHR databases, which is time-consuming, prohibitively expensive and error-prone. Hence, methods for automatic or semi-automatic identification of personal health information are of significant scientific and commercial interest. In this paper, we propose an innovative computational framework based on novel text mining and machine learning algorithms for automatic identification of PHIs from massive, unstructured free text clinical records, discharge summaries and other care documents. The experimental evaluation of the proposed algorithmic framework development, for several publicly available i2b2 challenge datasets from Informatics for Integrating Biology & the Bedside (i2b2) shared tasks, has shown promising outcomes.
机译:为了保护患者的机密性,需要从自由文本文本临床记录中识别出PHI(受保护的健康信息),并且必须删除或替换此类敏感信息。 PHI的识别通常是在大量结构化EHR数据库上手动执行的,这非常耗时,费用过高且容易出错。因此,用于自动或半自动识别个人健康信息的方法具有重大的科学和商业意义。在本文中,我们提出了一种基于新颖的文本挖掘和机器学习算法的创新计算框架,用于从大量,非结构化的自由文本临床记录,出院摘要和其他护理文档中自动识别PHI。对来自整合生物学与床边(i2b2)共享任务的信息学中的几个可公开获得的i2b2挑战数据集的拟议算法框架开发的实验评估显示出了可喜的成果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号