NLP Whack-A-Mole: Challenges in Cross-Domain Temporal Expression Extraction

机译：NLP WHACK-A-MOLE：跨域跨域时间表达提取的挑战

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Incorporating domain knowledge is vital in building successful natural language processing (NLP) applications. Many times, cross-domain application of a tool results in poor performance as the tool does not account for domain-specific attributes. The clinical domain is challenging in this aspect due to specialized medical terms and nomenclature, shorthand notation, fragmented text, and a variety of writing styles used by different medical units. Temporal resolution is an NLP task that, in general, is domain-agnostic because temporal information is represented using a limited lexicon. However, domain-specific aspects of temporal resolution are present in clinical texts. Here we explore parsing issues that arose when running our system, a tool built on Newswire text, on clinical notes in the THYME corpus. Many parsing issues were straightforward to correct; however, a few code changes resulted in a cascading series of parsing errors that had to be resolved before an improvement in performance was observed, revealing the complexity of temporal resolution and rule-based parsing. Our system now outperforms current state-of-the-art systems on the THYME corpus with little change in its performance on Newswire texts.

机译：结合领域知识是构建成功的自然语言处理（NLP）的应用至关重要。很多时候，一个工具，会导致性能差，因为工具不考虑特定领域的属性跨域应用。临床领域在这方面，由于专业的医学术语和命名，速记符号，零散的文本，以及各种书写不同的医疗单位使用的风格挑战。时间分辨率是NLP任务，在一般情况下，是域无关，因为时间信息是使用有限的词汇来表示。然而，时间分辨率的特定领域的方面存在于临床文本。这里，我们探讨解析运行我们的系统，建立在通社文本的工具，当在百里香语料库临床指出，出现的问题。许多分析问题有直接的正确;然而，一些代码修改导致了一个级联系列解析是有未观测到性能的改善之前必须解决的错误，揭示了时间分辨率和基于规则的分析的复杂性。我们的系统现在优于上在其上通社文章性能变化不大百里香语料库国家的最先进的当前系统。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2019年|xciii p. 3498-4195|共11页
会议地点
作者
Amy L. Olex; Luke G. Maffey; Bridget T. McInnes;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Family History Extraction From Synthetic Clinical Narratives Using Natural Language Processing: Overview and Evaluation of a Challenge Data Set and Solutions for the 2019 National NLP Clinical Challenges (n2c2)/Open Health Natural Language Processing (OHNLP) Competition [J] . Feichen Shen, Sijia Liu, Sunyang Fu, JMIR Medical Informatics . 2021,第1期

机译：使用自然语言处理的综合临床叙事的家庭历史提取：概述和评估2019年国家NLP临床挑战（N2C2）/开放式健康自然语言处理（OHNLP）竞争的挑战数据集和解决方案
2. à la recherche du temps perdu: Extracting temporal relations from medical text in the 2012 i2b2 NLP challenge [J] . CherryC., ZhuX., MartinJ., Journal of the American Medical Informatics Association : . 2013,第5期

机译：寻找丢失的时间：在2012 i2b2 NLP挑战中从医学文本中提取时间关系
3. Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives [J] . Kova?evi?A., DehghanA., FilanninoM., Journal of the American Medical Informatics Association : . 2013,第5期

机译：结合规则和机器学习以从临床叙事中提取时间表达和事件
4. NLP Whack-A-Mole: Challenges in Cross-Domain Temporal Expression Extraction [C] . Amy L. Olex, Luke G. Maffey, Bridget T. McInnes Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2019

机译：NLP Whack-A-Mole：跨域时间表达提取中的挑战
5. From temporal expressions to symptom onset date identification in emergency department notes - a temporal information extraction process . [D] . Mahalingam, Deepika. 2011

机译：从时间表达到急诊科症状发作日期的识别-时间信息的提取过程。
6. À la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge [O] . Colin Cherry, Xiaodan Zhu, Joel Martin, 2013

机译：寻找失落的时间：在2012 i2b2 NLP挑战中从医学文本中提取时间关系
7. KULeuven-LIIR at SemEval-2017 task 12: Cross-domain temporal information extraction from clinical records [O] . Leeuwenberg Tuur, Moens Marie-Francine 2017

机译：KULeuven-LIIR在SemEval-2017任务12：从临床记录中提取跨域时间信息

NLP Whack-A-Mole: Challenges in Cross-Domain Temporal Expression Extraction

摘要

著录项

相似文献

相关主题

期刊订阅