De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

机译：通过因果干预远离偏见远离命名实体识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Distant supervision tackles the data bottleneck in NER by automatically generating training instances via dictionary matching. Unfortunately, the learning of DS-NER is severely dictionary-biased, which suffers from spurious correlations and therefore undermines the effectiveness and the robustness of the learned models. In this paper, we fundamentally explain the dictionary bias via a Structural Causal Model (SCM), categorize the bias into intra-dictionary and inter-dictionary biases, and identify their causes. Based on the SCM. we learn de-biased DS-NER via causal interventions. For intra-dictionary bias, we conduct backdoor adjustment to remove the spurious correlations introduced by the dictionary confounder. For inter-dictionary bias, we propose a causal invariance regularizer which will make DS-NER models more robust to the perturbation of dictionaries. Experiments on four datasets and three DS-NER models show that our method can significantly improve the performance of DS-NER.

机译：遥远监督通过通过字典匹配自动生成培训实例来解决ner中的数据瓶颈。不幸的是，DS-ner的学习是严重的字典偏见的，这遭受了虚假的相关性，因此破坏了学习模型的有效性和鲁棒性。在本文中，我们从根本上通过结构因果模型（SCM）来解释字典偏差，将偏差分类为字典内和词典偏差，并识别其原因。基于SCM。我们通过因果干预措施学习De-Biased DS-ner。对于字典偏差，我们进行后门调整以消除字典混杂器引入的杂散相关性。对于字典界偏见，我们提出了一个因果不变规范器，它将使DS-NER模型更加强大地对词典的扰动。四个数据集和三个DS-NER模型的实验表明，我们的方法可以显着提高DS-NER的性能。

著录项

来源
《International Joint Conference on Natural Language Processing;Annual Meeting of the Association for Computational Linguistics》|2021年|4803-4813|共11页
会议地点
作者
Wenkai Zhang; Hongyu Lin; Xianpei Han; Le Sun;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Named entity recognition: a semi-supervised learning approach [J] . H. Sintayehu, G. S. Lehal International Journal of Information Technology . 2021,第4期

机译：命名实体识别：半监督学习方法
2. Biomedical Named Entity Recognition Based on Self-supervised Deep Belief Network [J] . ZHANG Yajun, LIU Zongtian, ZHOU Wen 电子学报（英文版） . 2020,第003期

机译：基于自我监督的深度信仰网络的生物医学命名实体识别
3. Learning to select pseudo labels:a semi-supervised method for named entity recognition [J] . Zhen-zhen LI, Da-wei FENG, Dong-sheng LI, 浙江大学学报（英文版）（C辑：计算机与电子） . 2020,第006期

机译：学习选择伪标签：一个用于命名实体识别的半监督方法
4. Distantly Supervised Named Entity Recognition with Spy-PU Algorithm [C] . Honghao Zheng, Hongtao Yu, Yinuo Hao, International Conference on Pattern Recognition and Machine Learning . 2021

机译：通过SPY-PU算法远处监督命名实体识别
5. Semi-supervised Named Entity Recognition: Learning to recognize 100 entity types with little supervision [D] . Nadeau, David. 2007

机译：半监督的命名实体识别：在很少的监督下学习识别100种实体类型
6. A Weakly-Supervised Named Entity Recognition Machine Learning Approach for Emergency Medical Services Clinical Audit [O] . Han Wang, Wesley Lok Kin Yeung, Qin Xiang Ng, 2021

机译：紧急医疗服务临床审计的弱监督名为实体识别机器学习方法
7. Improving Distantly-Supervised Named Entity Recognition for Traditional Chinese Medicine Text via a Novel Back-Labeling Approach [O] . Dezheng Zhang, Chao Xia, Cong Xu, 2020

机译：通过新的背标方法改善传统中医文本的远端监督的名称实体识别

De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

摘要

著录项

相似文献

相关主题

期刊订阅