【24h】

De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

机译:通过因果干预远离偏见远离命名实体识别

获取原文

摘要

Distant supervision tackles the data bottleneck in NER by automatically generating training instances via dictionary matching. Unfortunately, the learning of DS-NER is severely dictionary-biased, which suffers from spurious correlations and therefore undermines the effectiveness and the robustness of the learned models. In this paper, we fundamentally explain the dictionary bias via a Structural Causal Model (SCM), categorize the bias into intra-dictionary and inter-dictionary biases, and identify their causes. Based on the SCM. we learn de-biased DS-NER via causal interventions. For intra-dictionary bias, we conduct backdoor adjustment to remove the spurious correlations introduced by the dictionary confounder. For inter-dictionary bias, we propose a causal invariance regularizer which will make DS-NER models more robust to the perturbation of dictionaries. Experiments on four datasets and three DS-NER models show that our method can significantly improve the performance of DS-NER.
机译:遥远监督通过通过字典匹配自动生成培训实例来解决ner中的数据瓶颈。 不幸的是,DS-ner的学习是严重的字典偏见的,这遭受了虚假的相关性,因此破坏了学习模型的有效性和鲁棒性。 在本文中,我们从根本上通过结构因果模型(SCM)来解释字典偏差,将偏差分类为字典内和词典偏差,并识别其原因。 基于SCM。 我们通过因果干预措施学习De-Biased DS-ner。 对于字典偏差,我们进行后门调整以消除字典混杂器引入的杂散相关性。 对于字典界偏见,我们提出了一个因果不变规范器,它将使DS-NER模型更加强大地对词典的扰动。 四个数据集和三个DS-NER模型的实验表明,我们的方法可以显着提高DS-NER的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号