Probing for Bridging Inference in Transformer Language Models

Abstract

We probe pre-trained transformer language models for bridging inference. We first investigate individual attention heads in BERT and observe that attention heads at higher layers focus on bridging relations more prominently than those at the lower and middle layers; in addition, a few specific attention heads concentrate consistently on bridging. More importantly, our second approach considers the language model as a whole: bridging anaphora resolution is formulated as a masked token prediction task (Of-Cloze test). This formulation produces promising results without any fine-tuning, which indicates that pre-trained language models substantially capture bridging inference. Our further investigation shows that the distance between anaphor and antecedent, and the context provided to the language model, play an important role in the inference.
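As an illustration of the Of-Cloze formulation, the sketch below ranks candidate antecedents by how well a pre-trained masked language model fills an "of [MASK]" slot after the anaphor. This is a minimal sketch using the HuggingFace transformers library, not the authors' code; the example sentence, the candidate list, and the single-token scoring are illustrative assumptions.

# Of-Cloze sketch: rephrase the bridging anaphor ("the door") as
# "the door of [MASK]" and rank candidate antecedents by the model's
# logit at the masked position. Example text and candidates are
# hypothetical; candidates are assumed to be single tokens in the vocab.
import torch
from transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

text = "I walked into the room. The door of [MASK] was open."
candidates = ["room", "house", "car"]

inputs = tokenizer(text, return_tensors="pt")
# Locate the single [MASK] token in the input sequence.
mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0].item()

with torch.no_grad():
    logits = model(**inputs).logits[0, mask_pos]

# Score each candidate antecedent by its logit at the masked position.
scores = {c: logits[tokenizer.convert_tokens_to_ids(c)].item() for c in candidates}
print(max(scores, key=scores.get))  # highest-scoring candidate, e.g. "room"

Note that this probe requires no fine-tuning: the ranking comes entirely from the pre-trained model's fill-in probabilities, which is what lets the Of-Cloze test measure how much bridging inference the model already captures.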