Workshop on Commonsense Inference in Natural Language Processing

When Choosing Plausible Alternatives, Clever Hans can be Clever



Abstract

Pretrained language models, such as BERT and RoBERTa, have shown large improvements in the commonsense reasoning benchmark COPA. However, recent work found that many improvements in benchmarks of natural language understanding are not due to models learning the task, but due to their increasing ability to exploit superficial cues, such as tokens that occur more often in the correct answer than in the wrong one. Is BERT's and RoBERTa's good performance on COPA also caused by this? We find superficial cues in COPA, as well as evidence that BERT exploits these cues. To remedy this problem, we introduce Balanced COPA, an extension of COPA that does not suffer from easy-to-exploit single-token cues. We analyze BERT's and RoBERTa's performance on original and Balanced COPA, finding that BERT relies on superficial cues when they are present, but still achieves comparable performance once they are made ineffective, suggesting that BERT learns the task to a certain degree when forced to. In contrast, RoBERTa does not appear to rely on superficial cues.
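To make the notion of a single-token superficial cue concrete, here is a minimal sketch of the kind of cue statistic the abstract describes: for each token, count how often it appears in exactly one of the two alternatives (its applicability) and, among those cases, how often it sides with the correct answer (its productivity). This is an illustrative reconstruction, not the paper's code; the example format with `choice1`, `choice2`, and a 0/1 `label` field is an assumption.

```python
from collections import Counter

def cue_statistics(examples):
    """Estimate single-token superficial cues in a COPA-style dataset.

    Assumes each example is a dict with 'choice1', 'choice2' (the two
    alternatives) and 'label' (0 or 1, the correct alternative); these
    field names are hypothetical. Tokens with high applicability and
    productivity far from 0.5 are exploitable cues.
    """
    applicable = Counter()  # examples where the token splits the alternatives
    in_correct = Counter()  # ... and appears in the correct one
    for ex in examples:
        t1 = set(ex["choice1"].lower().split())
        t2 = set(ex["choice2"].lower().split())
        correct = t1 if ex["label"] == 0 else t2
        for tok in t1 ^ t2:  # tokens in exactly one alternative
            applicable[tok] += 1
            if tok in correct:
                in_correct[tok] += 1
    return {tok: (applicable[tok], in_correct[tok] / applicable[tok])
            for tok in applicable}

# Usage: rank tokens by how often they distinguish the alternatives,
# then inspect whether their productivity deviates from chance (0.5).
examples = [{"choice1": "the sun rose", "choice2": "it got dark", "label": 0}]
stats = cue_statistics(examples)
cues = sorted(stats.items(), key=lambda kv: kv[1][0], reverse=True)
```

A balanced dataset, in this framing, is one where such productivities sit near 0.5, so that no single token predicts the correct alternative.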
