Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability

机译：矢量可以比专家更好地阅读思想吗？比较儿童思维能力自动评分的数据增强策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we implement and compare 7 different data augmentation strategies for the task of automatic scoring of children's ability to understand others' thoughts, feelings, and desires (or "mindreading"). We recruit in-domain experts to re-annotate augmented samples and determine to what extent each strategy preserves the original rating. We also carry out multiple experiments to measure how much each augmentation strategy improves the performance of automatic scoring systems. To determine the capabilities of automatic systems to generalize to unseen data, we create UK-MIND-20 - a new corpus of children's performance on tests of mindreading, consisting of 10,320 question-answer pairs. We obtain a new state-of-the-art performance on the MIND-CA corpus, improving macro-Fl-score by 6 points. Results indicate that both the number of training examples and the quality of the augmentation strategies affect the performance of the systems. The task-specific augmentations generally outperform task-agnostic augmentations. Automatic augmentations based on vectors (GloVe, FastText) perform the worst. We find that systems trained on MIND-CA generalize well to UK-MIND-20. We demonstrate that data augmentation strategies also improve the performance on unseen data.

机译：在本文中，我们实施并比较了7个不同的数据增强策略，以便为儿童了解其他人的思想，感受和欲望（或“Mindreading”）的自动评分任务。我们招募域名专家以重新注释增强样本，并确定各种策略保留原始评级的程度。我们还对多次实验进行了多次实验，以衡量每个增强策略提高自动评分系统性能的程度。为了确定自动系统的能力概括到未看见的数据，我们创建了英国思维 - 20 - 一种新的儿童表现的思维表现，包括10,320个问题答案对。我们在Mind-CA语料库中获得了新的最先进的性能，提高了6分的宏观分数。结果表明，增强策略的训练示例的数量和增强策略的质量影响了系统的性能。任务特定的增强通常优于任务不可知的增强。基于矢量（手套，FastText）的自动增强表现最差。我们发现在Mind-CA上培训的系统概括为英国思维 - 20。我们证明数据增强策略还提高了看不见数据的性能。

著录项

来源
《Annual Meeting of the Association for Computational Linguistics;International Joint Conference on natural Language Processing》|2021年|1196-1206|共11页
会议地点
作者
Venelin Kovatchev; Phillip Smith; Mark Lee; Rory Devine;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Constructing minds: The development of mindreading abilities in typical and atypical trajectories [J] . Ruth Campos, Maria Sotillo Behavioral and Brain Sciences . 2008,第3期

机译：构筑思维：典型和非典型轨迹中思维能力的发展
2. Reply: Comparing Round and Anatomically Shaped Implants in Augmentation Mammaplasty: The Experts' Ability to Differentiate the Type of Implant [J] . Rubi Carlos G., Leache Elena, Angel Lozano Jose, Plastic and reconstructive surgery . 2017,第4期

机译：答复：比较圆形和解剖学植入物在增强乳腺术中的植入物：专家们分辨植入物类型的能力
3. Comparing Round and Anatomically Shaped Implants in Augmentation Mammaplasty: The Experts' Ability to Differentiate the Type of Implant [J] . Cheema Mubashir, Montemurro Paolo, Heden Per Plastic and reconstructive surgery . 2017,第4期

机译：比较增强哺乳动物术中的圆形和解剖学植入物：专家们区分植入物类型的能力
4. 'What is on your mind?' Automated Scoring of Mindreading in Childhood and Early Adolescence [C] . Venelin Kovatchev, Phillip Smith, Mark Lee, International Conference on Computational Linguistics . 2020

机译：'你在想什么？' 童年和早期青春期的心态自动评分
5. Interrater reliability and scoring accuracy of the Rorschach comprehensive system: Comparing students and experts at the variable, response, protocol, and interpretation level. [D] . Burke, Leo J., III. 2011

机译：Rorschach综合系统的评估者间可靠性和评分准确性：在变量，响应，协议和解释级别对学生和专家进行比较。
6. A Comparison of Childrens Ability to Read Childrens and Adults Mental States in an Adaptation of the Reading the Mind in the Eyes Task [O] . Anna van der Meulen, Simone Roerig, Doret de Ruyter, -1

机译：适应儿童阅读心智任务中儿童阅读儿童和成人心理状态的能力比较
7. “What is on your mind?” Automated Scoring of Mindreading in Childhood and Early Adolescence [O] . Venelin Kovatchev, Phillip Smith, Mark Lee, 2020

机译：“你在想什么？”童年和早期青春期的心态自动评分

Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability

摘要

著录项

相似文献

相关主题

期刊订阅