Journal: Expert systems: The international journal of knowledge engineering

How interesting and coherent are the stories generated by a large-scale neural language model? Comparing human and automatic evaluations of machine-generated text

Abstract

Evaluation of the narrative text generated by machines has traditionally been a challenge, particularly when attempting to evaluate subjective elements such as interest or believability. Recent improvements in narrative machine text generation have been largely driven by the emergence of transformer-based language models, trained on massive quantities of data, resulting in higher quality text generation. In this study, a corpus of stories is generated using the pre-trained GPT-Neo transformer model, with human-written prompts as inputs upon which to base the narrative text. The stories generated through this process are subsequently evaluated through both human evaluation and two automated metrics: BERTScore and BERT Next Sentence Prediction, with the aim of determining whether there is a correlation between the automatic scores and the human judgements. The results show variation in human evaluation results in comparison to modern automated metrics, suggesting further work is required to train automated metrics to identify text that is defined as interesting by humans.
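
The abstract gives no implementation details, so the following is only a rough sketch of how the described pipeline (GPT-Neo generation from a prompt, then BERTScore and BERT Next Sentence Prediction as automatic metrics) might be wired together using the Hugging Face transformers and bert-score libraries. The model size (EleutherAI/gpt-neo-1.3B), the decoding settings, the reference text, and the way per-sentence NSP scores are averaged are assumptions for illustration, not details taken from the paper.

# Illustrative sketch only: model choice, decoding parameters, and score
# aggregation are assumptions, not the paper's actual setup.
import torch
from transformers import pipeline, BertTokenizer, BertForNextSentencePrediction
from bert_score import score as bertscore

# 1. Generate a story from a human-written prompt with GPT-Neo.
generator = pipeline("text-generation", model="EleutherAI/gpt-neo-1.3B")
prompt = "The lighthouse keeper had not seen a ship in forty years."
story = generator(prompt, max_length=200, do_sample=True,
                  temperature=0.9)[0]["generated_text"]

# 2. BERTScore: compare the generated story against a reference text
#    (here a placeholder standing in for a human-written continuation).
reference = "A human-written continuation of the same prompt would go here."
P, R, F1 = bertscore([story], [reference], lang="en")
print(f"BERTScore F1: {F1.item():.3f}")

# 3. BERT Next Sentence Prediction: estimate local coherence by scoring how
#    likely each sentence is to follow the previous one, then averaging.
nsp_tok = BertTokenizer.from_pretrained("bert-base-uncased")
nsp_model = BertForNextSentencePrediction.from_pretrained("bert-base-uncased")
sentences = [s.strip() for s in story.split(".") if s.strip()]  # naive split
probs = []
for first, second in zip(sentences, sentences[1:]):
    enc = nsp_tok(first, second, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = nsp_model(**enc).logits
    # Index 0 of the NSP head is the "sentence B follows sentence A" class.
    probs.append(torch.softmax(logits, dim=1)[0, 0].item())
print(f"Mean NSP coherence: {sum(probs) / len(probs):.3f}")

In the study itself, scores like these would be computed over the whole generated corpus and then compared against the human ratings to test for the correlation the abstract describes.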
