【24h】

Challenges in Data-to-Document Generation

机译:数据到文档生成中的挑战

获取原文

摘要

Recent neural models have shown significant progress on the problem of generating short descriptive texts conditioned on a small number of database records. In this work, we suggest a slightly more difficult data-to-text generation task, and investigate how effective current approaches are on this task. In particular, we introduce a new, large-scale corpus of data records paired with descriptive documents, propose a series of extractive evaluation methods for analyzing performance, and obtain baseline results using current neural generation methods. Experiments show that these models produce fluent text, but fail to convincingly approximate human-generated documents. Moreover, even templated baselines exceed the performance of these neural models on some metrics, though copy- and reconstruction-based extensions lead to noticeable improvements.
机译:最近的神经模型在生成以少量数据库记录为条件的简短描述性文本的问题上已显示出重大进展。在这项工作中,我们建议执行一个稍微困难的数据到文本生成任务,并研究当前有效的方法在此任务上的有效性。特别是,我们引入了一个新的大规模数据记录语料库与描述性文档配对,提出了一系列用于评估性能的提取评估方法,并使用当前的神经生成方法获得了基线结果。实验表明,这些模型可产生流畅的文本,但无法令人信服地近似人类生成的文档。此外,即使基于模板的基准在某些指标上也超过了这些神经模型的性能,尽管基于复制和重构的扩展导致了明显的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号