首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Step-by-Stcp: Separating Planning from Realization in Neural Data-to-Text Generation
【24h】

Step-by-Stcp: Separating Planning from Realization in Neural Data-to-Text Generation

机译:Step-by-stcp:从神经数据到文本生成中的实现分离规划

获取原文

摘要

Data-to-text generation can be conceptually divided into two parts: ordering and structuring the information (planning), and generating fluent language describing the information (realization). Modern neural generation systems conflate these two steps into a single end-to-end differentiable system. We propose to split the generation process into a symbolic text-planning stage that is faithful to the input, followed by a neural generation stage that focuses only on realization. For training a plan-to-text generator, we present a method for matching reference texts to their corresponding text plans. For inference time, we describe a method for selecting high-quality text plans for new inputs. We implement and evaluate our approach on the WebNLG benchmark. Our results demonstrate that decoupling text planning from neural realization indeed improves the system's reliability and adequacy while maintaining fluent output. We observe improvements both in BLEU scores and in manual evaluations. Another benefit of our approach is the ability to output diverse realizations of the same input, paving the way to explicit control over the generated text structure.
机译:数据到文本生成可以概念上分为两部分:排序和构建信息(规划),并生成描述信息(实现)的流利语言。现代神经发电系统将这两个步骤与单一端到端可微分系统混合。我们建议将生成过程分成符号的文本规划阶段,忠于输入,其次是一个神经发电阶段,只关注实现。对于培训计划到文本生成器,我们提供了一种将参考文本与其相应的文本计划匹配的方法。为了推理时间,我们描述了一种选择用于新输入的高质量文本计划的方法。我们在WebNLG基准测试中实现和评估我们的方法。我们的结果表明,神经意识中的解耦文本规划确实可以提高系统的可靠性和充分性,同时保持流畅的输出。我们观察BLEU分数和手动评估中的改进。我们的方法的另一个好处是能够输出相同输入的多样化实现,铺平了对生成的文本结构的显式控制方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号