Refining Data for Text Generation

机译：文本生成的炼油数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent work on data-to-text generation has made progress under the neural encoder-decoder architectures. However, the data input size is often enormous, while not all data records are important for text generation and inappropriate input may bring noise into the final output. To solve this problem, we propose a two-step approach which first selects and orders the important data records and then generates text from the noise-reduced data. Here we propose a learning to rank model to rank the importance of each record which is supervised by a relation extractor. With the noise-reduced data as input, we implement a text generator which sequentially models the input data records and emits a summary. Experiments on the ROTOWIRE dataset verifies the effectiveness of our proposed method in both performance and efficiency.

机译：最近关于数据到文本生成的工作已经在神经编码器解码器架构下取得了进展。但是，数据输入大小通常是巨大的，而不是所有数据记录对于文本生成很重要，并且不适当的输入可以将噪声带入最终输出。为了解决这个问题，我们提出了一种两步方法，首先选择和命令重要的数据记录，然后从噪声减少的数据生成文本。在这里，我们提出了一个学习来排名模型，以对由关系提取器监督的每个记录的重要性。利用噪声减少数据作为输入，我们实现了文本生成器，该文本生成器顺序地模拟了输入数据记录并发出摘要。 RotoWire数据集上的实验验证了我们提出的方法在性能和效率方面的有效性。

著录项

来源
《China National Conference on Computational Linguistics》|2020年|482p|共14页
会议地点
作者
Qianying Liu; Tianyi Li; Wenyu Guan; Sujian Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Data-to-text generation; Sequence-to-sequence; Model efficiency;

机译：数据到文本生成;序列到序列;模型效率;

相似文献

外文文献
中文文献
专利

1. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data [J] . Kristina M Hettne, André Boorsma, Dorien A M van Dartel, BMC Medical Genomics . 2013,第1期

机译：用于解释基因表达数据的下一代文本挖掘介导的化学反应特异性基因集的产生
2. Neural data-to-text generation with dynamic content planning [J] . Chen Kai, Li Fayuan, Hu Baotian, Knowledge-Based Systems . 2021,第Mara5期

机译：具有动态内容规划的神经数据到文本生成
3. Narrative context-based data-to-text generation for ambient intelligence [J] . Journal of ambient intelligence and humanized computing . 2020,第4期

机译：基于叙事上下文的数据到文本生成，用于环境智能
4. Refining Data for Text Generation [C] . Qianying Liu, Tianyi Li, Wenyu Guan, China National Conference on Computational Linguistics . 2020

机译：文本生成的精炼数据
5. Automated generation of metadata for mining image and text data. [D] . Al-Shameri, Faleh Jassem. 2006

机译：自动生成用于挖掘图像和文本数据的元数据。
6. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data [O] . Kristina M Hettne, André Boorsma, Dorien A M van Dartel, 2013

机译：用于解释基因表达数据的下一代文本挖掘介导的化学反应特异性基因集的产生
7. Refining Instructional Text Generation after Evaluation [O] . Fiorella De Rosis, Floriana Grasso, Dianne C. Berry 1999

机译：评估后完善教学文本生成

Refining Data for Text Generation

摘要

著录项

相似文献

相关主题

期刊订阅