
A Diversity-Promoting Objective Function for Neural Conversation Models



Abstract

Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., I don't know) regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message), is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.
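The contrast the abstract draws can be written out as a short sketch in standard notation; the weighted form with a hyperparameter λ is an assumed parameterization for illustration, not a quotation of the paper's exact objective. Letting S denote the input message and T a candidate response:

\[
\hat{T}_{\mathrm{MLE}} = \arg\max_{T} \; \log p(T \mid S)
\]
\[
\hat{T}_{\mathrm{MMI}} = \arg\max_{T} \; \log \frac{p(S, T)}{p(S)\,p(T)}
                       = \arg\max_{T} \; \bigl\{ \log p(T \mid S) - \log p(T) \bigr\}
\]
\[
\text{(weighted variant, assumed form:)}\quad
\hat{T} = \arg\max_{T} \; \bigl\{ \log p(T \mid S) - \lambda \log p(T) \bigr\}
\]

Subtracting the log prior \(\log p(T)\) penalizes responses that are probable under almost any input (e.g., "I don't know"), which is the sense in which the MMI criterion promotes diversity relative to plain maximum likelihood.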
