首页> 外文会议>Annual meeting of the special interest group on discourse and dialogue >Not All Dialogues are Created Equal: Instance Weighting for Neural Conversational Models
【24h】

Not All Dialogues are Created Equal: Instance Weighting for Neural Conversational Models

机译:并非所有对话都相等:神经对话模型的实例权重

获取原文

摘要

Neural conversational models require substantial amounts of dialogue data to estimate their parameters and are therefore usually learned on large corpora such as chat forums, Twitter discussions or movie subtitles. These corpora are, however, often challenging to work with, notably due to their frequent lack of turn segmentation and the presence of multiple references external to the dialogue itself. This paper shows that these challenges can be mitigated by adding a weighting model into the neural architecture. The weighting model, which is itself estimated from dialogue data, associates each training example to a numerical weight that reflects its intrinsic quality for dialogue modelling. At training time, these sample weights are included into the empirical loss to be minimised. Evaluation results on retrieval-based models trained on movie and TV subtitles demonstrate that the inclusion of such a weighting model improves the model performance on unsupervised metrics.
机译:神经对话模型需要大量的对话数据来估计其参数,因此通常是在大型语料库(如聊天论坛,Twitter讨论或电影字幕)上学习的。但是,这些语料库通常很难与之配合使用,特别是由于它们经常缺乏回合细分,并且在对话本身之外还存在多个引用。本文表明,可以通过在神经体系结构中添加权重模型来缓解这些挑战。权重模型本身是根据对话数据估算的,将每个训练示例与一个数值权重相关联,以反映对话建模的内在质量。在训练时,将这些样本权重计入经验损失中以将其最小化。对以电影和电视字幕训练的基于检索的模型的评估结果表明,包含这种加权模型可以提高无监督指标上的模型性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号