Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

Blaise Thomson; Steve Young

首页> 外文期刊>Computer speech and language >Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

【24h】

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

机译：贝叶斯对话状态更新：用于语音对话系统的POMDP框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on the partially observable Markov decision process (POMDP), which provides a well-founded, statistical model of spoken dialogue management. However, exact belief state updates in a POMDP model are computationally intractable so approximate methods must be used. This paper presents a tractable method based on the loopy belief propagation algorithm. Various simplifications are made, which improve the efficiency significantly compared to the original algorithm as well as compared to other POMDP-based dialogue state updating approaches. A second contribution of this paper is a method for learning in spoken dialogue systems which uses a component-based policy with the episodic Natural Actor Critic algorithm.rnThe framework proposed in this paper was tested on both simulations and in a user trial. Both indicated that using Bayesian updates of the dialogue state significantly outperforms traditional definitions of the dialogue state. Policy learning worked effectively and the learned policy outperformed all others on simulations. In user trials the learned policy was also competitive, although its optimality was less conclusive. Overall, the Bayesian update of dialogue state framework was shown to be a feasible and effective approach to building real-world POMDP-based dialogue systems.

机译：本文介绍了一种统计动机框架，用于在口头对话系统中执行实时对话状态更新和策略学习。该框架基于部分可观察的马尔可夫决策过程（POMDP），该过程提供了良好的口头对话管理统计模型。但是，POMDP模型中的确切置信状态更新在计算上难以实现，因此必须使用近似方法。本文提出了一种基于循环信念传播算法的可处理方法。进行了各种简化，与原始算法以及与其他基于POMDP的对话状态更新方法相比，显着提高了效率。本文的第二个贡献是一种在口语对话系统中学习的方法，该方法使用带有情节化的Natural Actor Critic算法的基于组件的策略。在模拟和用户试用中都对本文提出的框架进行了测试。两者都表明，使用对话状态的贝叶斯更新显着优于对话状态的传统定义。策略学习有效地发挥了作用，并且在模拟方面学习的策略优于其他所有策略。在用户试用中，尽管其最优性尚无定论，但学习的策略也具有竞争力。总的来说，对话状态框架的贝叶斯更新被证明是一种构建基于POMDP的现实世界对话系统的可行和有效的方法。

著录项

来源
《Computer speech and language》 |2010年第4期|p.562-588|共27页
作者
Blaise Thomson; Steve Young;
展开▼
作者单位

University of Cambridge, Engineering Department, Cambridge CB2 1TP, United Kingdom;

rnUniversity of Cambridge, Engineering Department, Cambridge CB2 1TP, United Kingdom;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
dialogue systems; robustness; POMDP; reinforcement learning;

机译：对话系统;健壮性POMDP;强化学习;

相似文献

外文文献
中文文献
专利

1. The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management [J] . Steve Young, Milica Gasic, Simon Keizer, Computer speech and language . 2010,第2期

机译：隐藏信息状态模型：基于POMDP的口语对话管理的实用框架
2. 'Can I Trust the Spoken Dialogue System Because It Uses the Same Words as I Do?'-Influence of Lexically Aligned Spoken Dialogue Systems on Trustworthiness and User Satisfaction [J] . Linnemann Gesa Alena, Jucks Regina Interacting with Computers . 2018,第3期

机译：“我可以信任口语对话系统，因为它使用的语言与我相同吗？”-词汇对齐的口语对话系统对可信度和用户满意度的影响
3. Real user evaluation of a POMDP spoken dialogue system using automatic belief compression [J] . Paul A. Crook, Simon Keizer, Zhuoran Wang, Computer speech and language . 2014,第4期

机译：使用自动信念压缩对POMDP口语对话系统进行真实用户评估
4. Combining POMDPs trained with User Simulations and Rule-based Dialogue Management in a Spoken Dialogue System [C] . Sebastian Varges, Silvia Quarteroni, Giuseppe Riccardi, Joint conference of the annual meeting of the Association for Computational Linguistics;International joint conference on natural language processing of the Asian Federation of Natural Languages Processing;ACL 2009;IJCNLP 2009 . 2009

机译：在口语对话系统中将受过用户模拟训练的POMDP与基于规则的对话管理相结合
5. Dialogue management in spoken dialogue systems with Degrees of Grounding. [D] . Roque, Antonio. 2009

机译：具有基础程度的语音对话系统中的对话管理。
6. Evaluating a Spoken Dialogue System for Recording Systems of Nursing Care [O] . Tittaya Mairittha, Nattaya Mairittha, Sozo Inoue 2019

机译：评估护理记录系统的口语对话系统
7. 2010. Bayesian update of dialogue state: A pomdp framework for spoken dialogue systems [O] . Blaise Thomson, Steve Young 2013

机译：2010年。对话状态的贝叶斯更新：语音对话系统的pomdp框架

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

摘要

著录项

相似文献

相关主题

期刊订阅