Psychonomic Bulletin & Review

A reinforcement learning diffusion decision model for value-based decisions



Abstract

Psychological models of value-based decision-making describe how subjective values are formed and mapped to single choices. Recently, additional efforts have been made to describe the temporal dynamics of these processes by adopting sequential sampling models from the perceptual decision-making tradition, such as the diffusion decision model (DDM). These models, when applied to value-based decision-making, allow mapping of subjective values not only to choices but also to response times. However, very few attempts have been made to adapt these models to situations in which decisions are followed by rewards, thereby producing learning effects. In this study, we propose a new combined reinforcement learning diffusion decision model (RLDDM) and test it on a learning task in which pairs of options differ with respect to both value difference and overall value. We found that participants became more accurate and faster with learning, responded faster and more accurately when options had more dissimilar values, and decided faster when confronted with more attractive (i.e., overall more valuable) pairs of options. We demonstrate that the suggested RLDDM can accommodate these effects and does so better than previously proposed models. To gain a better understanding of the model dynamics, we also compare it to standard DDMs and reinforcement learning models. Our work is a step forward towards bridging the gap between two traditions of decision-making research.
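The abstract does not give the model equations, so the following is a minimal sketch of the general idea, assuming the common formulation in which option values are learned with a delta rule and the trial-by-trial drift rate of the DDM is a linear function of the current value difference. The function name and parameters here (simulate_rlddm, alpha, scale, bound, ndt) are illustrative assumptions, not the paper's specification, and the reward array stands in for a hypothetical two-option learning task.

```python
import numpy as np

def simulate_rlddm(rewards, alpha=0.1, scale=2.0, bound=1.0,
                   ndt=0.3, dt=1e-3, noise=1.0, seed=0):
    """Simulate one agent on a two-option learning task.

    rewards: (n_trials, 2) array holding the reward each option would
    pay on each trial (a stand-in for the task's reward schedule).
    """
    rng = np.random.default_rng(seed)
    q = np.zeros(2)                  # learned option values Q[0], Q[1]
    choices, rts = [], []
    for r in rewards:
        v = scale * (q[1] - q[0])    # drift rate from current value difference
        x, t = 0.0, 0.0
        while abs(x) < bound:        # Euler-Maruyama walk to either bound
            x += v * dt + noise * np.sqrt(dt) * rng.standard_normal()
            t += dt
        choice = int(x > 0)          # upper bound = option 1, lower = option 0
        q[choice] += alpha * (r[choice] - q[choice])  # delta-rule value update
        choices.append(choice)
        rts.append(t + ndt)          # decision time plus non-decision time
    return np.array(choices), np.array(rts)

# Illustrative run: option 0 pays off with probability 0.7, option 1 with 0.3.
rng = np.random.default_rng(1)
rewards = rng.binomial(1, [0.7, 0.3], size=(100, 2))
choices, rts = simulate_rlddm(rewards)
```

In this sketch, as the learned value difference grows over trials the drift rate moves away from zero, so simulated choices become both faster and more accurate, qualitatively matching the first two effects reported above. The third effect, faster responses for overall more valuable pairs, is not produced by a purely linear value-difference link and would require an additional mechanism that this minimal sketch deliberately omits.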