A proactive decision support method based on deep reinforcement learning and state partition

Wang Yongheng; Geng Shaofeng; Gao Hui

Abstract

Big streaming data is an important kind of big data that requires new processing technology. Extracting knowledge from online streaming data and making decisions online can help us get more value from big data. A proactive decision support system can predict future states and mitigate or eliminate undesired ones by acting proactively. However, several issues are difficult to handle, such as changes in the data distribution of streaming data, the combination of prediction and decision making, and the huge state space in decision making. In this paper, we propose a proactive decision support method based on deep reinforcement learning and state partition. The predictive analytics part uses deep belief networks with a two-level incremental training method. The deep reinforcement learning part uses deep belief networks for function approximation, learned by a semi-gradient method. Off-policy learning is supported through importance sampling. Two kinds of state partition and parallel execution methods are proposed to improve performance. The experimental evaluation in a traffic congestion control application shows that the method works well in both accuracy and performance. (C) 2017 Elsevier B.V. All rights reserved.
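
The abstract names two standard ingredients, a semi-gradient update and importance sampling for off-policy learning, without giving details. The following is only a minimal sketch of how those two ideas usually combine in a single TD(0) update, not the paper's implementation. It assumes a linear value function in place of the deep belief network the authors use, and all names (importance_ratio, value, semi_gradient_td0_step) and the parameters alpha and gamma are hypothetical choices made for illustration.

```python
def importance_ratio(target_prob, behavior_prob):
    """Per-step importance sampling ratio rho = pi(a|s) / b(a|s)."""
    if behavior_prob <= 0.0:
        raise ValueError("behavior policy must give the taken action non-zero probability")
    return target_prob / behavior_prob


def value(weights, features):
    """Linear value estimate v(s) = w . x(s) (assumption: stands in for a deep network)."""
    return sum(w * x for w, x in zip(weights, features))


def semi_gradient_td0_step(weights, features, reward, next_features, rho,
                           alpha=0.1, gamma=0.9, terminal=False):
    """Return new weights after one off-policy semi-gradient TD(0) update."""
    # Bootstrapped target; no bootstrap term when the next state is terminal.
    bootstrap = 0.0 if terminal else gamma * value(weights, next_features)
    td_error = reward + bootstrap - value(weights, features)
    # "Semi-gradient": the target is treated as a constant, so only v(s) is
    # differentiated. For a linear v(s) that gradient is the feature vector.
    # rho reweights a sample drawn from the behavior policy toward the target policy.
    return [w + alpha * rho * td_error * x for w, x in zip(weights, features)]


# Usage: one transition collected by a behavior policy that chose the action
# with probability 0.25, while the target policy would choose it with 0.5.
rho = importance_ratio(target_prob=0.5, behavior_prob=0.25)
new_weights = semi_gradient_td0_step(
    weights=[0.0, 0.0], features=[1.0, 0.0],
    reward=1.0, next_features=[0.0, 1.0], rho=rho,
)
```

When the target policy would never take the sampled action, rho is zero and the transition leaves the weights unchanged, which is how the reweighting discards experience that is irrelevant to the policy being learned.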

Bibliographic details

Source
Knowledge-Based Systems | 2018, Issue 1 | pp. 248-258 | 11 pages
Authors
Wang Yongheng; Geng Shaofeng; Gao Hui
Affiliation
Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China (listed for all three authors)
Original format
PDF
Language
English
Keywords
Event stream; Proactive decision support; Deep reinforcement learning; State partition
