Temporal logic control of general Markov decision processes by approximate policy refinement

Sofie Haesaert; Sadegh Soudjani; Alessandro Abate

首页> 外文期刊>IFAC PapersOnLine >Temporal logic control of general Markov decision processes by approximate policy refinement

【24h】

Temporal logic control of general Markov decision processes by approximate policy refinement

机译：通过近似策略改进对一般Markov决策过程进行时间逻辑控制

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The formal verification and controller synthesis for general Markov decision processes (gMDPs) that evolve over uncountable state spaces are computationally hard and thus generally rely on the use of approximate abstractions. In this paper, we contribute to the state of the art of control synthesis for temporal logic properties by computing and quantifying a less conservative gridding of the continuous state space of linear stochastic dynamic systems and by giving a new approach for control synthesis and verification that is robust to the incurred approximation errors. The approximation errors are expressed as both deviations in the outputs of the gMDPs and in the probabilistic transitions.

机译：在不可数状态空间上演化的一般Markov决策过程（gMDP）的形式验证和控制器综合在计算上比较困难，因此通常依赖于近似抽象的使用。在本文中，我们通过计算和量化线性随机动态系统的连续状态空间的保守程度较低的网格，并为控制综合和验证提供了一种新方法，从而为时间逻辑属性的控制综合提供了最新技术对所产生的近似误差具有鲁棒性。近似误差表示为gMDP的输出和概率跃迁的偏差。

著录项

来源
《IFAC PapersOnLine》 |2018年第16期|共6页
作者
Sofie Haesaert; Sadegh Soudjani; Alessandro Abate;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. VERIFICATION OF GENERAL MARKOV DECISION PROCESSES BY APPROXIMATE SIMILARITY RELATIONS AND POLICY REFINEMENT [J] . Haesaert Sofie, Soudjani Sadegh Esmaeil Zadeh, Abate Alessandro SIAM Journal on Control and Optimization . 2017,第4期

机译：通过近似相似性关系和政策细化验证通用马尔可夫决策过程
2. Optimal Control of Markov Decision Processes With Linear Temporal Logic Constraints [J] . Ding X., Smith S.L., Belta C., IEEE Transactions on Automatic Control . 2014,第5期

机译：具有线性时间逻辑约束的马尔可夫决策过程的最优控制
3. Formal Synthesis of Control Policies for Continuous Time Markov Processes From Time-Bounded Temporal Logic Specifications [J] . Medina Ayala A., Andersson S.B., Belta C. Automatic Control, IEEE Transactions on . 2014,第9期

机译：时限时间逻辑规范对连续时间马尔可夫过程控制策略的形式综合
4. Temporal logic control of general Markov decision processes by approximate policy refinement [C] . Sofie Haesaert, Sadegh Soudjani, Alessandro Abate IFAC Conference on Analysis and Design of Hybrid Systems . 2018

机译：通过近似政策细化的马尔可夫决策过程的时间逻辑控制
5. Multistage decisions and risk in Markov decision processes: Towards effective approximate dynamic programming architectures. [D] . Pratikakis, Nikolaos E. 2009

机译：马尔可夫决策过程中的多阶段决策和风险：建立有效的近似动态编程体系结构。
6. Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play [O] . Sherif Abdelfattah, Kathryn Kasmarik, Jiankun Hu 2018

机译：通过内在动机的自我博弈在多目标马尔可夫决策过程中发展稳健的政策覆盖范围
7. Temporal logic control of general Markov decision processes by approximate policy refinement [O] . Haesaert, Sofie, Soudjani, Sadegh, Abate, Alessandro 2017

机译：一般马尔可夫决策过程的时态逻辑控制近似政策改进
8. Learning Based Approach to Control Synthesis of Markov Decision Processes for Linear Temporal Logic Specifications. [R] . Sadigh, D., Kim, E., Coogan, S., 2014

机译：基于学习的线性时序逻辑规范马尔可夫决策过程综合控制方法。

Temporal logic control of general Markov decision processes by approximate policy refinement

摘要

著录项

相似文献

相关主题

期刊订阅