IEEE Annual Consumer Communications and Networking Conference

qMDP: DASH Adaptation using Queueing Theory within a Markov Decision Process



Abstract

Adaptive bitrate (ABR) streaming algorithms play an important role in ensuring a high Quality of Experience (QoE) for the consumer. However, many ABR algorithms tend to be ad hoc. In response, methods based on a Markov Decision Process (MDP) offer more intelligent models; in particular, Reinforcement Learning (RL) methods typically optimize QoE metrics directly. However, RL methods are plagued by high complexity and long convergence times due to their model-free nature. This paper proposes qMDP, an RL method whose MDP is partially modeled by an M/D/1/K queue. Our study shows that qMDP achieves higher QoE and faster convergence than a QoE-only, model-free version.
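To make the abstract's idea concrete, the following is a minimal, hypothetical sketch: the client playback buffer is treated as a bounded queue holding at most K segments with deterministic (segment-length) playback service, and that buffer sits inside a tabular Q-learning loop that picks bitrates against a QoE-style reward. The segment duration, bitrate ladder, reward weights, and bandwidth model below are illustrative assumptions only; they are not values or algorithms taken from the paper, and the paper's actual M/D/1/K transition model is not reproduced here.

```python
# Illustrative sketch only: buffer-as-bounded-queue + tabular Q-learning for
# DASH bitrate selection.  All constants are assumptions, not from the paper.
import random

SEGMENT_SEC = 2.0                        # deterministic "service time": playback of one segment
BITRATES_KBPS = [300, 750, 1500, 3000]   # hypothetical bitrate ladder (the action set)
BUFFER_CAP_K = 10                        # queue capacity K: max segments held in the buffer

def simulate_step(buffer_segments, action, bandwidth_kbps):
    """One step: download a segment at the chosen bitrate while the buffer
    drains at the deterministic playback rate; occupancy is capped at K."""
    bitrate = BITRATES_KBPS[action]
    download_sec = bitrate * SEGMENT_SEC / max(bandwidth_kbps, 1e-6)
    drained = download_sec / SEGMENT_SEC              # segments played during the download
    rebuffer_sec = max(0.0, drained - buffer_segments) * SEGMENT_SEC
    new_buffer = min(max(0.0, buffer_segments - drained) + 1.0, BUFFER_CAP_K)
    # QoE-style reward: quality minus a rebuffering penalty (weights are assumptions)
    reward = bitrate / 1000.0 - 4.0 * rebuffer_sec
    return new_buffer, reward

def q_learning(episodes=2000, alpha=0.1, gamma=0.9, eps=0.1):
    # State: discretised buffer occupancy 0..K; action: bitrate index.
    q = [[0.0] * len(BITRATES_KBPS) for _ in range(BUFFER_CAP_K + 1)]
    for _ in range(episodes):
        buf = 0.0
        for _ in range(50):                            # segments per episode
            s = min(int(buf), BUFFER_CAP_K)
            a = (random.randrange(len(BITRATES_KBPS)) if random.random() < eps
                 else max(range(len(BITRATES_KBPS)), key=lambda i: q[s][i]))
            bw = random.uniform(500, 4000)             # toy bandwidth draw per segment
            buf, r = simulate_step(buf, a, bw)
            s2 = min(int(buf), BUFFER_CAP_K)
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
    return q

if __name__ == "__main__":
    q_table = q_learning()
    for level, row in enumerate(q_table):
        best = BITRATES_KBPS[max(range(len(row)), key=lambda i: row[i])]
        print(f"buffer={level:2d} segments -> pick {best} kbps")
```

Running the sketch prints a simple policy mapping buffer occupancy to a preferred bitrate; in spirit, qMDP replaces the purely sampled transitions above with transition probabilities supplied by the M/D/1/K queue model, which is what the abstract credits for the faster convergence.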

