MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

Wang Puming; Yang Laurence T.; Li Jintao; Li Xue; Zhou Xiaokang

首页> 外文期刊>Services Computing, IEEE Transactions on >MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

【24h】

MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

机译：MMDP：基于移动IOT的多模态强化学习服务框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the development of GPS technology, a new Mobile Internet of Things (M-IoT) is emerging, which perceives the city's rhythm and pulse day and night to collect a large scale of city data. It is urgent to innovate M-IoT service system for these large-scale and heterogeneous data. To cope with the problem, this article proposes a Mobile-IoT based multi-modal reinforcement learning service framework from data perspective, which has three highlights, i) Developing Action-aware High-order Transition Tensor (AHTT) to fuse the heterogeneous data from M-IoTs in a unified form. ii) Developing Multi-modal Markov Decision Process (MMDP) to model the multi-modal reinforcement learning for M-IoT service framework. iii) Developing Tensor Policy Iteration algorithm (TPIA) to solve the optimal tensor policy. Due to using tensor keeps the multi-modal relations of the context information in the process of solving the optimal policy. The proposed M-IoT service system provides more personalized service for taxi drivers. The experiment results shows that most taxi drivers earn more revenue according to the tensor policy.

机译：随着GPS技术的发展，新的移动物联网（M-IOT）正在出现，这让城市的节奏和脉搏日夜感知到收集大规模的城市数据。迫切需要创新这些大规模和异构数据的M-IOT服务系统。为了应对问题，本文提出了一种基于数据透视图的移动IOT的多模态强化学习服务框架，它具有三个亮点，i）开发动作感知的高阶转换卷（AHTT）来熔断异构数据M-IOTS以统一的形式。 ii）开发多模态马尔可夫决策过程（MMDP）以模拟M-IOT服务框架的多模态强化学习。 iii）开发张统称迭代算法（TPIA）以解决最佳张解策略。由于使用张量，在解决最佳政策的过程中，保持上下文信息的多模态关系。提议的M-IOT服务系统为出租车驱动程序提供了更个性化的服务。实验结果表明，大多数出租车司机根据张解人的政策赚取更多收入。

著录项

来源
《Services Computing, IEEE Transactions on》 |2020年第4期|675-684|共10页
作者
Wang Puming; Yang Laurence T.; Li Jintao; Li Xue; Zhou Xiaokang;
展开▼
作者单位

Huazhong Univ Sci & Technol Sch Comp Sci & Technol Wuhan 430074 Peoples R China;

Huazhong Univ Sci & Technol Sch Comp Sci & Technol Wuhan 430074 Peoples R China|St Francis Xavier Univ Dept Comp Sci Antigonish NS B2G 2W5 Canada;

Meituan Dianping Co Beijing 100010 Peoples R China;

Henan Inst Technol Sch Elect Informat Engn Xinxiang 453003 Henan Peoples R China;

Shiga Univ Fac Data Sci Hikone Shiga 5228522 Japan|RIKEN Ctr Adv Intelligence Project Tokyo Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Tensors; Internet of Things; Markov processes; Reinforcement learning; Public transportation; Sensor systems; Multi-modal reinforcement learning; mobile Internet of Things; service framework; social sensors; multi-modal Markov decision process; action-aware high-order transition tensor; tensor policy iteration algorithm; optimal tensor policy;

机译：张量;东西互联网;马尔可夫进程;加强学习;公共交通;传感器系统;多模态强化学习;移动互联网;服务框架;社会传感器;多模态马尔可夫决策过程;动作感知的高阶转换张量;张解人策略迭代算法;最优张解政策;

相似文献

外文文献
中文文献
专利

1. The "Proactive" Model of Learning: Integrative Framework for Model-Free and Model-Based Reinforcement Learning Utilizing the Associative Learning-Based Proactive Brain Concept [J] . Zsuga Judit, Biro Klara, Papp Csaba, Behavioral neuroscience . 2016,第1期

机译：“主动”学习模型：利用基于联合学习的主动脑概念进行无模型和基于模型的强化学习的集成框架
2. Application of Multi-Modal Imaging Mediated by Iron Carbon Nanoparticles Based on Reinforcement Learning in the Diagnosis of Breast Nodules [J] . Zeng Yanni, Zhang Jiuxia, Meng Jun Journal of nanoscience and nanotechnology . 2021,第2期

机译：铁碳纳米粒子介导的多模态成像在乳腺结节诊断中的应用
3. Reinforcement learning-based dynamic bandwidth provisioning for quality of service in differentiated services networks [J] . Chen-Khong Tham, Timothy Chee-Kin Hui Computer Communications . 2005,第15期

机译：基于增强学习的动态带宽配置，用于差异化服务网络中的服务质量
4. Deep-NFVOrch: Deep Reinforcement Learning based Service Framework for Adaptive vNF Service Chaining in IDC-EONs [C] . Baojia Li, Wei Lu, Zuqing Zhu Optical Fiber Communications Conference and Exhibition . 2019

机译：Deep-NFVOrch：IDC-EON中用于自适应vNF服务链接的基于深度强化学习的服务框架
5. A Reinforcement Learning-based Framework for Resource Allocation and Task Assignment in Mobile Edge Computing Networks [D] . Hsieh, Li-Tse. 2021

机译：基于加强学习的移动边缘计算网络中的资源分配和任务分配框架
6. A Novel Dynamic Spectrum Access Framework Based on Reinforcement Learning for Cognitive Radio Sensor Networks [O] . Yun Lin, Chao Wang, Jiaxing Wang, 2016

机译：基于增强学习的认知无线电传感器网络动态频谱接入框架
7. Reinforcement Learning Based Novel Adaptive Learning Framework for Smart Grid Prediction [O] . Tian Li, Yongqian Li, Baogang Li 2017

机译：基于加强学习的智能电网预测的新型自适应学习框架

MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

摘要

著录项

相似文献

相关主题

期刊订阅