首页> 外文期刊>Services Computing, IEEE Transactions on >MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework
【24h】

MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

机译:MMDP:基于移动IOT的多模态强化学习服务框架

获取原文
获取原文并翻译 | 示例
           

摘要

With the development of GPS technology, a new Mobile Internet of Things (M-IoT) is emerging, which perceives the city's rhythm and pulse day and night to collect a large scale of city data. It is urgent to innovate M-IoT service system for these large-scale and heterogeneous data. To cope with the problem, this article proposes a Mobile-IoT based multi-modal reinforcement learning service framework from data perspective, which has three highlights, i) Developing Action-aware High-order Transition Tensor (AHTT) to fuse the heterogeneous data from M-IoTs in a unified form. ii) Developing Multi-modal Markov Decision Process (MMDP) to model the multi-modal reinforcement learning for M-IoT service framework. iii) Developing Tensor Policy Iteration algorithm (TPIA) to solve the optimal tensor policy. Due to using tensor keeps the multi-modal relations of the context information in the process of solving the optimal policy. The proposed M-IoT service system provides more personalized service for taxi drivers. The experiment results shows that most taxi drivers earn more revenue according to the tensor policy.
机译:随着GPS技术的发展,新的移动物联网(M-IOT)正在出现,这让城市的节奏和脉搏日夜感知到收集大规模的城市数据。迫切需要创新这些大规模和异构数据的M-IOT服务系统。为了应对问题,本文提出了一种基于数据透视图的移动IOT的多模态强化学习服务框架,它具有三个亮点,i)开发动作感知的高阶转换卷(AHTT)来熔断异构数据M-IOTS以统一的形式。 ii)开发多模态马尔可夫决策过程(MMDP)以模拟M-IOT服务框架的多模态强化学习。 iii)开发张统称迭代算法(TPIA)以解决最佳张解策略。由于使用张量,在解决最佳政策的过程中,保持上下文信息的多模态关系。提议的M-IOT服务系统为出租车驱动程序提供了更个性化的服务。实验结果表明,大多数出租车司机根据张解人的政策赚取更多收入。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号