IEEE International Conference on Distributed Computing Systems

Context-Aware Deep Model Compression for Edge Cloud Computing



Abstract

While deep neural networks (DNNs) have led to a paradigm shift, their exorbitant computational requirements have long been a roadblock to their deployment at the edge, such as on wearable devices and smartphones. Hence, hybrid edge-cloud computing frameworks have been proposed to offload part of the computation to the cloud by naively partitioning DNN operations under the assumption of a constant network condition. However, real-world network state varies greatly with context, and DNN partitioning alone offers only a limited strategy space. In this paper, we exploit the structural flexibility of DNNs to fit the edge model to varying network contexts and different deployment platforms. Specifically, we design a reinforcement learning-based decision engine that searches for model transformation strategies against a combined objective of model accuracy and computation latency. The engine generates a context-aware model tree so that the DNN can decide at runtime which model branch to switch to. Emulation and field experiments show that our approach achieves a 30%-50% latency reduction while retaining model accuracy.
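The abstract describes a combined objective of model accuracy and computation latency, used both by the reinforcement-learning search and by the runtime branch switch over the generated model tree. The following is a minimal illustrative sketch of that selection step; the branch names, accuracy/latency figures, weighting scheme, and `alpha` parameter are assumptions for illustration, not the paper's actual formulation.

```python
# Hypothetical sketch: pick the model-tree branch that maximizes a
# combined accuracy/latency objective under the current network context.
# All names and numbers below are illustrative assumptions.
from dataclasses import dataclass


@dataclass
class Branch:
    name: str
    accuracy: float    # validation accuracy of this compressed variant
    latency_ms: float  # end-to-end latency measured in this network context


def reward(branch: Branch, alpha: float = 0.5) -> float:
    # Combined objective: higher accuracy and lower latency both raise
    # the reward; alpha trades one off against the other.
    return alpha * branch.accuracy - (1 - alpha) * branch.latency_ms / 100.0


def select_branch(branches: list[Branch], alpha: float = 0.5) -> Branch:
    # Runtime decision: switch to the branch with the best combined score.
    return max(branches, key=lambda b: reward(b, alpha))


# Illustrative model tree: three compression variants of one DNN.
tree = [
    Branch("full-model", accuracy=0.92, latency_ms=180.0),
    Branch("pruned-50%", accuracy=0.89, latency_ms=95.0),
    Branch("edge-only", accuracy=0.85, latency_ms=60.0),
]

best = select_branch(tree)
print(best.name)  # → edge-only
```

In this toy setting the heavily compressed variant wins because its latency saving outweighs its accuracy loss at `alpha = 0.5`; a larger `alpha` would favor the more accurate branches. The paper's engine searches the transformation strategies that populate such a tree offline, so the runtime decision reduces to a cheap lookup like the one above.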


