IEEE Conference on Computer Communications

Tracking the State of Large Dynamic Networks via Reinforcement Learning

Abstract

A Network Inventory Manager (NIM) is a software solution that scans, processes, and records data about all devices in a network. We consider the problem faced by a NIM that can send out only a limited number of probes to track changes in a large, dynamic network. The underlying change rate of the Network Elements (NEs) is unknown and may be highly non-uniform. The NIM should concentrate its probe budget on the NEs that change most frequently, with the ultimate goal of minimizing the weighted Fraction of Stale Time (wFOST) of the inventory. However, the NIM cannot discover the change rate of an NE unless that NE is repeatedly probed. We develop and analyze two algorithms based on Reinforcement Learning to solve this exploration-vs-exploitation problem. The first is motivated by the Thompson Sampling method and the second is derived from the Robbins-Monro stochastic learning paradigm. We show that for a fixed probe budget, both algorithms produce a potentially unbounded improvement in wFOST compared to the baseline algorithm that divides the probe budget equally among all NEs. Our simulations of practical scenarios show optimal performance in minimizing wFOST while discovering the change rates of the NEs.
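The abstract names a Thompson-Sampling-motivated probe allocator but gives no implementation details. The following is a minimal, hypothetical sketch of how such an allocator could look, assuming each NE changes according to a Poisson process whose unknown rate is tracked with a Gamma posterior, and that each round the NIM probes the NEs with the largest weighted sampled rates. The function names, the Gamma-Poisson model, and the "changed since last probe" observation are illustrative assumptions, not the paper's actual algorithm.

    import numpy as np

    # Hypothetical sketch of a Thompson-Sampling-style probe allocator.
    # Assumptions (not from the paper): each NE changes as a Poisson process
    # with unknown rate, modeled by a Gamma(alpha, beta) posterior; each round
    # the NIM probes the `budget` NEs with the largest weighted sampled rate,
    # then updates the posterior from the probe outcome.

    rng = np.random.default_rng(0)

    def allocate_probes(alpha, beta, weights, budget):
        """Sample a change rate per NE from its Gamma posterior and pick the
        `budget` NEs with the largest weighted sampled rate."""
        sampled_rates = rng.gamma(alpha, 1.0 / beta)
        scores = weights * sampled_rates
        return np.argsort(scores)[-budget:]

    def update_posterior(alpha, beta, probed, changed, elapsed):
        """Approximate Gamma-Poisson update: a probe that finds a change is
        counted as one event over the time elapsed since the last probe."""
        alpha[probed] += changed[probed]
        beta[probed] += elapsed[probed]
        return alpha, beta

    # Toy usage: 1000 NEs, probe budget of 50 per round, hidden non-uniform rates.
    n_ne, budget = 1000, 50
    alpha = np.ones(n_ne)      # prior shape
    beta = np.ones(n_ne)       # prior rate
    weights = np.ones(n_ne)    # per-NE staleness weights
    true_rates = rng.uniform(0.01, 1.0, n_ne)

    elapsed = np.zeros(n_ne)
    for t in range(200):
        elapsed += 1.0
        probed = allocate_probes(alpha, beta, weights, budget)
        # An NE appears changed if at least one change occurred since its last probe.
        changed = rng.random(n_ne) < 1.0 - np.exp(-true_rates * elapsed)
        alpha, beta = update_posterior(alpha, beta, probed, changed.astype(float), elapsed)
        elapsed[probed] = 0.0

Under the same assumptions, a Robbins-Monro-style variant would replace the posterior-sampling step with a stochastic-approximation update of a point estimate of each NE's change rate; that variant is not sketched here.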
