首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >FlexSketch: Estimation of Probability Density for Stationary and Non-Stationary Data Streams
【2h】

FlexSketch: Estimation of Probability Density for Stationary and Non-Stationary Data Streams

机译:FlexSketch:估计静止和非静止数据流的概率密度

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Efficient and accurate estimation of the probability distribution of a data stream is an important problem in many sensor systems. It is especially challenging when the data stream is non-stationary, i.e., its probability distribution changes over time. Statistical models for non-stationary data streams demand agile adaptation for concept drift while tolerating temporal fluctuations. To this end, a statistical model needs to forget old data samples and to detect concept drift swiftly. In this paper, we propose FlexSketch, an online probability density estimation algorithm for data streams. Our algorithm uses an ensemble of histograms, each of which represents a different length of data history. FlexSketch updates each histogram for a new data sample and generates probability distribution by combining the ensemble of histograms while monitoring discrepancy between recent data and existing models periodically. When it detects concept drift, a new histogram is added to the ensemble and the oldest histogram is removed. This allows us to estimate the probability density function with high update speed and high accuracy using only limited memory. Experimental results demonstrate that our algorithm shows improved speed and accuracy compared to existing methods for both stationary and non-stationary data streams.
机译:高效和准确地估计数据流的概率分布是许多传感器系统中的重要问题。当数据流是非静止的时,即,其概率分布随时间变化而言特别具有挑战性。非静止数据流的统计模型需要敏捷适应概念漂移,同时容忍时间波动。为此,统计模型需要忘记旧数据样本并迅速检测概念漂移。在本文中,我们提出FlexSketch,一种用于数据流的在线概率密度估计算法。我们的算法使用直方图的集合,每个算法表示不同长度的数据历史。 FlexSketch更新每个直方图,用于新数据样本,并通过组合直方图的集合来生成概率分布,同时定期监视近期数据和现有模型之间的差异。当它检测到概念漂移时,将添加一个新的直方图,并将最旧的直方图被删除。这使我们能够估计具有高更新速度和高精度的概率密度函数,仅使用有限的存储器。实验结果表明,与用于静止和非静止数据流的现有方法相比,我们的算法显示出改善的速度和准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号