【24h】

Estimating Entropy and Entropy Norm on Data Streams

机译:估计数据流上的熵和熵范数

获取原文
获取原文并翻译 | 示例

摘要

We consider the problem of computing information theoretic functions such as entropy on a data stream, using sublinear space. Our first result deals with a measure we call the "entropy norm" of an input stream: it is closely related to entropy but is structurally similar to the well-studied notion of frequency moments. We give a polyloga-rithmic space one-pass algorithm for estimating this norm under certain conditions on the input stream. We also prove a lower bound that rules out such an algorithm if these conditions do not hold. Our second group of results are for estimating the empirical entropy of an input stream. We first present a sublinear space one-pass algorithm for this problem. For a stream of m items and a given real parameter α, our algorithm uses space O(m~(2α)) and provides an approximation of 1/α in the worst case and (1 + ε) in "most" cases. We then present a two-pass polylogarithmic space (1 + ε)-approximation algorithm. All our algorithms are quite simple.
机译:我们考虑使用亚线性空间计算信息理论功能(例如数据流上的熵)的问题。我们的第一个结果涉及一种称为输入流的“熵范数”的量度:它与熵密切相关,但在结构上类似于经过精心研究的频率矩概念。我们给出了一个多元对数空间单次通过算法,用于在输入流的某些条件下估计该范数。如果这些条件不成立,我们还将证明可以排除这种算法的下限。我们的第二组结果用于估计输入流的经验熵。我们首先针对此问题提出一种亚线性空间单次通过算法。对于m个项的流和给定的实参α,我们的算法使用空间O(m〜(2α)),在最坏的情况下提供近似值1 /α,在“最”的情况下提供近似值(1 +ε)。然后,我们提出了两遍多对数空间(1 +ε)-逼近算法。我们所有的算法都非常简单。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号