...
首页> 外文期刊>Physical Review X >Complex Distributions Emerging in Filtering and Compression
【24h】

Complex Distributions Emerging in Filtering and Compression

机译:复杂的分布在过滤和压缩中出现

获取原文
           

摘要

In filtering, each output is produced by a certain number of different inputs. We explore the statistics of this degeneracy in an explicitly treatable filtering problem in which filtering performs the maximal compression of relevant information contained in inputs (arrays of zeros and ones). This problem serves as a reference model for the statistics of filtering and related sampling problems. The filter patterns in this problem conveniently allow a microscopic, combinatorial consideration. This allows us to find the statistics of outputs, namely the exact distribution of output degeneracies, for arbitrary input sizes. We observe that the resulting degeneracy distribution of outputs decays as e?clogαd with degeneracy d, where c is a constant and exponent α 1, i.e., faster than a power law. Importantly, its form essentially depends on the size of the input dataset, appearing to be closer to a power-law dependence for small dataset sizes than for large ones. We demonstrate that for sufficiently small input dataset sizes typical for empirical studies, this distribution could be easily perceived as a power law. We extend our results to filter patterns of various sizes and demonstrate that the shortest filter pattern provides the maximum informative representations of the inputs.
机译:在过滤时,每个输出由一定数量的不同输入产生。在明确的可治疗过滤问题中,我们探讨了这种退化的统计数据,其中过滤执行输入中包含的相关信息的最大压缩(零和1阵列)。此问题用作过滤和相关采样问题的统计的参考模型。在该问题中的滤波器图案方便地允许微观的组合考虑。这使我们能够找到输出的统计信息,即输出退化的精确分布,用于任意输入大小。我们观察到输出的结果退化分布作为E?CLOGαd与退化性D,其中C是常数和指数α> 1,即比电力法更快。重要的是,其形式基本上取决于输入数据集的大小,看起来更接近幂律对小型数据集大小的依赖性而不是大型数据集大小。我们证明,对于对实证研究的典型的足够小的输入数据集大小,可以很容易被认为是权力法的这种分布。我们将结果扩展到过滤各种尺寸的模式,并证明最短的过滤器模式提供了输入的最大信息表示。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号