Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

Moon Kevin R.; Sricharan Kumar; Hero Alfred O. III

首页> 外文期刊>IEEE Transactions on Information Theory >Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

【24h】

Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

机译：与基因组学应用的集合估算广义互信息

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mutual information is a measure of the dependence between random variables that has been used successfully in myriad applications in many fields. Generalized mutual information measures that go beyond classical Shannon mutual information have also received much interest in these applications. We derive the mean squared error convergence rates of kernel density-based plug-in estimators of general mutual information measures between two multidimensional random variables X and Y for two cases: 1) X and Y are continuous; 2) X and Y may have a mixture of discrete and continuous components. Using the derived rates, we propose an ensemble estimator of these information measures called GENIE by taking a weighted sum of the plug-in estimators with varied bandwidths. The resulting ensemble estimators achieve the 1/N parametric mean squared error convergence rate when the conditional densities of the continuous variables are sufficiently smooth. To the best of our knowledge, this is the first nonparametric mutual information estimator known to achieve the parametric convergence rate for the mixture case, which frequently arises in applications (e.g. variable selection in classification). The estimator is simple to implement and it uses the solution to an offline convex optimization problem and simple plug-in estimators. A central limit theorem is also derived for the ensemble estimators and minimax rates are derived for the continuous case. We demonstrate the ensemble estimator for the mixed case on simulated data and apply the proposed estimator to analyze gene relationships in single cell data.

机译：相互信息是在许多字段中在MYRIAD应用程序中成功使用的随机变量之间的依赖的衡量标准。超越古典香农互联信息的广义互信息措施也在这些应用中获得了很多兴趣。我们推导出常用互动变量X和Y之间的一般互动仪表的常用互动估计的均方方的误差收敛率：1）x和y是连续的; 2）X和Y可具有离散和连续组分的混合物。使用衍生率，我们提出了一种由具有各种带宽的插件估计量的加权之和来提出这些信息措施的集合估计。当连续变量的条件密度足够平滑时，所产生的集合估计器实现1 / N的参数均方误差会聚速率。据我们所知，这是已知的第一个非参数互信息估计，以实现混合案例的参数收敛速率，其经常出现在应用中（例如，分类中的变量选择）。估算器易于实现，它使用解决方案到脱机凸优化问题和简单的插件估计器。还导出了集合估计的中央限位定理，并且导出了连续情况的Minimax速率。我们展示了用于模拟数据的混合案例的集合估计，并应用所提出的估计器来分析单个细胞数据中的基因关系。

著录项

来源
《IEEE Transactions on Information Theory》 |2021年第9期|5963-5996|共34页
作者
Moon Kevin R.; Sricharan Kumar; Hero Alfred O. III;
展开▼
作者单位

Utah State Univ Dept Math & Stat Logan UT 84322 USA;

Intuit Inc Mountain View CA 94043 USA;

Univ Michigan Elect Engn & Comp Sci Dept Ann Arbor MI 48109 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Convergence; Estimation; Random variables; Feature extraction; Entropy; Density measurement; Kernel; Mutual information; nonparametric estimation; central limit theorem; single cell data; feature selection; minimax rate;

机译：收敛;估计;随机变量;特征提取;熵;密度测量;核;互相信息;非参数估计;中央限位定理;单个细胞数据;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;特征选择;最小值;

相似文献

外文文献
中文文献
专利

1. ON THE ESTIMATION OF GENERALIZED LEBESGUE CONSTANT AND MODULUS OF GENERALIZED SINGULAR QUADRATURE FORMULAS AND ITS APPLICATION [J] . 数学物理学报（英文版） . 2006,第004期
2. New bounds on the mutual information for discrete constellations and application to wireless channel estimation [J] . A. Taufiq Asyhari 数字通信与网络（英文） . 2020,第004期
3. Detecting dynamical interdependence and generalized synchrony through mutual prediction in a neural ensemble [J] . Steven J. Schiff, Paul So, Taeum Chang, Physical review, E. Statistical physics, plasmas, fluids, and related interdisciplinary topics . 1996,第6期

机译：在神经系中通过相互预测来检测动态相互依赖和广义同步
4. Mutual Information and Parameter Estimation in the Generalized Inverse Gaussian Diffusion Model of Cortical Neurons [J] . Mustafa Sungkar, Toby Berger, William B Levy IEEE Transactions on Molecular, Biological and Multi-Scale Communications . 2016,第2期

机译：皮质神经元广义逆高斯扩散模型中的互信息和参数估计
5. Mutual information-based CT-MR brain image registration using generalized partial volume joint histogram estimation [J] . Hua-mei Chen, Varshney P.K. IEEE Transactions on Medical Imaging . 2003,第9期

机译：基于广义局部体积联合直方图估计的基于互信息的CT-MR脑图像配准
6. Ensemble estimation of mutual information [C] . Kevin R. Moon, Kumar Sricharan, Alfred O. Hero IEEE International Symposium on Information Theory . 2017

机译：相互信息的综合估计
7. Compositional Applications of Generalized Lead Structures, SP-Cycles, and Syntax Graphs and Hardscrabble for Wind Ensemble. [D] . Rice, Steven. 2014

机译：广义引线结构，SP循环以及语法图和Hardscrabble的合成应用。
8. A New Method of Probability Density Estimation with Application to Mutual Information Based Image Registration [O] . Ajit Rajwade, Arunava Banerjee, Anand Rangarajan -1

机译：一种概率密度估计的新方法及其在基于互信息的图像配准中的应用
9. Generalized Wavelet Thresholding: Estimation and Hypothesis Testing with Applications to Array Comparative Genomic Hybridization [O] . Schifano Elizabeth Danielle 2007

机译：广义小波阈值：估计和假设检验及其在阵列比较基因组杂交中的应用
10. Ensemble Single Column Modeling in the Tropics: Derivation of Observed Forcing Data Sets, Estimation of Observation Uncertainty and Application to Parametrization Improvements. [R] . Jakob, C., May, P., Seed, A., 2012

机译：热带地区的集合单柱建模：观测强迫数据集的推导，观测不确定度的估计及其在参数化改进中的应用。

Ensemble Estimation of Generalized Mutual Information With Applications to Genomics

摘要

著录项

相似文献

相关主题

期刊订阅