首页> 美国卫生研究院文献>other >Estimation of Distribution Overlap of Urn Models
【2h】

Estimation of Distribution Overlap of Urn Models

机译:瓮模型分布重叠的估计

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A classical problem in statistics is estimating the expected coverage of a sample, which has had applications in gene expression, microbial ecology, optimization, and even numismatics. Here we consider a related extension of this problem to random samples of two discrete distributions. Specifically, we estimate what we call the dissimilarity probability of a sample, i.e., the probability of a draw from one distribution not being observed in draws from another distribution. We show our estimator of dissimilarity to be a -statistic and a uniformly minimum variance unbiased estimator of dissimilarity over the largest appropriate range of . Furthermore, despite the non-Markovian nature of our estimator when applied sequentially over , we show it converges uniformly in probability to the dissimilarity parameter, and we present criteria when it is approximately normally distributed and admits a consistent jackknife estimator of its variance. As proof of concept, we analyze V35 16S rRNA data to discern between various microbial environments. Other potential applications concern any situation where dissimilarity of two discrete distributions may be of interest. For instance, in SELEX experiments, each urn could represent a random RNA pool and each draw a possible solution to a particular binding site problem over that pool. The dissimilarity of these pools is then related to the probability of finding binding site solutions in one pool that are absent in the other.
机译:统计中的经典问题是估计样本的预期覆盖率,该样本已应用于基因表达,微生物生态学,优化甚至钱币学领域。在这里,我们考虑将此问题扩展到两个离散分布的随机样本。具体来说,我们估计了我们所说的样本的相异概率,即在一个分布中未观察到另一分布中的概率。我们证明了我们的相异性估计量是a统计量,并且在的最大适当范围内是一个一致的最小方差无偏均匀度估计量。此外,尽管我们的估计器在顺序应用时具有非马尔可夫性质,但我们证明它在概率上均匀地收敛于相异性参数,并且当它近似呈正态分布并接受其方差的一致折弯估计器时,我们给出了标准。作为概念验证,我们分析了V35 16S rRNA数据以区分各种微生物环境。其他潜在的应用涉及可能需要关注两个离散分布的不相似性的任何情况。例如,在SELEX实验中,每个可能代表一个随机的RNA池,并且每个池都为该池上的特定结合位点问题画出了可能的解决方案。这些池的不相似性则与在一个池中找到结合位点溶液而另一个池中找不到结合位点溶液的可能性有关。

著录项

  • 期刊名称 other
  • 作者单位
  • 年(卷),期 -1(7),11
  • 年度 -1
  • 页码 e42368
  • 总页数 16
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号