Sequential sampling procedures for query size estimation


Abstract

We provide a procedure, based on random sampling, for estimating the size of a query result. The procedure is sequential in that sampling terminates after a random number of steps according to a stopping rule that depends on the observations obtained so far. Enough observations are obtained so that, with a prespecified probability, the estimate differs from the true size of the query result by no more than a prespecified amount. Unlike previous sequential estimation procedures for queries, our procedure is asymptotically efficient and requires neither an ad hoc pilot sample nor a priori assumptions about data characteristics. In addition to establishing the asymptotic properties of the estimation procedure, we provide techniques for reducing undercoverage at small sample sizes and show that the sampling cost of the procedure can be reduced through stratified sampling techniques.
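To make the idea of a stopping rule concrete, here is a minimal Python sketch of sequential sampling for the size of a selection query. It is not the procedure from the paper, whose stopping rule the abstract does not state. It uses a generic textbook rule in the style of Chow and Robbins: keep sampling until the sample size n satisfies n >= z^2 (s_n^2 + 1/n) / d^2, where s_n^2 is the sample variance of the match indicator, z is the normal quantile for the target confidence, and d is the allowed absolute error divided by the table size. The function name, its parameters, and the `min_samples` floor are all assumptions made for illustration.

```python
import random
from statistics import NormalDist


def sequential_count_estimate(rows, predicate, abs_error,
                              confidence=0.95, min_samples=30, rng=None):
    """Estimate how many rows satisfy `predicate` by sequential sampling.

    Illustrative only: a Chow-Robbins-style rule, not the paper's procedure.
    Returns (estimated_size, number_of_samples_drawn).
    """
    rng = rng or random.Random()
    total = len(rows)
    # Two-sided normal quantile for the requested confidence level.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    # Allowed error on the selectivity (fraction of matching rows).
    d = abs_error / total

    n = 0
    hits = 0
    while True:
        row = rows[rng.randrange(total)]  # uniform sampling with replacement
        n += 1
        if predicate(row):
            hits += 1
        if n < min_samples:
            continue  # hypothetical floor to avoid stopping on tiny samples
        p = hits / n
        # Variance of a 0/1 indicator, plus 1/n so that a run of identical
        # observations (variance 0) cannot stop the sampling immediately.
        variance = p * (1 - p) + 1 / n
        if n >= z * z * variance / (d * d):
            return total * p, n


# Usage: estimate the result size of "x % 5 == 0" over 10,000 rows,
# asking for an absolute error of at most 500 rows at 95% confidence.
table = list(range(10_000))
estimate, samples = sequential_count_estimate(
    table, lambda x: x % 5 == 0, abs_error=500, rng=random.Random(0))
```

The number of samples drawn is itself random, because the stopping condition depends on the variance observed so far. That is the sense in which the abstract calls such a procedure sequential. The coverage guarantee of a rule like this holds only asymptotically, which is why corrections for undercoverage at small sample sizes matter.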


Bibliographic record

Source
ACM SIGMOD International Conference on Management of Data, 1992, pp. 341-350 (10 pages)
Authors
Peter J. Haas; Arun N. Swami
Original format: PDF
Chinese Library Classification: TP274.23
