...
首页> 外文期刊>Quality Control and Applied Statistics >A scalable nonparametric specification testing for massive data
【24h】

A scalable nonparametric specification testing for massive data

机译:A scalable nonparametric specification testing for massive data

获取原文
获取原文并翻译 | 示例
           

摘要

Regression problems with too many observations are now commonplace. Such voluminous data require modeling approaches that are different from those used in classical analysis. Though some newly developed approaches are feasible theoretically, they lack computational ease. This article considers the problem of verifying a pre-specified parametric model and massive data sets employing scalable nonparametric tests. Assume a set of independent observations coming from a population in which the unknown regression function is assumed to be smooth. To justify the use a parametric model, a specification test on the functional form of the regression is needed. Given a parametric family of known real functions g(x,θ) the null and alternate hypotheses are The problem is to assess the validity of a given model for an observed data. A massive data source will usually not be unique and it is necessary to verify the correctness of parametric models for data sets from different sources. If the hypotheses are accepted, then some modeling method can be found by an aggregation mechanism. If the volume of the data involved is unimaginably large, then computation may be a problem even with current high-speed parallel processing. The article proposes simple strategies for construction test statistics that can avoid detailed computations for model checking. This is made possible by partitioning the massive data set into K subsets with equal sample size while K varies with the total data size N. A test statistic is created based on each subset and the results are aggregated by taking an average. This makes the computation far easier.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号