Regression problems with extremely large numbers of observations are now commonplace. Such voluminous data require modeling approaches that differ from those used in classical analysis. Some newly developed approaches are theoretically feasible but lack computational ease. This article considers the problem of verifying a pre-specified parametric model for massive data sets by means of scalable nonparametric tests.

Assume a set of independent observations drawn from a population in which the unknown regression function, m(x), is assumed to be smooth. To justify the use of a parametric model, a specification test on the functional form of the regression is needed. Given a parametric family of known real functions g(x, θ), θ ∈ Θ, the null and alternative hypotheses are

H0: m(x) = g(x, θ) for some θ ∈ Θ, versus H1: m(x) ≠ g(x, θ) for all θ ∈ Θ.

The problem is to assess the validity of a given model for observed data. A massive data source is usually not unique, so it is necessary to verify the correctness of parametric models for data sets from different sources. If the null hypothesis is accepted, a modeling method can then be obtained through an aggregation mechanism.

When the volume of data involved is extremely large, computation may be a problem even with current high-speed parallel processing. The article proposes simple strategies for constructing test statistics that avoid detailed computations for model checking. This is made possible by partitioning the massive data set of size N into K subsets of equal sample size, where K varies with N. A test statistic is computed on each subset, and the results are aggregated by averaging. This makes the computation far easier.
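The partition-and-average strategy above can be sketched in code. This is a minimal illustration, not the article's actual procedure: the linear family g(x, θ) = θ0 + θ1·x and the toy lack-of-fit statistic (the mean product of neighboring residuals after sorting by x, which should be near zero under H0 and positive under systematic misspecification) are illustrative assumptions.

```python
import numpy as np

def block_statistic(x, y):
    """Toy lack-of-fit statistic for one subset.

    Fits the assumed parametric family (here g(x, theta) = theta0 + theta1*x)
    and measures whether neighboring residuals, ordered by x, are positively
    correlated -- a sign of structure the parametric model missed."""
    theta = np.polyfit(x, y, deg=1)          # parametric fit on this block
    resid = y - np.polyval(theta, x)
    order = np.argsort(x)
    r = resid[order]
    return np.mean(r[:-1] * r[1:])           # ~0 under H0, >0 under misfit

def divide_and_conquer_test(x, y, K):
    """Partition the N observations into K subsets of equal sample size,
    compute the statistic on each subset, and average the results."""
    n = len(x) // K                          # equal block size; remainder dropped
    stats = [block_statistic(x[i * n:(i + 1) * n], y[i * n:(i + 1) * n])
             for i in range(K)]
    return np.mean(stats)

rng = np.random.default_rng(0)
N = 10_000
x = rng.uniform(0, 1, N)
y_null = 1.0 + 2.0 * x + rng.normal(0, 0.1, N)               # H0 holds
y_alt = 1.0 + 2.0 * x + 3.0 * x**2 + rng.normal(0, 0.1, N)   # H0 fails

print(divide_and_conquer_test(x, y_null, K=20))  # near zero under H0
print(divide_and_conquer_test(x, y_alt, K=20))   # clearly positive under H1
```

Each block's statistic is computed independently, so the K computations parallelize trivially, and no single pass over all N observations is ever needed.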