MIMCA: multiple imputation for categorical variables with multiple correspondence analysis

Audigier Vincent; Husson Francois; Josse Julie

首页> 外文期刊>Statistics and computing >MIMCA: multiple imputation for categorical variables with multiple correspondence analysis

【24h】

MIMCA: multiple imputation for categorical variables with multiple correspondence analysis

机译：MIMCA：具有多重对应分析的类别变量的多重插补

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a multiple imputation method to deal with incomplete categorical data. This method imputes the missing entries using the principal component method dedicated to categorical data: multiple correspondence analysis (MCA). The uncertainty concerning the parameters of the imputation model is reflected using a non-parametric bootstrap. Multiple imputation using MCA (MIMCA) requires estimating a small number of parameters due to the dimensionality reduction property of MCA. It allows the user to impute a large range of data sets. In particular, a high number of categories per variable, a high number of variables or a small number of individuals are not an issue for MIMCA. Through a simulation study based on real data sets, the method is assessed and compared to the reference methods (multiple imputation using the loglinear model, multiple imputation by logistic regressions) as well to the latest works on the topic (multiple imputation by random forests or by the Dirichlet process mixture of products of multinomial distributions model). The proposed method provides a good point estimate of the parameters of the analysis model considered, such as the coefficients of a main effects logistic regression model, and a reliable estimate of the variability of the estimators. In addition, MIMCA has the great advantage that it is substantially less time consuming on data sets of high dimensions than the other multiple imputation methods.

机译：我们提出了一种多重插补方法来处理不完整的分类数据。此方法使用专用于分类数据的主成分方法：多重对应分析（MCA）来估算缺少的条目。使用非参数自举法可反映有关插补模型参数的不确定性。由于MCA的降维特性，使用MCA（MIMCA）的多重插补需要估算少量参数。它允许用户估算大范围的数据集。特别地，对于MIMCA而言，每个变量的类别数量很大，变量的数量很大或个体数量很少。通过基于真实数据集的模拟研究，对该方法进行评估，并将其与参考方法（使用对数线性模型进行多次插补，通过逻辑回归进行多次插补）以及该主题的最新著作（通过随机森林或由Dirichlet过程的乘积的多项式分布的乘积模型）。所提出的方法为所考虑的分析模型的参数提供了良好的点估计，例如主效应逻辑回归模型的系数，以及估计量变化的可靠估计。另外，MIMCA具有很大的优势，即与其他多种插补方法相比，在高维数据集上的耗时要少得多。

著录项

来源
《Statistics and computing》 |2017年第2期|501-518|共18页
作者
Audigier Vincent; Husson Francois; Josse Julie;
展开▼
作者单位

Agrocampus Quest, Appl Math Dept, 65 rue St Brieuc, F-35042 Rennes, France;

Agrocampus Quest, Appl Math Dept, 65 rue St Brieuc, F-35042 Rennes, France;

Agrocampus Quest, Appl Math Dept, 65 rue St Brieuc, F-35042 Rennes, France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Missing values; Categorical data; Multiple imputation; Multiple correspondence analysis; Bootstrap;

机译：缺失值;分类数据;多重插补;多重对应分析;引导程序;

相似文献

外文文献
中文文献
专利

1. Investigating the Performance of a Variation of Multiple Correspondence Analysis for Multiple Imputation in Categorical Data Sets [J] . Johané Nienkemper-Swanepoel, Michael J von Maltitz Journal of classification . 2017,第3期

机译：研究分类数据集中多个估算的多对应分析变化的性能
2. Multiple Correspondence Analysis via Polynomial Transformations of Ordered Categorical Variables [J] . Rosaria Lombardo, Jacqueline J. Meulman Journal of Classification . 2010,第2期

机译：通过有序分类变量的多项式变换进行多重对应分析
3. Multiple Correspondence Analysis via Polynomial Transformations of Ordered Categorical Variables [J] . Lombardo R, Meulman JJ Journal of classification . 2010,第2期

机译：有序分类变量多项式变换的多对应分析
4. Multiple Correspondence Analysis for Handling Large Binary Variables in Smoothed Location Model [C] . Penny Ngu Ai Huong, Hashibah binti Hamid, Nazrina binti Aziz Innovation and Analytics Conference and Exhibition . 2015

机译：处理平滑位置模型中大二元变量的多对应分析
5. Investigation of Multiple Imputation Methods for Categorical Variables [D] . Miranda, Samantha. 2020

机译：分类变量多重估算方法的研究
6. Multiple imputation methods for handling missing values in a longitudinal categorical variable with restrictions on transitions over time: a simulation study [O] . Anurika Priyanjali De Silva, Margarita Moreno-Betancur, Alysha Madhu De Livera, 2019

机译：多种插补方法用于处理纵向分类变量中的缺失值并随时间推移而受限制：模拟研究
7. MIMCA: Multiple imputation for categorical variables with multiple correspondence analysis [O] . Audigier, Vincent, Husson, François, Josse, Julie 2017

机译：MIMCA：具有多重对应分析的类别变量的多重插补

MIMCA: multiple imputation for categorical variables with multiple correspondence analysis

摘要

著录项

相似文献

相关主题

期刊订阅