首页>
外国专利>
Multiple imputation of missing data in multi-dimensional retail sales data sets via tensor factorization
Multiple imputation of missing data in multi-dimensional retail sales data sets via tensor factorization
展开▼
机译:通过张量分解对多维零售数据集中的缺失数据进行多次插补
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system, method and computer program product provides for multiple imputation of missing data elements in retail data sets used for modeling and decision-support applications based on the multi-dimensional, tensor structure of the data sets, and a fast, scalable scheme is implemented that is suitable for large data sets. The method generates multiple imputations comprising a set of complete data sets each containing one of a plurality of imputed realizations for the missing data values in the original data set, so that the variability in the magnitudes of these missing data values can be captured for subsequent statistical analysis. The method is based on the multi-dimensional structure of the retail data sets incorporating tensor factorization, that in a preferred embodiment can be implemented using fast, scalable imputation methods suitable for large data sets, to obtain multiple complete data sets in which the original missing values are replaced by various imputed values.
展开▼