The Design of Pre-Processing Multidimensional Data Based on Component Analysis

Rahmat Widia Sembiring; Jasni Mohamad Zain

首页> 外文期刊>Computer and information science >The Design of Pre-Processing Multidimensional Data Based on Component Analysis

【24h】

The Design of Pre-Processing Multidimensional Data Based on Component Analysis

机译：基于分量分析的多维数据预处理设计

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Increased implementation of new databases related to multidimensional data involving techniques to support efficient query process, create opportunities for more extensive research. Pre-processing is required because of lack of data attribute values, noisy data, errors, inconsistencies or outliers and differences in coding. Several types of pre-processing based on component analysis will be carried out for cleaning, data integration and transformation, as well as to reduce the dimensions. Component analysis can be done by statistical methods, with the aim to separate the various sources of data into a statistical pattern independent. This paper aims to improve the quality of pre-processed data based on component analysis. RapidMiner is used for data pre-processing using FastICA algorithm. Kernel K-mean is used to cluster the pre-processed data and Expectation Maximization (EM) is used to model. The model was tested using Wisconsin breast cancer datasets, lung cancer datasets and prostate cancer datasets. The result shows that the performance of the cluster vector value is higher and the processing time is shorter.

机译：与涉及支持有效查询过程的技术的多维数据有关的新数据库的实施增加，为更广泛的研究创造了机会。由于缺少数据属性值，嘈杂的数据，错误，不一致或离群值以及编码差异，因此需要进行预处理。将进行基于组件分析的几种预处理，以进行清洗，数据集成和转换以及减小尺寸。成分分析可以通过统计方法完成，目的是将各种数据源分离为独立的统计模式。本文旨在基于组件分析提高预处理数据的质量。 RapidMiner用于使用FastICA算法进行数据预处理。内核K均值用于对预处理数据进行聚类，而期望最大化（EM）用于建模。使用威斯康星州乳腺癌数据集，肺癌数据集和前列腺癌数据集对模型进行了测试。结果表明，聚类向量值的性能较高，处理时间较短。

著录项

来源
《Computer and information science》 |2011年第3期|p.106-115|共10页
作者
Rahmat Widia Sembiring; Jasni Mohamad Zain;
展开▼
作者单位

Faculty of Computer System and Software Engineering, Universiti Malaysia Pahang Lebuhraya Tun Razak, 26300, Kuantan, Pahang Darul Makmur, Malaysia;

Faculty of Computer System and Software Engineering, Universiti Malaysia Pahang Lebuhraya Tun Razak, 26300, Kuantan, Pahang Darul Makmur, Malaysia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
pre-processing data; data cleansing; data noisy; fastICA;

机译：预处理数据;数据清理;数据嘈杂fastICA;

相似文献

外文文献
中文文献
专利

1. The Design of Pre-Processing Multidimensional Data Based on Component Analysis [J] . Rahmat Widia Sembiring, Jasni Mohamad Zain Computer and Information Science . 2011,第3期

机译：基于分量分析的多维数据预处理设计
2. Principal component analysis of turbulent combustion data: Data pre-processing and manifold sensitivity [J] . Alessandro Parente, James C. Sutherland Combustion and Flame . 2013,第2期

机译：湍流燃烧数据的主成分分析：数据预处理和歧管灵敏度
3. Design of face recognition algorithm using PCA -LDA combined for hybrid data pre-processing and polynomial-based RBF neural networks : Design and its application [J] . Sung-Kwun Oh, Sung-Hoon Yoo, Witold Pedrycz Expert Systems with Application . 2013,第5期

机译：基于PCA -LDA的混合数据预处理和基于多项式的RBF神经网络的人脸识别算法设计：设计与应用。
4. Component-based Data Layout for Efficient Slicing of Very Large Multidimensional Volumetric Data [C] . Kim, Jusub, JaJa, . 2007

机译：基于组件的数据布局，可对大型多维体数据进行高效切片
5. BISPECTRUM AND MULTIDIMENSIONAL POWER SPECTRUM ESTIMATION ALGORITHMS BASED ON PARAMETRIC MODELS WITH APPLICATIONS TO THE ANALYSIS OF ECG DATA (SPECTRAL ANALYSIS, NONLINEAR INTERACTIONS) [D] . RAGHUVEER, MYSORE RANGARAO 1984

机译：基于参数模型的双谱和多维功率谱估计算法及其在心电图数据分析中的应用（谱分析，非线性相互作用）
6. Identification and verification of ultrafine particle affinity zones in urban neighbourhoods: sample design and data pre-processing [O] . Paul Harris, Sarah Lindley, Martin Gallagher, 2009

机译：识别和验证城市社区中的超细颗粒亲和区：样品设计和数据预处理
7. The Design of Pre-Processing Multidimensional Data Based onudComponent Analysis [O] . Jasni Mohamad Zain, Rahmat Widia Sembiring 2011

机译：基于 ud的多维数据预处理设计成分分析
8. Multidimensional Analysis Based on a Two-Fluid Model of Fluid Flow in a Component of the LOFT System During a Loss of Coolant Experiment [R] . Demmie, P. N. 1979

机译：基于双流体模型的LOFT系统部件流体流动的多维分析

The Design of Pre-Processing Multidimensional Data Based on Component Analysis

摘要

著录项

相似文献

相关主题

期刊订阅