An unsupervised attribute clustering algorithm for unsupervised feature selection


Abstract

The curse of dimensionality refers to the problems one faces when analyzing datasets with thousands or hundreds of thousands of attributes. It is usually tackled with feature selection methods, which have been shown to reduce computation time, improve prediction performance, and facilitate better understanding of datasets in various application areas. These methods can be classified into filter methods, wrapper methods, and embedded methods. All of these feature selection methods require class label information to perform their tasks. Hence, when such information is unavailable, feature selection can be very challenging. To overcome this challenge, we propose an unsupervised feature selection method called the Unsupervised Attribute Clustering Algorithm (UACA), which involves the following steps: i) calculate the Maximal Information Coefficient for each pair of attributes to construct an attribute distance matrix; ii) cluster all attributes using an optimal k-mode clustering method to find k mode attributes, each serving as the representative feature of its cluster. To evaluate the proposed algorithm, classification problems were tested with different classifiers to validate the method and compare it with other methods. The experimental results show that the proposed unsupervised algorithm is comparable with classical feature selection methods and even outperforms some supervised learning algorithms.
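The two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several parts are assumptions made for illustration: normalized mutual information over equal-width bins stands in for the Maximal Information Coefficient, the distance is taken as one minus that score, and a simple medoid-style loop with a fixed initialisation stands in for the "optimal k-mode" clustering, whose details the abstract does not give. All function names and the `bins` parameter are hypothetical.

```python
import math
from collections import Counter


def discretize(values, bins=4):
    """Equal-width binning of one numeric attribute column."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of discrete labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def dependence(x, y):
    """Normalized mutual information in [0, 1].

    Assumption: used here only as a stand-in for MIC."""
    hx, hy = entropy(x), entropy(y)
    if hx == 0 or hy == 0:
        return 0.0
    hxy = entropy(list(zip(x, y)))
    return (hx + hy - hxy) / min(hx, hy)


def distance_matrix(columns, bins=4):
    """Step i: pairwise attribute distances, taken here as 1 - dependence."""
    d = [discretize(c, bins) for c in columns]
    m = len(d)
    return [[0.0 if i == j else 1.0 - dependence(d[i], d[j])
             for j in range(m)] for i in range(m)]


def assign(dist, modes):
    """Put every attribute in the cluster of its nearest mode attribute."""
    clusters = {mode: [] for mode in modes}
    for a in range(len(dist)):
        # A mode always stays in its own cluster, so no cluster is empty.
        nearest = a if a in clusters else min(modes, key=lambda mo: dist[a][mo])
        clusters[nearest].append(a)
    return clusters


def cluster_attributes(dist, k, max_iter=100):
    """Step ii: medoid-style clustering of attributes.

    Each cluster's mode is the member with the smallest total distance
    to the other members. Initial modes are simply the first k attributes
    (an assumption; the abstract does not describe initialisation)."""
    modes = list(range(k))
    for _ in range(max_iter):
        clusters = assign(dist, modes)
        new_modes = sorted(
            min(members, key=lambda c: sum(dist[c][o] for o in members))
            for members in clusters.values()
        )
        if new_modes == modes:
            break
        modes = new_modes
    return modes, assign(dist, modes)
```

Calling `distance_matrix` on a list of attribute columns and passing the result to `cluster_attributes` with the desired `k` returns the indices of the selected attributes together with the cluster membership. No class labels are used at any point, which is the property the abstract emphasizes.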


Bibliographic record

Source
IEEE International Conference on Data Science and Advanced Analytics, 2015, pp. 1-7 (7 pages)
Authors
Zhou Pei-Yuan; Chan Keith C.C.

Original format: PDF
Keywords
Classification algorithms; Clustering algorithms; Clustering methods; Correlation; Data mining; Microwave integrated circuits; Mutual information; mode; unsupervised attribute clustering; unsupervised feature selection


