Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph

Pengfei Xu; Shuchu Han; Hao Huang; Hong Qin

Abstract

For existing unsupervised spectral feature selection algorithms, performance is determined by the quality of the eigenvectors. These eigenvectors are calculated from the Laplacian matrix of a similarity graph built from the samples. When applying these algorithms to high-dimensional data, we meet a chicken-and-egg problem: "the success of feature selection depends on the quality of indication vectors which are related to the structure of data. But the purpose of feature selection is to give more accurate data structure." To alleviate this problem, we propose a graph-based approach that reduces the dimension of the data by searching for and removing redundant features automatically. A sparse graph is generated on the feature side and is used to learn the redundancy relationships among features. We name this novel graph the sparse feature graph (SFG). To avoid the inaccurate distance information among high-dimensional vectors, the construction of the SFG does not use the pairwise relationships among samples, which means that the structure information of the data is not used. The proposed algorithm is also nonparametric, as it makes no assumption about the data distribution. We treat the proposed redundant feature removal algorithm as a data preprocessing step for existing popular unsupervised spectral feature selection algorithms such as multi-cluster feature selection (MCFS), which requires accurate sample-based cluster structure information. Our experimental results on benchmark datasets show that the proposed SFG and redundant feature removal algorithm consistently improve the performance of those unsupervised spectral feature selection algorithms.
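
The idea of a graph over features used as a preprocessing step can be illustrated with a minimal sketch. The abstract does not say how the SFG is constructed or how redundant features are picked from it, so everything below is an assumption made for illustration: edges come from absolute Pearson correlation between feature columns (each feature keeps at most `k` neighbors above a `threshold`), and redundancy groups are taken to be connected components, with the lowest-indexed feature kept from each group. The names `pearson`, `feature_graph`, and `remove_redundant` are hypothetical, and this is not the authors' algorithm.

```python
import math

def pearson(u, v):
    """Pearson correlation of two equal-length feature columns."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return 0.0 if su == 0 or sv == 0 else cov / (su * sv)

def feature_graph(X, k=2, threshold=0.9):
    """Build a sparse undirected graph whose nodes are the columns of X.

    Assumption: an edge means |correlation| >= threshold, and each feature
    proposes at most its k strongest neighbors. This is a stand-in for the
    SFG construction, which the abstract does not specify.
    """
    cols = list(zip(*X))  # transpose: one tuple per feature
    d = len(cols)
    edges = set()
    for i in range(d):
        scored = sorted(
            ((abs(pearson(cols[i], cols[j])), j) for j in range(d) if j != i),
            reverse=True,
        )
        for weight, j in scored[:k]:
            if weight >= threshold:
                edges.add((min(i, j), max(i, j)))
    return d, edges

def remove_redundant(X, k=2, threshold=0.9):
    """Keep one representative feature per connected component."""
    d, edges = feature_graph(X, k, threshold)
    parent = list(range(d))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in edges:
        ri, rj = find(i), find(j)
        # Attach the larger root to the smaller one, so the root of each
        # component is its lowest feature index.
        parent[max(ri, rj)] = min(ri, rj)

    kept = sorted({find(i) for i in range(d)})
    return kept, [[row[j] for j in kept] for row in X]

# Toy data: feature 1 is exactly twice feature 0; features 2 and 3 are
# uncorrelated with everything else.
X = [[1, 2, 1, 1], [2, 4, -1, -1], [3, 6, 1, -1], [4, 8, -1, 1]]
kept, reduced = remove_redundant(X)
print(kept)  # [0, 2, 3]
```

The reduced matrix would then be passed to a spectral feature selection method such as MCFS. Note that the graph is built only from relationships between feature columns, never from distances between samples, which is the property the abstract emphasizes.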

Bibliographic record

Source: International Journal of Data Science and Analytics, 2019, Issue 1, pp. 77-93 (17 pages)
Authors: Pengfei Xu; Shuchu Han; Hao Huang; Hong Qin
Affiliations:

College of Information Science and Technology, Beijing Normal University, Beijing, China

Department of Computer Science, Stony Brook University, Stony Brook, USA

Machine Learning Laboratory, General Electric Global Research, San Ramon, CA, USA

Department of Computer Science, Stony Brook University, Stony Brook, USA

Original format: PDF
Language: English
Keywords: Sparse graph representation; Unsupervised spectral feature selection; Dense subgraph

Similar literature

1. Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph [J]. Pengfei Xu, Shuchu Han, Hao Huang. International Journal of Data Science and Analytics. 2019, Issue 1
2. Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection [J]. Xiaofeng Zhu, Xuelong Li, Shichao Zhang. Neural Networks and Learning Systems, IEEE Transactions on. 2017, Issue 6
3. Unsupervised feature selection based on joint spectral learning and general sparse regression [J]. Neural computing & applications. 2020, Issue 11
4. Unsupervised feature selection algorithm based on sparse representation [C]. Guoqing Cui, Jie Yang, Masoumeh Zareapoor. International Conference on Systems and Informatics. 2016
5. Investigation of Extending Feature Selection Algorithms to Explicit Feature Selection in Kernel Space [D]. Li, Qiaozhi. 2018
6. Unsupervised Spectral-Spatial Feature Selection-Based Camouflaged Object Detection Using VNIR Hyperspectral Camera [O]. Sungho Kim. 2015
7. Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data [O]. Patrick Bradley, Sina Keller, Martin Weinmann. 2018
8. Improved Feature Extraction, Feature Selection, and Identification Techniques That Create a Fast Unsupervised Hyperspectral Target Detection Algorithm [R]. Johnson, R. J. 2008
