Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph

Pengfei Xu; Shuchu Han; Hao Huang; Hong Qin

首页> 外文期刊>International Journal of Data Science and Analytics >Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph

【24h】

Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph

机译：无监督谱特征选择算法的冗余功能拆除：基于非参数稀疏特征图的实证研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

For existing unsupervised spectral feature selection algorithms, the quality of the eigenvectors decides the performance. There eigenvectors are calculated from the Laplacian matrix of similarity graph which is built from samples. When applying these algorithms to high-dimensional data, we meet the very embarrassing chicken-and-egg problem: "the success of feature selection depends on the quality of indication vectors which are related to the structure of data. But the purpose of feature selection is to give more accurate data structure." To alleviate this problem, we propose a graph-based approach to reduce the dimension of data by searching and removing redundant features automatically. A sparse graph is generated at feature side and is used to learn the redundant relationship among features. We name this novel graph as sparse feature graph (SFG). To avoid the inaccurate distance information among high-dimensional vectors, the construction of SFG does not utilize the pairwise relationship among samples, which means the structure info of data is not used. Our proposed algorithm is also a nonparametric one as it does not make any assumption about the data distribution. We treat this proposed redundant feature removal algorithm as a data preprocessing approach for existing popular unsupervised spectral feature selection algorithms like multi-cluster feature selection (MCFS) which requires accurate cluster structure information based on samples. Our experimental results on benchmark datasets show that the proposed SFG and redundant feature remove algorithm can improve the performance of those unsupervised spectral feature selection algorithms consistently.

机译：对于现有的无监督谱特征选择算法，特征向量的质量决定性能。从样本构建的Laplacian矩阵计算特征向量。在将这些算法应用于高维数据时，我们符合非常尴尬的鸡肉和蛋问题：“特征选择的成功取决于与数据结构相关的指示向量。但要素选择的目的是给出更准确的数据结构。“为了缓解这个问题，我们提出了一种基于图形的方法来通过自动搜索和删除冗余功能来减少数据的维度。稀疏图是在特征侧生成的，并且用于学习特征之间的冗余关系。我们将此新颖的图表命名为稀疏功能图（SFG）。为了避免高维向量之间的不准确距离信息，SFG的构造不利用样本之间的成对关系，这意味着不使用数据的结构信息。我们所提出的算法也是一个非参数，因为它不会对数据分布进行任何假设。我们将该提出的冗余特征拆除算法视为现有流行的无监督谱特征选择算法的数据预处理方法，如多簇特征选择（MCF），这需要基于样本的准确集群结构信息。我们对基准数据集的实验结果表明，所提出的SFG和冗余特征删除算法可以始终如一地提高这些无监督谱特征选择算法的性能。

著录项

来源
《International Journal of Data Science and Analytics》 |2019年第1期|77-93|共17页
作者
Pengfei Xu; Shuchu Han; Hao Huang; Hong Qin;
展开▼
作者单位

College of Information Science and Technology Beijing Normal University Beijing China;

Department of Computer Science Stony Brook University Stony Brook USA;

Machine Learning Laboratory General Electric Global Research San Ramon CA USA;

Department of Computer Science Stony Brook University Stony Brook USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Sparse graph representation; Unsupervised spectral feature selection; Dense subgraph;

机译：稀疏图形表示;无监督的光谱特征选择;密集的子图;

相似文献

外文文献
中文文献
专利

1. Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph [J] . Pengfei Xu, Shuchu Han, Hao Huang, International Journal of Data Science and Analytics . 2019,第1期

机译：无监督频谱特征选择算法的冗余特征去除：基于非参数稀疏特征图的实证研究
2. Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection [J] . Xiaofeng Zhu, Xuelong Li, Shichao Zhang, Neural Networks and Learning Systems, IEEE Transactions on . 2017,第6期

机译：用于无监督光谱特征选择的鲁棒联合图稀疏编码
3. Unsupervised feature selection based on joint spectral learning and general sparse regression [J] . Neural computing & applications . 2020,第11期

机译：基于联合光谱学习和普通稀疏回归的无监督特征选择
4. Unsupervised feature selection algorithm based on sparse representation [C] . Guoqing Cui, Jie Yang, Masoumeh Zareapoor International Conference on Systems and Informatics . 2016

机译：基于稀疏表示的无监督特征选择算法
5. Investigation of Extending Feature Selection Algorithms to Explicit Feature Selection in Kernel Space [D] . Li, Qiaozhi. 2018

机译：核空间中扩展特征选择算法用于显式特征选择的研究
6. Unsupervised Spectral-Spatial Feature Selection-Based Camouflaged Object Detection Using VNIR Hyperspectral Camera [O] . Sungho Kim 2015

机译：基于无监督光谱空间特征选择的VNIR高光谱相机伪装目标检测
7. Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data [O] . Patrick Bradley, Sina Keller, Martin Weinmann 2018

机译：基于Ultrametricity和稀疏训练数据的无监督特征选择：高维超光谱数据分类的案例研究
8. Improved Feature Extraction, Feature Selection, and Identification Techniques That Create a Fast Unsupervised Hyperspectral Target Detection Algorithm [R] . Johnson, R. J. 2008

机译：改进的特征提取，特征选择和识别技术，创建快速无监督的高光谱目标检测算法

Redundant features removal for unsupervised spectral feature selection algorithms: an empirical study based on nonparametric sparse feature graph

摘要

著录项

相似文献

相关主题

期刊订阅