Distance Based Feature Selection for Clustering Microarray Data

Abstract

In microarray data analysis, clustering is a fundamental task for separating genes into biologically functional groups or for classifying tissues and phenotypes. With recent gene expression microarray technologies, the expression levels of thousands of genes (features) can be measured simultaneously in a single experiment. The large number of genes, many of them noisy, makes cluster analysis highly complex. This challenge has raised the demand for feature selection, an effective dimensionality reduction technique that removes noisy features. In this paper we propose a novel filter method for feature selection. The suggested method, called ClosestFS, is based on a distance measure. For each feature, the distance is evaluated by computing its impact on the histogram for the whole data. Our experimental results show that the quality of the clustering results (evaluated by several widely used measures) of the K-means algorithm using ClosestFS as the pre-processing step is significantly better than that of pure K-means.
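The abstract does not give the ClosestFS formula, so the following is only an illustrative sketch of one possible histogram-based filter, not the authors' method. It assumes that a feature's "impact" can be approximated by how much the normalised histogram of pairwise sample distances changes when that feature is left out, and that a larger change marks a more informative feature. The function names (`pairwise_distances`, `histogram`, `feature_impact`, `select_features`), the bin count, and the L1 histogram difference are all hypothetical choices made for this example.

```python
import math
import random


def pairwise_distances(rows, features):
    """Euclidean distance for every pair of samples, using only `features`."""
    dists = []
    for i in range(len(rows)):
        for j in range(i + 1, len(rows)):
            d = math.sqrt(sum((rows[i][f] - rows[j][f]) ** 2 for f in features))
            dists.append(d)
    return dists


def histogram(values, bins):
    """Normalised histogram of `values`, binned over [0, max(values)]."""
    top = max(values) or 1.0  # avoid dividing by zero when all distances are 0
    counts = [0] * bins
    for v in values:
        idx = min(int(v / top * bins), bins - 1)
        counts[idx] += 1
    return [c / len(values) for c in counts]


def feature_impact(rows, bins=10):
    """Assumed score: L1 change in the distance histogram when a feature is dropped."""
    all_feats = list(range(len(rows[0])))
    base = histogram(pairwise_distances(rows, all_feats), bins)
    scores = []
    for f in all_feats:
        rest = [g for g in all_feats if g != f]
        reduced = histogram(pairwise_distances(rows, rest), bins)
        scores.append(sum(abs(a - b) for a, b in zip(base, reduced)))
    return scores


def select_features(rows, k, bins=10):
    """Indices of the k features with the largest assumed impact score."""
    scores = feature_impact(rows, bins)
    return sorted(range(len(scores)), key=lambda f: scores[f], reverse=True)[:k]


# Toy data: feature 0 separates two groups, features 1 and 2 are uniform noise.
rng = random.Random(0)
toy = [[0.0 if i < 10 else 10.0, rng.random(), rng.random()] for i in range(20)]
print(feature_impact(toy))
print(select_features(toy, k=1))
```

In a pipeline like the one the abstract describes, the selected columns would then be passed to K-means in place of the full feature set. Whether high or low impact should be preferred, and how many features to keep, depends on details of the paper that the abstract does not state.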

Bibliographic record

Source: Database Systems for Advanced Applications, 2008, pp. 512-519 (8 pages)
Conference location: New Delhi (IN)
Authors: Manoranjan Dash; Vivekanand Gopalkrishnan
Original format: PDF
Language: English
Chinese Library Classification: TP311.13
Keywords: feature selection; clustering; distance function; microarray data
