Gene selection for high dimensional data using k-means clustering algorithm and statistical approach

机译：使用k均值聚类算法和统计方法对高维数据进行基因选择

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Microarray technology can measure thousands of genes which are useful for biologist to study and classify the cancer cells. However, this high dimensional data consists of large number of genes to be examined in regard of small samples size. Thus, selection of relevant genes is a challenging issue in microarray data analysis and has been a central research focus. This study proposed kmeans clustering algorithm to groups the relevant genes. Several statistical techniques such as Fisher criterion, Golub signal-to-noise, Mann Whitney rank and t-test have been used in deciding the clusters are well separated from one and others. Those genes with high discriminative score will later be used to train the k-NN classifier. The experimental results showed that the proposed gene selection methods able to identify differentially expressed genes with 0.86 ROC score.

机译：微阵列技术可以测量成千上万的基因，这对于生物学家研究和分类癌细胞非常有用。但是，这种高维数据由大量的基因组成，涉及的样本量较小。因此，相关基因的选择在微阵列数据分析中是一个具有挑战性的问题，并且一直是研究的重点。本研究提出了kmeans聚类算法对相关基因进行分组。一些统计技术（例如Fisher准则，Golub信噪比，Mann Whitney秩和t检验）已用于确定群集彼此之间很好地分离。那些具有高判别分数的基因将在以后用于训练k-NN分类器。实验结果表明，所提出的基因选择方法能够识别ROC评分为0.86的差异表达基因。

著录项

来源
《International Conference on Computational Science and Technology》|2014年|1-6|共6页
会议地点
作者
Ahmad Farzana Kabir; Yusof Yuhanis; Othman Nor Hayati;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
biology computing; cancer; genetics; pattern classification; pattern clustering; statistical analysis; Fisher criterion; Golub signal-to-noise; Mann Whitney rank; biologist; cancer cells; gene selection methods; high dimensional data; high discriminative score; k-NN classifier; k-means clustering algorithm; microarray data analysis; microarray technology; statistical techniques; t-test; Classification algorithms; Clustering algorithms; Data analysis; Gene expression; Information filtering; Scientific computing; Gene selection; microarray; statistical techniques;

机译：生物学计算;癌症;遗传学;模式分类;模式聚类;统计分析; Fisher准则; Golub信噪比; Mann Whitney等级;生物学家;癌细胞;基因选择方法;高维数据;高判别分数; k-NN分类器; k-均值聚类算法;微阵列数据分析;微阵列技术;统计技术; t检验;分类算法;聚类算法;数据分析;基因表达;信息过滤;科学计算;基因选择;微阵列;统计技术;

相似文献

外文文献
中文文献
专利

1. Gravitational search algorithm and K-means for simultaneous feature selection and data clustering: a multi-objective approach [J] . Prakash Jay, Singh Pramod Kumar Soft computing: A fusion of foundations, methodologies and applications . 2019,第6期

机译：引力搜索算法和K-means用于同时特征选择和数据聚类：多目标方法
2. Genetic Algorithm Based Dimensionality Reduction for Improving Performance of K-Means Clustering: A Case Study for Categorization of Medical Dataset [J] . Asha Gowda Karegowda, Vidya T. Shama, M.A. Jayaram, International journal of soft computing . 2012,第5期

机译：基于遗传算法的降维方法提高K-Means聚类性能：以医学数据集分类为例
3. Genetic Algorithm Based Dimensionality Reduction for Improving Performance of K-Means Clustering: A Case Study for Categorization of Medical Dataset [J] . Asha Gowda Karegowda, Vidya T. Shama, M.A. Jayaram, International journal of soft computing . 2012,第5期

机译：基于遗传算法的降维方法提高K-Means聚类性能：以医学数据集分类为例
4. Gene selection for high dimensional data using k-means clustering algorithm and statistical approach [C] . Ahmad Farzana Kabir, Yusof Yuhanis, Othman Nor Hayati International Conference on Computational Science and Technology . 2014

机译：使用K-Means聚类算法和统计方法的高维数据的基因选择
5. K-means clustering with automatic determination of K using a Multiobjective Genetic Algorithm with applications to microarray gene expression data. [D] . Shaw, Matthew Karl Ellis. 2015

机译：使用多目标遗传算法自动确定K值的K均值聚类，并应用于微阵列基因表达数据。
6. Statistical HOmogeneous Cluster SpectroscopY (SHOCSY):An Optimized Statistical Approach for Clustering of 1H NMR Spectral Data to ReduceInterference and Enhance Robust Biomarkers Selection [O] . Xin Zou, Elaine Holmes, Jeremy K. Nicholson, -1

机译：统计同质团簇光谱（SHOCSY）：1H NMR光谱数据聚类以减少的最佳统计方法干扰并增强健壮的生物标志物选择
7. Gene selection for high dimensional data using k-means clustering algorithm and statistical approach [O] . Farzana Kabir Ahmad, Yuhanis Yusof, Nor Hayati Othman 2014

机译：使用K-Means聚类算法和统计方法的高维数据的基因选择

Gene selection for high dimensional data using k-means clustering algorithm and statistical approach

摘要

著录项

相似文献

相关主题

期刊订阅