首页> 外文会议>Asia-Pacific Bioinformatics Conference(APBC 2003); 200302; Adelaide(AU) >Gene Expression Data Clustering and Visualization Based on a Binary Hierarchical Clustering Framework
【24h】

Gene Expression Data Clustering and Visualization Based on a Binary Hierarchical Clustering Framework

机译:基于二进制层次聚类框架的基因表达数据聚类和可视化

获取原文
获取原文并翻译 | 示例

摘要

We describe the use of a binary hierarchical clustering (BHC) framework for clustering of gene expression data. The BHC algorithm involves two major steps. Firstly, the K-means algorithm is used to split the data into two classes. Secondly, the Fisher criterion is applied to the classes to assess whether the splitting is acceptable. The algorithm is applied to the sub-classes recursively and ends when all clusters cannot be split any further. BHC does not require the number of clusters to be known. It does not place any assumption about the number of samples in each cluster or the class distribution. The hierarchical framework naturally leads to a tree structure representation. We show that by arranging the BHC clustered gene expression data in a tree structure, we can easily visualize the cluster results. In addition, the tree structure display allows user judgement in finalizing the clustering result using prior biological knowledge.
机译:我们描述了使用二进制层次聚类(BHC)框架对基因表达数据进行聚类。 BHC算法涉及两个主要步骤。首先,使用K-means算法将数据分为两类。其次,将Fisher准则应用于类别以评估拆分是否可以接受。该算法将递归应用于子类,并在无法进一步拆分所有聚类时结束。 BHC不需要知道簇数。它没有对每个聚类或类分布中的样本数量做出任何假设。层次框架自然会导致树结构表示。我们表明,通过以树形结构排列BHC聚类基因表达数据,我们可以轻松地可视化聚类结果。另外,树形结构显示允许用户使用现有的生物学知识来最终确定聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号