...
首页> 外文期刊>Frontiers in Cell and Developmental Biology >An Accurate Tool for Uncovering Cancer Subtypes by Fast Kernel Learning Method to Integrate Multiple Profile Data
【24h】

An Accurate Tool for Uncovering Cancer Subtypes by Fast Kernel Learning Method to Integrate Multiple Profile Data

机译:通过快速内核学习方法揭示癌症子类型的准确工具,以集成多个配置文件数据

获取原文
           

摘要

In recent years, cancer has become a severe threat to human health. If we can accurately identify the subtypes of cancer, it will be of great significance to the research of anti-cancer drugs, the development of personalized treatment methods, and finally conquer cancer. In this paper, we obtain three feature representation datasets (gene expression profile, isoform expression and DNA methylation data) on lung cancer and renal cancer from the Broad GDAC, which collects the standardized data extracted from The Cancer Genome Atlas (TCGA). Since the feature dimension is too large, Principal Component Analysis (PCA) is used to reduce the feature vector, thus eliminating the redundant features and speeding up the operation speed of the classification model. By multiple kernel learning (MKL), we use Kernel target alignment (KTA), fast kernel learning (FKL), Hilbert-Schmidt Independence Criterion (HSIC), Mean to calculate the weight of kernel fusion. Finally, we put the combined kernel function into the support vector machine (SVM) and get excellent results. Among them, in the classification of renal cell carcinoma subtypes, the maximum accuracy can reach 0.978 by using the method of MKL (HSIC calculation weight), while in the classification of lung cancer subtypes, the accuracy can even reach 0.990 with the same method (FKL calculation weight).
机译:近年来,癌症已成为对人体健康的严重威胁。如果我们可以准确识别癌症的亚型,它将对抗癌药物的研究具有重要意义,是个性化治疗方法的发展,最后征服癌症。在本文中,我们从宽GDAC获取肺癌和肾癌的三个特征表示数据集(基因表达谱,同种型表达和DNA甲基化数据),其收集从癌症基因组Atlas(TCGA)中提取的标准化数据。由于特征维度太大,主要成分分析(PCA)用于减少特征向量,从而消除冗余特征并加速分类模型的操作速度。通过多个内核学习(MKL),我们使用内核目标对齐(KTA),快速内核学习(FKL),HILBERT-SCHMIDT独立性标准(HSIC),意味着计算核心融合的重量。最后,我们将组合的内核函数放入支持向量机(SVM)并获得优异的结果。其中,在肾细胞癌亚型的分类中,通过使用MKL(HSIC计算重量)的方法可以达到0.978,而在肺癌亚型的分类中,甚至可以通过相同的方法达到0.990的精度( FKL计算重量)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号