Journal: 《计算机应用》 (Journal of Computer Applications)

Protein function prediction method based on PPI network and machine learning

         

Abstract

To address the low precision and noise sensitivity of current protein function prediction methods based on Protein-Protein Interaction (PPI) networks, a new machine learning method named HPMM (HC, PCA and MLP based Method) is proposed, which combines Hierarchical Clustering (HC), Principal Component Analysis (PCA) and Multi-Layer Perceptron (MLP). HPMM considers protein information at both the macro and micro levels: it integrates protein family, domain and important site information into the PPI network as vertex attributes to alleviate the effect of noise in the network data. First, features of functional modules and attribute principal components are extracted using HC and PCA. Second, an MLP model is trained to build a mapping between multiple features and multiple functions, which is then used to predict protein functions. Experiments were conducted on three Homo sapiens PPI networks, annotated with Molecular Function (MF), Biological Process (BP) and Cellular Component (CC) terms respectively. HPMM was compared with the Cosine Iterative Algorithm (CIA) and the Diffusing GO Terms in the Directed PPI Network (GoDIN) algorithm. The experimental results indicate that HPMM achieves higher precision and F-measure than CIA and GoDIN, which are purely PPI network based methods.
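The three-stage pipeline described in the abstract (HC for module features, PCA for attribute features, MLP for the multi-feature to multi-function mapping) can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the authors' implementation: the function names `build_features` and `demo`, the use of average linkage with Euclidean distance on adjacency rows, the numbers of modules and components, the hidden layer size, and the random network are all assumptions made for the example.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from sklearn.decomposition import PCA
from sklearn.neural_network import MLPClassifier


def build_features(adjacency, attributes, n_modules, n_components):
    """Concatenate one-hot module membership (HC) with attribute principal components (PCA)."""
    # Hierarchical clustering on adjacency rows: proteins with similar
    # interaction partners are grouped into the same (hypothetical) module.
    tree = linkage(adjacency, method="average")
    labels = fcluster(tree, t=n_modules, criterion="maxclust")  # labels start at 1
    module_onehot = np.eye(n_modules)[labels - 1]
    # PCA compresses the binary family/domain/site attribute matrix.
    components = PCA(n_components=n_components).fit_transform(attributes)
    return np.hstack([module_onehot, components])


def demo(seed=0):
    """Train a multi-label MLP on synthetic data and predict held-out proteins."""
    rng = np.random.default_rng(seed)
    n_proteins, n_attrs, n_functions = 60, 20, 5
    # Random symmetric, unweighted interaction network (assumed toy data).
    upper = np.triu(rng.random((n_proteins, n_proteins)) < 0.1, k=1)
    adjacency = (upper | upper.T).astype(float)
    # Binary vertex attributes and binary function annotations (assumed toy data).
    attributes = (rng.random((n_proteins, n_attrs)) < 0.3).astype(float)
    functions = (rng.random((n_proteins, n_functions)) < 0.4).astype(int)

    features = build_features(adjacency, attributes, n_modules=6, n_components=4)
    # A 2-D binary target makes MLPClassifier treat the task as multi-label.
    model = MLPClassifier(hidden_layer_sizes=(16,), max_iter=500, random_state=0)
    model.fit(features[:50], functions[:50])
    predicted = model.predict(features[50:])
    return features, predicted
```

Calling `demo()` returns the feature matrix for all proteins and a binary matrix with one row per held-out protein and one column per function term. In this sketch the clustering and PCA are fitted on all proteins before the train/test split, which is a simplification; the abstract does not state how the original method handles this.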
