基于散列辞典的蛋白质二级结构预测方法

南雨宏; 陈绮

首页> 中文期刊> 《计算机技术与发展》 >基于散列辞典的蛋白质二级结构预测方法

基于散列辞典的蛋白质二级结构预测方法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a kind of easy to modify protein secondary structure prediction algorithm. Using PDB files from Protein Data Bank as a data source, extract all the protein amino acid sequences and build up a database, then for a-helix, [3-sheet, use different improved methods based on hash dictionary to implements the fragments prediction of protein' s secondary structure. During the forecasting process, taking 68 421 samples as part of the protein in the test set. For unknown sequence according to the established the fragments of hash dictionary use positive maximal matching points for segmentation lexical contrast. The results shows the prediction of segment reached 83.9% accuracy,but also to better reflect the sequence of amino acids connection.%提出一种易于修改的蛋白质二级结构预测算法.以蛋白质数据银行中PDB文本数据作为数据源,提取所有蛋白质氨基酸序列并以此建立样本数据库,然后针对α-螺旋、β-折叠分别利用基于散列辞典的不同改进方法编程实现蛋白质二级结构序列片段预测,在预测过程中,随机抽取68 421个蛋白质中部分样本作为测试集,对未知序列根据建立的散列辞典中的片段使用正向最大匹配分词法进行切分对比.从实验结果来看,对未知序列片段预测的准确度达到了83.9％,而且能够较好地体现片段之间的连接顺序.

著录项

来源
《计算机技术与发展》 |2011年第10期|168-170175|共4页
作者
南雨宏; 陈绮;
展开▼
作者单位

海南大学信息科学技术学院;

海南海口570228;

海南大学信息科学技术学院;

海南海口570228;

展开▼
原文格式 PDF
正文语种 chi
中图分类程序设计、软件工程;
关键词
蛋白质二级结构; 序列片段; 散列辞典; α-螺旋; β-折叠;

相似文献

中文文献
外文文献
专利

1. 基于长度信息和深度卷积神经网络分类建模的蛋白质二级结构预测方法 [J] . 朱树平 ,刘毅慧 . 计算机应用与软件 . 2021,第011期
2. 基于GEP-BP网络集成的蛋白质二级结构预测方法研究 [J] . 王艳春 . 计算机应用研究 . 2009,第010期
3. 基于交叉覆盖算法的蛋白质二级结构预测方法 [J] . 张燕平 ,章晶 ,徐庆鹏 . 电脑知识与技术 . 2009,第001期
4. 基于遗传算法的蛋白质二级结构预测方法研究进展 [J] . 孟翔燕 ,孟军 ,葛家麒 . 农机化研究 . 2009,第005期
5. 基于SVM的蛋白质二级结构预测方法的研究 [J] . 李昆仑 ,崔丽娟 ,张伟 . 计算机研究与发展 . 2007,第0z2期
6. 基于SVM的蛋白质二级结构预测方法的研究 [C] . 李昆仑 ,崔丽娟 ,张伟 . 第二届中国分类技术及应用学术会议 . 2007
7. 基于词频统计编码和流形学习的蛋白质二级结构预测方法研究 [A] . 刘倩倩 . 2013

基于散列辞典的蛋白质二级结构预测方法

摘要

著录项

相似文献

相关主题

期刊订阅