Predicting coding region candidates in the DNA sequence based on visualization without training




Abstract

Identifying protein-coding regions in DNA sequences is an active problem in computational biology. Many existing methods predict coding regions with extremely high accuracy, but only after a prior training process. This dependence on training may reduce their adaptability, particularly for new sequences from unknown organisms with small or no training sets. In this paper, we first present the Self Adaptive Spectral Rotation (SASR) approach, which was introduced in a previous work published in Nucleic Acids Research. The approach is used to visualize the Triplet Periodicity (TP) property, a simple and universal coding-related property. We then use a segmentation technique to analyze the visualization computationally and produce a numerical prediction of the coding region candidates in the DNA sequence. Because the approach requires no training, it can be applied before any extra information is available, and it is especially helpful for new sequences from unknown organisms. Hence, it could be an efficient tool for coding region prediction in early-stage studies.
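
The abstract does not spell out how SASR works, so the following is not the authors' method. It is a minimal sketch of the classic training-free way to measure triplet periodicity: the power of the discrete Fourier transform at frequency 1/3, summed over the four base indicator sequences, evaluated in a sliding window. The function names `period3_power` and `sliding_period3`, the window and step sizes, and the toy sequences are all assumptions made for illustration.

```python
import cmath
import random


def period3_power(seq):
    """Sum over A, C, G, T of |DFT at frequency 1/3|^2 of each base's 0/1 indicator."""
    total = 0.0
    for base in "ACGT":
        # Only positions holding this base contribute to its indicator's DFT.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * n / 3)
            for n, ch in enumerate(seq)
            if ch == base
        )
        total += abs(coeff) ** 2
    return total


def sliding_period3(seq, window=120, step=30):
    """Return (window_start, power) pairs for windows slid along the sequence."""
    return [
        (start, period3_power(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


# Toy data, made up for illustration: a perfectly periodic stretch
# flanked by uniform random bases (hypothetical, not real genomic data).
rng = random.Random(0)
left = "".join(rng.choice("ACGT") for _ in range(300))
right = "".join(rng.choice("ACGT") for _ in range(300))
periodic = "ATG" * 100
sequence = left + periodic + right

profile = sliding_period3(sequence)
best_start, best_power = max(profile, key=lambda item: item[1])
print(best_start, round(best_power, 1))
```

In this toy case the window with the highest power falls inside the periodic stretch (positions 300 to 599). Real coding regions show a much weaker and noisier signal, which is why a visualization and segmentation step such as the one described in the abstract is needed on top of a raw measure like this.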


Bibliographic record

Source
IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology | 2011 | 6 pages
Authors
Chen Bo; Ji Ping
Original format PDF
Chinese Library Classification TP18-53

Similar literature


1. In search of coding and non-coding regions of DNA sequences based on balanced estimation of diffusion entropy [J]. Zhang Jin, Zhang Wenqing, Yang Huijie. Journal of Biological Physics. 2016, No. 1
2. Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform [J]. Guangchen Liu, Yihui Luan. Abstract and Applied Analysis. 2014, No. 11
3. Genetic variation of Croton stellatopilosus Ohba based on non-coding DNA sequences of ITS, trnK and trnL-F regions [J]. Prasob-orn Rinthong, Shu Zhu, Katsuko Komatsu, Suchart Chanama. Natural medicines = 生薬学雜誌. 2011, No. 3-4
4. Predicting coding region candidates in the DNA sequence based on visualization without training [C]. Chen Bo, Ji Ping. 2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology. 2011
5. Two Methods of Analyse DNA Sequences: Predicting Coding Regions and Clustering Homologous DNA. [D]. Zhao, Bo. 2011
6. In search of coding and non-coding regions of DNA sequences based on balanced estimation of diffusion entropy [O]. Jin Zhang, Wenqing Zhang, Huijie Yang. 2016
7. Predicting coding region candidates in the DNA sequence based on visualization without training [O]. Chen BO, Ji P. 2011
