...
首页> 外文期刊>Nucleic acids research >Protein multiple sequence alignment by hybrid bio-inspired algorithms
【24h】

Protein multiple sequence alignment by hybrid bio-inspired algorithms

机译:通过混合生物启发算法进行蛋白质多序列比对

获取原文
           

摘要

This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA alignments. The problem of finding the multiple alignment was investigated in the study by Bonizzoni and Vedova and Wang and Jiang, and proved to be a NP-hard (non-deterministic polynomial-time hard) problem. The presented algorithm, called Immunological Multiple Sequence Alignment Algorithm (IMSA), incorporates two new strategies to create the initial population and specific ad hoc mutation operators. It is based on the ‘weighted sum of pairs' as objective function, to evaluate a given candidate alignment. IMSA was tested using both classical benchmarks of BAliBASE (versions 1.0, 2.0 and 3.0), and experimental results indicate that it is comparable with state-of-the-art multiple alignment algorithms, in terms of quality of alignments, weighted Sums-of-Pairs (SP) and Column Score (CS) values. The main novelty of IMSA is its ability to generate more than a single suboptimal alignment, for every MSA instance; this behaviour is due to the stochastic nature of the algorithm and of the populations evolved during the convergence process. This feature will help the decision maker to assess and select a biologically relevant multiple sequence alignment. Finally, the designed algorithm can be used as a local search procedure to properly explore promising alignments of the search space.
机译:本文提出了一种免疫启发算法来解决多序列比对(MSA)问题。 MSA是生物序列分析中最重要的任务之一。尽管本文着重于蛋白质比对,但大多数讨论和方法论也可应用于DNA比对。 Bonizzoni和Vedova以及Wang和Jiang在研究中研究了寻找多重比对的问题,并证明是NP-hard(非确定性多项式-时间-hard)问题。提出的算法称为免疫多序列比对算法(IMSA),它结合了两种新策略来创建初始种群和特定的特设突变算子。它基于“成对加权总和”作为目标函数,以评估给定的候选比对。 IMSA已使用BAliBASE的两个经典基准测试(版本1.0、2.0和3.0)进行了测试,并且实验结果表明,就对齐质量,加权求和而言,IMSA可与最新的多重对齐算法相媲美。对(SP)和列分数(CS)值。 IMSA的主要新颖之处在于,对于每个MSA实例,它都可以生成不止一个次优比对。此行为是由于算法的收敛性和收敛过程中演化出的总体的随机性所致。此功能将帮助决策者评估和选择生物学上相关的多序列比对。最后,设计的算法可以用作本地搜索过程,以正确探索搜索空间的有希望的对齐方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号