...
首页> 外文期刊>Current Bioinformatics >Searching Exact Tandem Repeats in DNA Sequences Using Enhanced Suffix Array
【24h】

Searching Exact Tandem Repeats in DNA Sequences Using Enhanced Suffix Array

机译:使用增强的后缀数组在DNA序列中搜索精确串联重复

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Background: Genomes of organisms contains a variety of repeated structures of variouslengths and type, interspersed or tandem. Tandem repeats play important role in molecular biology asthey are related to genetic backgrounds of inherited diseases, and also they can serve as markers forDNA mapping and DNA fingerprinting. Improving the efficiency of algorithms for searching thetandem repeats in DNA sequences can lead to many useful applications in the area of genomics.Objective: We introduce an efficient algorithm of O(n) for searching the maximum length exact tandemrepeats in genomes.Method: Algorithm is based on the use of the Enhanced Suffix Array (ESA). ESA consists of SuffixArray (SA) and Longest Common Prefix (LCP) array. SA is an array of all sorted suffixes of a stringand LCP array stores the lengths of the longest common prefixes between all pairs of consecutivesuffixes in a sorted suffix array.Results: We compare the results of our computation with other existing application: Burrows WheelerTandem Repeat Searcher (BWtrs) for searching the exact tandem repeats. We provided an open sourcestandalone application called TR-ESA (available at: www.algorithms-akgec-shivika.in/tandem), whichimplements searching of exact maximum length tandem repeat.Conclusion: Tool is remarkably efficient and powerful which allows the analysis of complete genomeshaving exact tandem repeats.
机译:背景:生物体的基因组含有各种各样的各种重复结构,互化或串联。串联重复在分子生物学中发挥重要作用,哮喘与遗传疾病的遗传背景有关,并且它们也可以作为标记的载体映射和DNA指纹识别。提高DNA序列中的算法算法的效率可以导致基因组学区域中的许多有用的应用程序。objective:我们介绍了一个有效的O(n)算法,用于搜索Genomes中的最大长度精确串联评分。方法:算法是基于增强型后缀阵列(ESA)的使用。 ESA由SuppixArray(SA)和最长的公共前缀(LCP)数组组成。 SA是Stringand LCP阵列的所有排序后缀的数组,将所有在排序的后缀数组中的所有继承症之间的最长常见前缀的长度存储在一个排序的后缀array.results:我们将计算结果与其他现有应用程序进行比较:Burrows Wheelertandem重复搜索器(bwtrs)搜索确切的串联重复。我们提供了一个名为TRE-ESA的开放源代码应用程序(可提供:www.algorithms -akgec-shivika.in/tandem),搜索精确的最大长度串联重复.Conclusion:工具非常高效,强大,允许分析完整的分析Genomeshaving精确的串联重复。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号