首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Compression of Multiple DNA Sequences Using Intra-Sequence and Inter-Sequence Similarities
【24h】

Compression of Multiple DNA Sequences Using Intra-Sequence and Inter-Sequence Similarities

机译:使用序列内和序列间相似性压缩多个DNA序列

获取原文
获取原文并翻译 | 示例
       

摘要

Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recently, remarkable compression performance of individual DNA sequence from the same population is achieved by encoding its difference with a nearly identical reference sequence. Nevertheless, there is lack of general algorithms that also allow less similar reference sequences. In this work, we extend the intra-sequence to the inter-sequence similarity in that approximate matches of subsequences are found between the DNA sequence and a set of reference sequences. Hence, a set of nearly identical DNA sequences from the same population or a set of partially similar DNA sequences like chromosome sequences and DNA sequences of related species can be compressed together. For practical compressors, the compressed size is usually influenced by the compression order of sequences. Fast search algorithms for the optimal compression order are thus developed for multiple sequences compression. Experimental results on artificial and real datasets demonstrate that our proposed multiple sequences compression methods with fast compression order search are able to achieve good compression performance under different levels of similarity in the multiple DNA sequences.
机译:传统上,利用序列内相似性来压缩单个DNA序列。最近,通过用几乎相同的参考序列编码其差异,实现了来自同一种群的单个DNA序列的卓越压缩性能。然而,缺乏通用算法也允许更少的相似参考序列。在这项工作中,我们将序列内相似性扩展到序列间相似性,因为在DNA序列和一组参考序列之间发现了子序列的近似匹配。因此,可以将来自同一种群的一组几乎相同的DNA序列或一组部分相似的DNA序列(如染色体序列和相关物种的DNA序列)压缩在一起。对于实际的压缩机,压缩大小通常受序列压缩顺序的影响。因此,针对多序列压缩开发了用于最佳压缩顺序的快速搜索算法。在人工和真实数据集上的实验结果表明,我们提出的具有快速压缩顺序搜索的多序列压缩方法能够在多个DNA序列的不同相似度下实现良好的压缩性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号