New Algorithms for Finding Monad Patterns in DNA Sequences

机译：在DNA序列中寻找Monad模式的新算法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present two new algorithms for discovering monad patterns in DNA sequences. Monad patterns are of the form (l,d)-k, where l is the length of the pattern, d is the maximum number of mismatches allowed, and k is the minimum number of times the pattern is repeated in the given sample. The time-complexity of some of the best known algorithms to date is O(nt~2 l~d σ~d), where t is the number of input sequences, n is the length of each input sequence, and σ = |Σ| is the size of the alphabet. The first algorithm that we present in this paper takes O(n~2 t~2 l~(d/2)) time and O(ntl~(d/2) σ~(d/2)) space, and the second algorithm takes O(n~3 t~3 l~(d/2) σ~(d/2)) time using O(l~(d/2) σ~(d/2)) space. In practice, our algorithms have much better performance provided the d/l ratio is small. The second algorithm performs very well even for large values l and d as long as the d/l ratio is small.

机译：在本文中，我们提出了两种用于发现DNA序列中单峰模式的新算法。 Monad模式的形式为（l，d）-k，其中l是模式的长度，d是允许的最大不匹配数，k是在给定样本中重复模式的最小次数。迄今为止，一些最著名的算法的时间复杂度为O（nt〜2 l〜dσ〜d），其中t是输入序列的数量，n是每个输入序列的长度，并且σ= |Σ |是字母的大小。我们在本文中提出的第一个算法占用O（n〜2 t〜2 l〜（d / 2））时间和O（ntl〜（d / 2）σ〜（d / 2））空间，第二个算法占用该算法使用O（l〜（d / 2）σ〜（d / 2））空间花费O（n〜3 t〜3 l〜（d / 2）σ〜（d / 2））时间。实际上，只要d / l比很小，我们的算法就会有更好的性能。只要d / l之比很小，第二种算法即使对于较大的l和d也可以很好地执行。

著录项

来源
《International Conference on String Processing and Information Retrieval(SPIRE 2004); 20041005-08; Padova(IT)》|2004年|P.273-285|共13页
会议地点 Padova(IT)
作者
Ravi Vijaya Satya; Amar Mukherjee;
展开▼
作者单位

School of Computer Science, University of Central Florida Orlando, FL USA 32816-2362;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类数据备份与恢复;
关键词

相似文献

外文文献
中文文献
专利

1. Genetic algorithm for dyad pattern finding in DNA sequences [J] . Zare-Mirakabad Fatemeh, Ahrabian Hayedeh, Sadeghi Mehdi, Genes & Genetic Systems . 2009,第1期

机译：DNA序列中二分体模式发现的遗传算法
2. Genetic algorithm for dyad pattern finding in DNA sequences [J] . Abbas Nowzari-Dalini, Bahram Goliaei, Fatemeh Zare-Mirakabad, Genes & Genetic Systems . 2009,第1期

机译：DNA序列中二分体模式发现的遗传算法
3. Pattern locator: a new tool for finding local sequence patterns in genomic DNA sequences [J] . Mrazek J, Xie SH Bioinformatics . 2006,第24期

机译：模式定位器：一种用于寻找基因组DNA序列中局部序列模式的新工具
4. PRUNER: algorithms for finding monad patterns in DNA sequences [C] . VijayaSatya, R., Mukheqee, . 2004

机译：PRUNER：用于在DNA序列中查找monad模式的算法
5. Finding patterns in DNA sequences through visualization with symbolic scatter plots. [D] . Cox, David N. 2010

机译：通过使用符号散点图进行可视化查找DNA序列中的模式。
6. WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences. [O] . G Pesole, N Prunella, S Liuni, 1992

机译：WORDUP：一种有效的算法用于发现DNA序列中具有统计学意义的模式。
7. PRUNER: Algorithms for Finding Monad Patterns in DNA Sequences [O] . Ravi Vijayasatya, Amar Mukherjee 2014

机译：pRUNER：在DNa序列中寻找monad模式的算法

New Algorithms for Finding Monad Patterns in DNA Sequences

摘要

著录项

相似文献

相关主题

期刊订阅