首页> 外文期刊>Signal Processing, IET >Iterative sequential Monte Carlo algorithm for motif discovery
【24h】

Iterative sequential Monte Carlo algorithm for motif discovery

机译:迭代序贯蒙特卡洛算法用于主题发现

获取原文
获取原文并翻译 | 示例
           

摘要

The discovery of motifs in transcription factor binding sites is important in the transcription process, and is crucial for understanding the gene regulatory relationship and evolution history. Identifying weak motifs and reducing the effect of local optima, error propagation and computational complexity are still important, but challenging tasks for motif discovery. This study proposes an iterative sequential Monte Carlo (ISMC) motif discovery algorithm based on the position weight matrix and the Gibbs sampling model to locate conserved motifs in a given set of nucleotide sequences. Three sub-algorithms have been proposed. Algorithm 1 (see Fig. 1) deals with the case of one motif instance of fixed length in each nucleotide sequence. Furthermore, the proposed ISMC algorithm is extended to deal with more complex situations including unique motif of unknown length in Algorithm 2, unique motif with unknown abundance in Algorithm 3 (see Fig. 2) and multiple motifs. Experimental results over both synthetic and real datasets show that the proposed ISMC algorithm outperforms five other widely used motif discovery algorithms in terms of nucleotide and site-level sensitivity, nucleotide and site-level positive prediction value, nucleotide-level performance coefficient, nucleotide-level correlation coefficient and site-level average site performance.
机译:在转录因子结合位点的基序的发现在转录过程中很重要,对于理解基因调控关系和进化历史至关重要。识别弱的图案并减少局部最优,错误传播和计算复杂性的影响仍然很重要,但对于图案发现而言却是一项艰巨的任务。这项研究提出了一种基于位置权重矩阵和Gibbs采样模型的迭代顺序蒙特卡洛(ISMC)主题发现算法,以在给定的核苷酸序列集中定位保守的主题。已经提出了三个子算法。算法1(见图1)处理每个核苷酸序列中一个固定长度的基序实例的情况。此外,所提出的ISMC算法已扩展为处理更复杂的情况,包括算法2中长度未知的唯一主题,算法3中具有未知丰度的唯一主题(见图2)和多个主题。在合成数据集和真实数据集上的实验结果表明,在核苷酸和位点级敏感性,核苷酸和位点级阳性预测值,核苷酸级性能系数,核苷酸级方面,所提出的ISMC算法优于其他五种广泛使用的基元发现算法相关系数和站点级别的平均站点性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号