...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >ncRNA consensus secondary structure derivation using grammar strings.
【24h】

ncRNA consensus secondary structure derivation using grammar strings.

机译:使用语法字符串进行ncRNA共有二级结构推导。

获取原文
获取原文并翻译 | 示例
           

摘要

Many noncoding RNAs (ncRNAs) function through both their sequences and secondary structures. Thus, secondary structure derivation is an important issue in today's RNA research. The state-of-the-art structure annotation tools are based on comparative analysis, which derives consensus structure of homologous ncRNAs. Despite promising results from existing ncRNA aligning and consensus structure derivation tools, there is a need for more efficient and accurate ncRNA secondary structure modeling and alignment methods. In this work, we introduce a consensus structure derivation approach based on grammar string, a novel ncRNA secondary structure representation that encodes an ncRNA's sequence and secondary structure in the parameter space of a context-free grammar (CFG) and a full RNA grammar including pseudoknots. Being a string defined on a special alphabet constructed from a grammar, grammar string converts ncRNA alignment into sequence alignment. We derive consensus secondary structures from hundreds of ncRNA families from BraliBase 2.1 and 25 families containing pseudoknots using grammar string alignment. Our experiments have shown that grammar string-based structure derivation competes favorably in consensus structure quality with Murlet and RNASampler. Source code and experimental data are available at http://www.cse.msu.edu/~yannisun/grammar-string.
机译:许多非编码RNA(ncRNA)通过其序列和二级结构起作用。因此,二级结构推导是当今RNA研究中的重要问题。最新的结构注释工具基于比较分析,可得出同源ncRNA的共有结构。尽管现有的ncRNA比对和共有结构衍生工具带来了可喜的结果,但仍需要更有效和准确的ncRNA二级结构建模和比对方法。在这项工作中,我们介绍了一种基于语法字符串的共有结构推导方法,一种新颖的ncRNA二级结构表示形式,它在上下文无关文法(CFG)的参数空间中编码ncRNA的序列和二级结构,以及包括伪结的完整RNA语法。语法字符串是在由语法构成的特殊字母上定义的字符串,可将ncRNA比对转换为序列比对。我们使用文法字符串比对从数百个来自BraliBase 2.1的ncRNA家族和25个包含假结的家族中衍生出共有的二级结构。我们的实验表明,基于语法字符串的结构推导与Murlet和RNASampler在共有结构质量方面具有良好的竞争优势。源代码和实验数据可从http://www.cse.msu.edu/~yannisun/grammar-string获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号