首页> 外文期刊>Journal of Bioinformatics and Computational Biology >EXCLUSIVE SEQUENCES OF DIFFERENT GENOMES
【24h】

EXCLUSIVE SEQUENCES OF DIFFERENT GENOMES

机译:不同基因组的排他性序列

获取原文
获取原文并翻译 | 示例
           

摘要

We studied the distribution of 1–7 bp words in a dataset that includes 139 complete eukaryotic genomes, 33 masked eukaryotic genomes and coding regions from 35 genomes. We tested different statistical models to determine over- and under-represented words. The method described by Karlin et al. has the strongest predictive power compared to other methods. Using this method we identified over- and under-represented words consistent within a large array of taxonomic groups. Some of those words have not yet been described as exclusive. For example, CGCG is over-represented in CG-deficient organisms. We also describe exceptions for widely known exclusive words, such as CG and TA.
机译:我们研究了一个数据集中1–7 bp单词的分布,该数据集包含139个完整的真核基因组,33个蒙面的真核基因组和35个基因组的编码区。我们测试了不同的统计模型,以确定出现过多和不足的单词。 Karlin等人描述的方法。与其他方法相比,具有最强的预测能力。使用这种方法,我们确定了在大量分类学组中一致的过多和不足的单词。这些词中的一些尚未被描述为排他性的。例如,CGCG在缺乏CG的生物体中过分表达。我们还将描述CG和TA等广为人知的专有词的例外。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号