...
首页> 外文期刊>Proceedings of the National Academy of Sciences of the United States of America >A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome.
【24h】

A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome.

机译:一种计算和实验方法,用于验证果蝇果蝇基因组中的注释和基因预测。

获取原文
获取原文并翻译 | 示例
           

摘要

Five years after the completion of the sequence of the Drosophila melanogaster genome, the number of protein-coding genes it contains remains a matter of debate; the number of computational gene predictions greatly exceeds the number of validated gene annotations. We have assembled a collection of >10,000 gene predictions that do not overlap existing gene annotations and have developed a process for their validation that allows us to efficiently prioritize and experimentally validate predictions from various sources by sequencing RT-PCR products to confirm gene structures. Our data provide experimental evidence for 122 protein-coding genes. Our analyses suggest that the entire collection of predictions contains only approximately 700 additional protein-coding genes. Although we cannot rule out the discovery of genes with unusual features that make them refractory to existing methods, our results suggest that the D. melanogaster genome contains approximately 14,000 protein-coding genes.
机译:果蝇果蝇基因组序列完成五年后,它所包含的蛋白质编码基因的数量仍是一个有争议的问题。计算基因预测的数量大大超过了经过验证的基因注释的数量。我们已经收集了超过10,000个与现有基因注释不重叠的基因预测的集合,并开发了一种验证方法,使我们能够通过对RT-PCR产物进行测序来确认基因结构,从而对各种来源的预测进行有效的优先排序和实验验证。我们的数据为122种蛋白质编码基因提供了实验证据。我们的分析表明,预测的整个集合仅包含大约700个其他蛋白质编码基因。尽管我们不能排除发现具有不寻常特征的基因的发现,这些特征使它们无法抵抗现有方法,但我们的结果表明黑腹果蝇的基因组包含大约14,000个蛋白质编码基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号