...
【24h】

GC/AT-content spikes as genomic punctuation marks

机译:GC / AT含量飙升为基因组标点符号

获取原文
获取原文并翻译 | 示例
           

摘要

Large-scale analysis of the GC-content distribution at the gene level reveals both common features and basic differences in genomes of different groups of species. Sharp changes in GC content are detected at the transcription boundaries for all species analyzed, including human, mouse, rat, chicken, fruit fly, and worm. However, two substantially distinct groups of GC-content profiles can be recognized: warm-blooded vertebrates including human, mouse, rat, and chicken, and invertebrates including fruit fly and worm. In vertebrates, sharp positive and negative spikes of GC content are observed at the transcription start and stop sites, respectively, and there is also a progressive decrease in GC content from the 5' untranslated region to the 3' untranslated region along the gene. In invertebrates, the positive and negative GC-content spikes at the transcription start and stop sites are preceded by spikes of opposite value, and the highest GC content is found in the coding regions of the genes. Cross-correlation analysis indicates high frequencies of GC-content spikes at transcription start and stop sites. The strong conservation of this genomic feature seen in comparisons of the human/mouse and human/rat orthologs, and the clustering of genes with GC-content spikes on chromosomes imply a biological function. The GC-content spikes at transcription boundaries may reflect a general principle of genomic punctuation. Our analysis also provides means for identifying these GC-content spikes in individual genomic sequences.
机译:在基因水平上对GC含量分布的大规模分析揭示了不同物种组的基因组的共同特征和基本差异。在分析的所有物种(包括人类,小鼠,大鼠,鸡,果蝇和蠕虫)的转录边界处检测到GC含量的急剧变化。但是,可以识别出两组基本不同的GC含量曲线:温血脊椎动物,包括人,小鼠,大鼠和鸡,无脊椎动物,包括果蝇和蠕虫。在脊椎动物中,分别在转录起始和终止位点观察到GC含量的尖锐的正负尖峰,并且沿着该基因从5'非翻译区到3'非翻译区的GC含量也逐渐降低。在无脊椎动物中,转录起始和终止位点的正负GC含量峰值之前是相反值的峰值,并且在基因的编码区中发现了最高的GC含量。互相关分析表明,在转录起始和终止位点,GC含量峰值的频率很高。在人类/小鼠和人类/大鼠直系同源物的比较中可以看到这种基因组特征的强大保守性,并且染色体上具有GC含量峰值的基因簇暗示了生物学功能。转录边界处的GC含量峰值可能反映了基因组标点的一般原理。我们的分析还提供了识别单个基因组序列中这些GC含量峰值的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号