...
首页> 外文期刊>Journal of bacteriology >Genome-Scale Analysis of the Uses of the Escherichia coli Genome: Model-Driven Analysis of Heterogeneous Data Sets
【24h】

Genome-Scale Analysis of the Uses of the Escherichia coli Genome: Model-Driven Analysis of Heterogeneous Data Sets

机译:大肠杆菌基因组使用的基因组规模分析:异质数据集的模型驱动分析

获取原文
           

摘要

The recent availability of heterogeneous high-throughput data types has increased the need for scalable in silico methods with which to integrate data related to the processes of regulation, protein synthesis, and metabolism. A sequence-based framework for modeling transcription and translation in prokaryotes has been established and has been extended to study the expression state of the entire Escherichia coli genome. The resulting in silico analysis of the expression state highlighted three facets of gene expression in E. coli: (i) the metabolic resources required for genome expression and protein synthesis were found to be relatively invariant under the conditions tested; (ii) effective promoter strengths were estimated at the genome scale by using global mRNA abundance and half-life data, revealing genes subject to regulation under the experimental conditions tested; and (iii) large-scale genome location-dependent expression patterns with approximately 600-kb periodicity were detected in the E. coli genome based on the 49 expression data sets analyzed. These results support the notion that a structured model-driven analysis of expression data yields additional information that can be subjected to commonly used statistical analyses. The integration of heterogeneous genome-scale data (i.e., sequence, expression data, and mRNA half-life data) is readily achieved in the context of an in silico model.
机译:异构高通量数据类型的最新可用性增加了对可扩展计算机方法的需求,该方法可用于整合与调节,蛋白质合成和代谢过程有关的数据。建立了基于序列的原核生物转录和翻译模型,并已扩展到研究整个大肠杆菌基因组的表达状态。所得表达状态的计算机分析表明了 E基因表达的三个方面。大肠杆菌:(i)在测试条件下,发现基因组表达和蛋白质合成所需的代谢资源相对不变; (ii)通过使用总体mRNA丰度和半衰期数据在基因组规模上估算有效启动子强度,揭示在测试的实验条件下受到调控的基因; (iii)在 E中检测到大约600kb周期性的大规模基因组位置依赖性表达模式。基于分析的49个表达数据集的大肠杆菌基因组。这些结果支持以下观点:结构化的模型驱动的表达数据分析会产生其他信息,这些信息可以接受常用的统计分析。在计算机模拟模型的背景下,很容易实现异质基因组规模数据(即序列,表达数据和mRNA半衰期数据)的整合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号