Advances in Computational Identification and Modeling of DNA Regulatory Elements in the Human Genome

机译：人类基因组中DNA调控元件的计算鉴定和建模研究进展

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Identification of DNA regulatory elements in the human genome remains a significant challenge. Variation in these regulatory elements can contribute to disease in many ways by altering protein levels. Enhancers constitute an important class of these DNA regulatory elements, and a major component of current research is focused on a more complete understanding of enhancer function and improved techniques for enhancer detection. We recently developed a computational approach to identify enhancers from primary DNA sequence using a support vector machine (kmer-SVM) framework. Here we show that the kmer-SVM model can accurately predict tissue specific enhancer activity without any prior knowledge about TF binding sites. We adapt this approach to predict genomic TF binding data generated by the ENCODE project, showing that genomic MYC binding can be accurately predicted from local DNA sequence with the kmer-SVM. We find similar accuracy with an SVM using PWMs representing known TF binding specificities. By integrating Chip-seq and expression data, we show that while much of MYC binding is shared between ENCODE cell types and is promoter proximal, cell-type specific MYC binding is distal and is correlated with enhanced cell-specific expression of nearby (～50kb) genes. The distinction between shared and cell-specific MYC binding is determined by DNA sequence variation around the canonical MYC binding site, which by itself cannot distinguish cell-specific binding events. These results suggest that tissue specific enhancer activity is specified by primary DNA sequence, that local sequence context controls tissue specific activity through cooperative TF interactions, and that local context sequence features can be identified from genomic binding data.

机译：鉴定人类基因组中的DNA调控元件仍然是一项重大挑战。这些调节元件的变化可通过改变蛋白质水平以多种方式导致疾病。增强子构成了这些DNA调控元件的重要类别，当前研究的主要内容集中在对增强子功能的更全面理解以及增强子检测的改进技术上。我们最近开发了一种使用支持向量机（kmer-SVM）框架从一级DNA序列中识别增强子的计算方法。在这里，我们显示kmer-SVM模型可以准确预测组织特异的增强子活性，而无需任何有关TF结合位点的先验知识。我们采用这种方法来预测由ENCODE项目生成的基因组TF结合数据，表明可以使用kmer-SVM从本地DNA序列准确预测基因组MYC结合。我们发现使用代表已知TF结合特异性的PWM的SVM具有相似的准确性。通过整合Chip-seq和表达数据，我们发现，虽然许多MYC结合在ENCODE细胞类型之间共享并且位于启动子的近端，但细胞类型特异性MYC结合却在远端，并且与附近的增强的细胞特异性表达相关（〜50kb ）基因。共享的和细胞特异性的MYC结合之间的区别是由规范的MYC结合位点周围的DNA序列变异决定的，而DNA序列变异本身不能区分细胞特异性的结合事件。这些结果表明，组织特异性增强子活性由初级DNA序列指定，局部序列背景通过协同TF相互作用控制组织特异性活性，并且局部背景序列特征可以从基因组结合数据中鉴定。

著录项

来源
《4th International Conference on Biomedical Engineering in Vietnam》|2012年|328-331|共4页
会议地点 Chi Minh City(VN)
作者
Dongwon Lee; Michael A. Beer;
展开▼
作者单位

Johns Hopkins University, Department of Biomedical Engineering and McKusick-Nathans Institute of Genetic Medicine, Baltimore, MD, USA;

Johns Hopkins University, Department of Biomedical Engineering and McKusick-Nathans Institute of Genetic Medicine, Baltimore, MD, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
computational biology; genomics; transcriptional regulation; enhancers;

机译：计算生物学；基因组学转录调控；增强剂;

相似文献

外文文献
中文文献
专利

1. Genome-wide computational and expression analyses reveal G-quadruplex DNA motifs as conserved cis-regulatory elements in human and related species [J] . Verma A, Halder K, Halder R, Journal of Medicinal Chemistry . 2008,第18期

机译：全基因组的计算和表达分析表明，G-四链体DNA基序是人类和相关物种中保守的顺式调控元件
2. Identification of Regulatory DNA Elements Using Genome-wide Mapping of DNase I Hypersensitive Sites during Tomato Fruit Development [J] . Zhengkun Qiu, Ren Li, Shuaibin Zhang, 分子植物（英文版） . 2016,第008期

机译：在番茄果实发育过程中使用全基因组的DNase I超敏位点鉴定调控DNA元件
3. Advances of DNase-seq for mapping active gene regulatory elements across the genome in animals [J] . Chen Ailing, Chen Daozhen, Chen Ying Gene: An International Journal Focusing on Gene Cloning and Gene Structure and Function . 2018,第期

机译：DNase-SEQ在动物基因组中映射活性基因调节元件的研究进展
4. Advances in Computational Identification and Modeling of DNA Regulatory Elements in the Human Genome [C] . Dongwon Lee, Michael A. Beer International Conference on Biomedical Engineering in Vietnam . 2013

机译：人类基因组中DNA调节元件的计算鉴定与建模的进展
5. Repetitive DNA sequence elements and DNA polymerases modulate instability in the human genome. [D] . Walsh, Erin. 2014

机译：重复的DNA序列元件和DNA聚合酶调节人类基因组中的不稳定性。
6. Genome-Wide Computational Identification of Biologically Significant Cis-Regulatory Elements and Associated Transcription Factors from Rice [O] . Chai-Ling Ho, Matt Geisler 2019

机译：水稻生物学上重要的顺式调控元件和相关转录因子的全基因组计算鉴定
7. TTS Mapping: integrative WEB tool for analysis of triplex formation target DNA Sequences, G-quadruplets and non-protein coding regulatory DNA elements in the human genome [O] . Kuznetsov Vladimir A, Jenjaroenpun Piroon 2009

机译： TTS映射：集成的WEB工具，用于分析人类基因组中三链体形成靶DNA序列，G-四联体和非蛋白质编码调控DNA元素

相关主题

Advances in Computational Identification and Modeling of DNA Regulatory Elements in the Human Genome

摘要

著录项

相似文献

相关主题

期刊订阅