Heuristic Bayesian Segmentation for Discovery of Coexpressed Genes within Genomic Regions

首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Heuristic Bayesian Segmentation for Discovery of Coexpressed Genes within Genomic Regions

【24h】

Heuristic Bayesian Segmentation for Discovery of Coexpressed Genes within Genomic Regions

机译：用于基因组区域内共表达基因发现的启发式贝叶斯分割

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Segmentation aims to separate homogeneous areas from the sequential data, and plays a central role in data mining. It has applications ranging from finance to molecular biology, where bioinformatics tasks such as genome data analysis are active application fields. In this paper, we present a novel application of segmentation in locating genomic regions with coexpressed genes. We aim at automated discovery of such regions without requirement for user-given parameters. In order to perform the segmentation within a reasonable time, we use heuristics. Most of the heuristic segmentation algorithms require some decision on the number of segments. This is usually accomplished by using asymptotic model selection methods like the Bayesian information criterion. Such methods are based on some simplification, which can limit their usage. In this paper, we propose a Bayesian model selection to choose the most proper result from heuristic segmentation. Our Bayesian model presents a simple prior for the segmentation solutions with various segment numbers and a modified Dirichlet prior for modeling multinomial data. We show with various artificial data sets in our benchmark system that our model selection criterion has the best overall performance. The application of our method in yeast cell-cycle gene expression data reveals potential active and passive regions of the genome.

机译：分割的目的是从顺序数据中分离出同类区域，并在数据挖掘中发挥核心作用。它的应用范围从金融到分子生物学，其中生物信息学任务（例如基因组数据分析）是活跃的应用领域。在本文中，我们提出了分割在定位具有共表达基因的基因组区域中的新应用。我们旨在自动发现此类区域，而无需用户提供参数。为了在合理的时间内执行细分，我们使用了启发式方法。大多数启发式分割算法都需要对段数做出一些决定。这通常通过使用渐近模型选择方法（如贝叶斯信息准则）来完成。此类方法基于一些简化，可能会限制其使用。在本文中，我们提出了一种贝叶斯模型选择，以从启发式分割中选择最合适的结果。我们的贝叶斯模型为具有各种段号的细分解决方案提供了简单的先验条件，为模型化多项式数据提供了改进的Dirichlet先验条件。我们在基准系统中通过各种人工数据集表明，我们的模型选择标准具有最佳的整体性能。我们的方法在酵母细胞周期基因表达数据中的应用揭示了基因组的潜在主动和被动区域。

著录项

来源
《Computational Biology and Bioinformatics, IEEE/ACM Transactions on》 |2010年第1期|p.37-49|共13页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Biology and genetics; and association rules; association rules; classification; clustering; segmentation; segmentation.;

机译：生物学和遗传学;以及关联规则;关联规则;分类;聚类;分段;分段。;

相似文献

外文文献
中文文献
专利

1. Discovery of genomic regions and candidate genes for grain weight employing next generation sequencing based QTL-seq approach in rice (Oryza sativa L.) [J] . Bommisetty Reddyyamini, Chakravartty Navajeet, Bodanapu Reddaiah, Molecular biology reports . 2020,第11期

机译：基因组区域发现基于水稻（Oryza Sativa L.）的下一代QTL-SEQ方法的晶粒重量和候选基因
2. Discovery of genomic regions and candidate genes controlling shelling percentage using QTL‐seq approach in cultivated peanut (Arachis hypogaea L.) [J] . Huaiyong Luo, Manish K. Pandey, Aamir W. Khan, Plant Biotechnology Journal . 2019,第7期

机译：在栽培花生中使用QTL-SEQ方法控制基因组区域和候选基因的候选基因（Arachis Hypogaea L.）
3. Genomic gains and losses influence expression levels of genes located within the affected regions: a study on acute myeloid leukemias with trisomy 8, 11, or 13, monosomy 7, or deletion 5q [J] . C Schoch, A Kohlmann, M Dugas, Leukemia . 2005,第7期

机译：基因组得失影响受影响区域内基因的表达水平：一项关于8号，11号或13号，7号或5q缺失的急性髓样白血病的研究
4. Cell segmentation from phase-contrast images using hybrid watershed and region growing algorithm for genomic drug discovery [C] . Orikawa Jo, Tanaka Toshiyuki Proceedings of SICE Annual Conference 2010 . 2010

机译：利用混合分水岭和区域增长算法从相衬图像中进行细胞分割，用于基因组药物发现
5. Discovery and deconvolution of an ensemble of G quadruplex structures located within the 3' proximal promoter region of the tyrosine hydroxylase gene. [D] . Sewell, Abby Leyda. 2011

机译：发现和解卷积位于酪氨酸羟化酶基因3'近端启动子区域内的G四链体结构的集合。
6. The Genomic Region Encompassing the Nephropathic Cystinosis Gene (CTNS): Complete Sequencing of a 200-kb Segment and Discovery of a Novel Gene within the Common Cystinosis-Causing Deletion [O] . Jeffrey W. Touchman, Yair Anikster, Nicole L. Dietrich, 2000

机译：包含肾病性膀胱炎基因（CTNS）的基因组区域：200 kb节段的完整测序和常见的引起膀胱炎的缺失中发现一个新基因。
7. The Genomic Region Encompassing the Nephropathic Cystinosis Gene (CTNS): Complete Sequencing of a 200-kb Segment and Discovery of a Novel Gene within the Common Cystinosis-Causing Deletion [O] . Touchman, Jeffrey W., Anikster, Yair, Dietrich, Nicole L., 2000

机译：包含肾病性膀胱炎基因（CTNS）的基因组区域：200 kb节段的完整测序和常见的引起膀胱炎的缺失中发现一个新基因。

Heuristic Bayesian Segmentation for Discovery of Coexpressed Genes within Genomic Regions

摘要

著录项

相似文献

相关主题

期刊订阅