Ortholog Clustering on a Multipartite Graph

Akshay Vashist; Casimir A. Kulikowski; Ilya Muchnik

首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Ortholog Clustering on a Multipartite Graph

【24h】

Ortholog Clustering on a Multipartite Graph

机译：多部分图上的Ortholog聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a method for automatically extracting groups of orthologous genes from a large set of genomes by a new clustering algorithm on a weighted multipartite graph. The method assigns a score to an arbitrary subset of genes from multiple genomes to assess the orthologous relationships between genes in the subset. This score is computed using sequence similarities between the member genes and the phylogenetic relationship between the corresponding genomes. An ortholog cluster is found as the subset with the highest score, so ortholog clustering is formulated as a combinatorial optimization problem. The algorithm for finding an ortholog cluster runs in time O(|E| + |V| log |V|), where V and E are the sets of vertices and edges, respectively, in the graph. However, if we discretize the similarity scores into a constant number of bins, the runtime improves to O(|E| + |V|). The proposed method was applied to seven complete eukaryote genomes on which the manually curated database of eukaryotic ortholog clusters, KOG, is constructed. A comparison of our results with the manually curated ortholog clusters shows that our clusters are well correlated with the existing clusters

机译：我们提出了一种通过加权多部分图上的新聚类算法自动从一大组基因组中自动提取直系同源基因组的方法。该方法给来自多个基因组的任意基因子集分配分数，以评估子集中基因之间的直系同源关系。使用成员基因之间的序列相似性和相应基因组之间的系统发生关系来计算该分数。发现直系同源簇是得分最高的子集，因此直系同源簇被表述为组合优化问题。查找直系同源簇的算法在时间O（| E | + | V | log | V |）中运行，其中V和E分别是图中的顶点和边的集合。但是，如果我们将相似性分数离散化为恒定数量的bin，则运行时间将提高为O（| E | + | V |）。该方法被应用于七个完整的真核生物基因组，在其上构建了人工策划的真核直系同源簇KOG数据库。将我们的结果与人工策划的直系同源聚类进行比较，结果表明我们的聚类与现有聚类具有很好的相关性

著录项

来源
《IEEE/ACM transactions on computational biology and bioinformatics》 |2007年第2007期|p.17-27|共11页
作者
Akshay Vashist; Casimir A. Kulikowski; Ilya Muchnik;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物数学方法;生物信息论;
关键词
biology computing; cellular biophysics; genetics; graph theory; molecular biophysics; molecular configurations; optimisation; combinatorial optimization problem; eukaryote genomes; eukaryotic ortholog clusters KOG; genomes; ortholog clustering; orthologous genes; phy;

机译：生物学计算;细胞生物物理学;遗传学;图论;分子生物物理学;分子构型;优化;组合优化问题;真核生物基因组;真核直系同源簇KOG;基因组;直系同源簇;直系同源基因;phy;

相似文献

外文文献
中文文献
专利

1. Ortholog Clustering on a Multipartite Graph [J] . Akshay Vashist, Casimir A. Kulikowski, Ilya Muchnik IEEE/ACM transactions on computational biology and bioinformatics . 2007,第1期

机译：多部分图上的Ortholog聚类
2. Optimal clustering of multipartite graphs [J] . Charon I, Hudry O Discrete Applied Mathematics . 2008,第8期

机译：多部分图的最优聚类
3. Spanning multipartite tournaments of semicomplete multipartite digraphs [J] . Lutz Volkmann Ars Combinatoria: An Australian-Canadian Journal of Combinatorics . 2001,第0期

机译：半完全多边有向图的跨越多边锦标赛
4. Screening for Ortholog Clusters Using Multipartite Graph Clustering by Quasi-Concave Set Function Optimization [C] . Akshay Vashist, Casimir Kulikowski, Ilya Muchnik International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing(RSFDGrC 2005) pt.2; 20050831-0903; Regina(CA) . 2005

机译：拟凹函数优化利用多图图聚类筛选直系同源簇
5. Multipartite graph clustering for structured datasets and automating ortholog extraction [D] . Vashist, Akshaya Kumar 2006

机译：用于结构化数据集和自动化直系同源物提取的多部分图聚类
6. Composition and dosage of a multipartite enhancer cluster control developmental expression of Indian hedgehog [O] . Anja J. Will, Giulia Cova, Marco Osterwalder, -1

机译：印度刺猬的多部分增强剂簇控制发育表达的组成和剂量
7. Optimal clustering of multipartite graphs [O] . Charon Irène, Hudry Olivier 2008

机译：多部分图的最优聚类

Ortholog Clustering on a Multipartite Graph

摘要

著录项

相似文献

相关主题

期刊订阅