Comparing Genomes with Duplications: A Computational Complexity Point of View

Blin Guillaume; Chauve Cedric; Fertin Guillaume; Rizzi Romeo; Vialette Stephane

Abstract

In this paper, we are interested in the computational complexity of computing (dis)similarity measures between two genomes when they contain duplicated genes or genomic markers, a situation that arises frequently when comparing whole nuclear genomes. Recently, several methods ([1], [2]) have been proposed that compute a given (dis)similarity measure M between two genomes G_1 and G_2 in two steps: first, one establishes a one-to-one correspondence between genes of G_1 and genes of G_2; second, once this correspondence is established, it explicitly defines a permutation, and it is then possible to quantify the similarity of the two genomes using classical measures defined for permutations, like the number of breakpoints. Hence these methods rely on two elements: a way to establish a one-to-one correspondence between genes of a pair of genomes, and a (dis)similarity measure for permutations. The problem is then, given a (dis)similarity measure for permutations, to compute a correspondence that defines an optimal permutation for this measure. We are interested here in two models to compute a one-to-one correspondence: the exemplar model, where all but one copy are deleted in both genomes for each gene family, and the matching model, which computes a maximal correspondence for each gene family. We show that for these two models, and for three (dis)similarity measures on permutations, namely the number of common intervals, the maximum adjacency disruption (MAD) number and the summed adjacency disruption (SAD) number, the problem of computing an optimal correspondence is NP-complete, and even APX-hard for the MAD number and SAD number.
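
The two-step scheme described in the abstract can be illustrated with a small sketch for the exemplar model and the number of common intervals. This is not the authors' algorithm: it is a brute-force enumeration written only to make the definitions concrete, and it runs in exponential time, which is in line with the hardness results stated above. The function names (`exemplar_candidates`, `common_intervals`, `best_exemplar`) are hypothetical, genomes are assumed to be unsigned lists of gene-family labels over the same set of families, and only intervals of at least two genes are counted (conventions differ on whether singletons count).

```python
from itertools import product


def exemplar_candidates(genome):
    """Yield every exemplar version of `genome`.

    An exemplar version keeps exactly one copy of each gene family and
    preserves the relative order of the kept genes.
    """
    positions = {}
    for idx, gene in enumerate(genome):
        positions.setdefault(gene, []).append(idx)
    families = sorted(positions)
    # Pick one position per family, in every possible way.
    for choice in product(*(positions[f] for f in families)):
        yield [genome[i] for i in sorted(choice)]


def common_intervals(p1, p2):
    """Count common intervals (size >= 2) of two permutations of one gene set.

    A segment p1[i..j] is a common interval if its genes occupy
    consecutive positions in p2, in any order.
    """
    pos_in_p2 = {gene: i for i, gene in enumerate(p2)}
    mapped = [pos_in_p2[gene] for gene in p1]
    count = 0
    for i in range(len(mapped)):
        lo = hi = mapped[i]
        for j in range(i + 1, len(mapped)):
            lo = min(lo, mapped[j])
            hi = max(hi, mapped[j])
            # The segment is contiguous in p2 iff its position range is tight.
            if hi - lo == j - i:
                count += 1
    return count


def best_exemplar(g1, g2):
    """Brute-force search for the exemplar pair maximizing common intervals.

    Assumption: both genomes contain the same gene families.
    Returns (score, exemplar_of_g1, exemplar_of_g2).
    """
    if set(g1) != set(g2):
        raise ValueError("genomes must contain the same gene families")
    best = None
    for e1 in exemplar_candidates(g1):
        for e2 in exemplar_candidates(g2):
            score = common_intervals(e1, e2)
            if best is None or score > best[0]:
                best = (score, e1, e2)
    return best
```

For example, with `g1 = ["a", "b", "c", "a"]` and `g2 = ["a", "c", "b"]`, keeping the second copy of `a` in `g1` gives the exemplar `["b", "c", "a"]`, which is the reverse of `g2` and therefore shares every interval with it. The matching model would replace the "one copy per family" selection with a maximal pairing of copies within each family; it is not sketched here.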

Bibliographic details

Source
IEEE/ACM Transactions on Computational Biology and Bioinformatics | 2007, Issue 4 | pp. 523-534 | 12 pages
Authors
Blin Guillaume; Chauve Cedric; Fertin Guillaume; Rizzi Romeo; Vialette Stephane
Original format: PDF
Language: English
Chinese Library Classification: Biomathematical methods; Bioinformation theory
Keywords
Comparative genomics; common intervals; computational complexity; maximum adjacency disruption number; summed adjacency disruption number

