Journal of Mathematical Biology

Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent

Abstract

Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals, each with many genes, splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for five or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.
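The four-species case can be illustrated numerically. The sketch below is not taken from the paper. It relies on the standard first-coalescence argument, assuming one gene per species and branch lengths in coalescent units: the unrooted quartet gene tree matches the species tree unless no coalescence occurs on the internal edge of the unrooted species tree, in which case the three quartet topologies are equally likely. The taxon labels `a`, `b`, `c`, `d` and all function names are hypothetical, chosen only for this example.

```python
import math

def quartet_probs(internal_length):
    """Probabilities of the three unrooted quartet gene tree topologies.

    internal_length is the length (coalescent units) of the internal
    edge of the unrooted species tree ab|cd. Illustrative helper only.
    """
    miss = math.exp(-internal_length)  # chance of no coalescence on that edge
    return {
        "ab|cd": 1.0 - 2.0 * miss / 3.0,  # matches the species tree
        "ac|bd": miss / 3.0,
        "ad|bc": miss / 3.0,
    }

def caterpillar(x, y):
    """Rooted species tree (((a,b):x,c):y,d).

    Only x is an internal edge once the root is suppressed, so y does
    not affect the unrooted gene tree distribution.
    """
    return quartet_probs(x)

def balanced(x, y):
    """Rooted species tree ((a,b):x,(c,d):y).

    Suppressing the root merges x and y into one internal edge.
    """
    return quartet_probs(x + y)

def internal_length_from_prob(p_match):
    """Invert p = 1 - (2/3) * exp(-t) to recover the internal edge length t."""
    return -math.log(1.5 * (1.0 - p_match))

if __name__ == "__main__":
    print(caterpillar(1.0, 0.3))
    print(caterpillar(1.0, 2.0))  # same distribution: y is not recoverable
    print(balanced(0.4, 0.6))     # same again: the root position is hidden
```

Under these assumptions, a caterpillar tree with internal length 1.0 and a balanced tree whose two internal lengths sum to 1.0 give the same distribution, which is one way to see why four species cannot locate the root, while the length of the unrooted internal edge can still be recovered with `internal_length_from_prob`.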
