Highly Scalable Genotype Phasing by Entropy Minimization

Gusev Alexander; Măndoiu Ion I.; Paşaniuc Bogdan

首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Highly Scalable Genotype Phasing by Entropy Minimization

【24h】

Highly Scalable Genotype Phasing by Entropy Minimization

机译：通过熵最小化实现高度可扩展的基因型定相

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A Single Nucleotide Polymorphism (SNP) is a positionin the genome at which two or more of the possible fournucleotides occur in a large percentage of the population. SNPsaccount for most of the genetic variability between individuals,and mapping SNPs in the human population has become thenext high-priority in genomics after the completion of the HumanGenome project. In diploid organisms such as humans, thereare two non-identical copies of each autosomal chromosome. Adescription of the SNPs in a chromosome is called a haplotype.At present, it is prohibitively expensive to directly determine thehaplotypes of an individual, but it is possible to obtain rather easilythe conflated SNP information in the so called genotype. Computationalmethods for genotype phasing, i.e., inferring haplotypesfrom genotype data, have received much attention in recent yearsas haplotype information leads to increased statistical power ofdisease association tests. However, many of the existing algorithmshave impractical running time for phasing large genotype datasetssuch as those generated by the international HapMap project.In this paper we propose a highly scalable algorithm based onentropy minimization. Our algorithm is capable of phasing bothunrelated and related genotypes coming from complex pedigrees.Experimental results on both real and simulated datasets showthat our algorithm achieves a phasing accuracy worse but closeto that of best existing methods while being several orders ofmagnitude faster. The open source code implementation of thealgorithm and a web interface are publicly available at http://dna.engr.uconn.edu/~software/ent/.

机译：单核苷酸多态性（SNP）是基因组中的一个位置，在人口中有很大比例的两个或多个可能的四核苷酸存在。 SNP占据了个体之间的大部分遗传变异性，在人类基因组计划完成后，绘制人群中的SNP成为基因组学中的头等大事。在人类等二倍体生物中，每个常染色体有两个不同的副本。在染色体中对SNP的描述被称为单倍型。目前，直接确定一个人的单倍型是非常昂贵的，但是可以很容易地获得所谓的基因型的混合SNP信息。近年来，由于单倍型信息导致疾病关联测试的统计能力提高，因此用于基因型定相的计算方法（即从基因型数据推断单倍型）受到了广泛关注。然而，现有的许多算法在定级大型基因型数据集（如国际HapMap项目生成的基因型数据集）时，运行时间都不切实际。本文提出了一种基于熵最小化的高度可扩展算法。我们的算法能够对复杂谱系中不相关和相关的基因型进行定相。在真实数据集和模拟数据集上的实验结果均表明，我们的算法的定相精度较差，但与最佳现有方法相近，但速度却快了几个数量级。该算法的开放源代码实现和Web界面可从http://dna.engr.uconn.edu/~software/ent/公开获得。

著录项

来源
《IEEE/ACM transactions on computational biology and bioinformatics》 |2008年第2期|p.252-261|共10页
作者
Gusev Alexander; Măndoiu Ion I.; Paşaniuc Bogdan;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物数学方法;生物信息论;
关键词
Single Nucleotide Polymorphism; algorithm.; genotype phasing; haplotype;

机译：单核苷酸多态性;算法;基因型定相;单倍型;

相似文献

外文文献
中文文献
专利

1. Highly Accurate Target Motion Compensation Using Entropy Function Minimization [J] . Amin Aghatabar Roodbary, Mohammad Hassan Bastani International Journal of Information Technology . 2018,第8期

机译：使用熵函数最小化的高精度目标运动补偿
2. A fast and flexible statistical model for large-scale population genotype data: Applications to inferring missing genotypes and haplotypic phase [J] . Scheet P, Stephens M The American Journal of Human Genetics . 2006,第4期

机译：快速，灵活的大规模人口基因型数据统计模型：推断缺失的基因型和单倍型的应用
3. Large-scale genotyping of highly polymorphic loci by next-generation sequencing: how to overcome the challenges to reliably genotype individuals? [J] . Ferrandiz-Rovira M., Bigot T., Allaine D., Heredity: An International Journal of Genetics . 2015,第5期

机译：通过下一代测序对高度多态性基因座进行大规模基因分型：如何克服对可靠基因型个体的挑战？
4. Highly Scalable Genotype Phasing by Entropy Minimization [C] . Pasaniuc, Bogdan, Mandoiu, Engineering in Medicine and Biology Society, 2006 Annual International Conference of the IEEE . 2006

机译：通过熵最小化实现高度可扩展的基因型定相
5. Characterization of the Genotypes, Phenotypes and Neurotropism of the HIV-1 Envelope glycoproteins from two highly Neurotoxic CSF-derived isolates. [D] . Sierra, Luz-Jeannette. 2014

机译：从两个高度神经毒性的脑脊液来源分离株中HIV-1包膜糖蛋白的基因型，表型和嗜神经性的表征。
6. Phase space volume scaling of generalized entropies and anomalous diffusion scaling governed by corresponding non-linear Fokker-Planck equations [O] . Dániel Czégel, Sámuel G. Balogh, Péter Pollner, -1

机译：由相应的非线性Fokker-Planck方程控制的广义熵的相空间体积缩放和反常扩散缩放
7. Highly Scalable Genotype Phasing by Entropy Minimization [O] . 2015

机译：通过熵最小化实现高度可扩展的基因型定相

Highly Scalable Genotype Phasing by Entropy Minimization

摘要

著录项

相似文献

相关主题

期刊订阅