Aligning and Clustering Patterns to Reveal the Protein Functionality of Sequences

Wong A.K.; Lee E.-S.A.

Abstract

Discovering sequence patterns with variations unveils significant functions of a protein family. Existing combinatorial methods of discovering patterns with variations are computationally expensive, and probabilistic methods require a more elaborate probabilistic representation of the amino acid associations. To overcome these shortcomings, this paper presents a new, computationally efficient method for representing patterns with variations in a compact form called the Aligned Pattern Cluster (AP Cluster). To reduce the runtime, our method discovers a shortened list of non-redundant, statistically significant sequence associations based on our previous work. To address the representation of protein functional regions, the pattern alignment and clustering step presented in this paper captures the conservations and variations of the aligned patterns. We further refine our solution to cover more sequences by extending the AP Clusters, which contain only statistically significant patterns, to Weak and Conserved AP Clusters. When applied to the cytochrome c, ubiquitin, and triosephosphate isomerase protein families, our algorithm identifies the binding segments as well as the binding residues. When compared to other methods, ours discovers all binding sites in the AP Clusters with superior entropy and coverage. The identification of patterns with variations helps biologists avoid time-consuming simulations and experiments. (Software available upon request.)
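To make the two evaluation terms in the abstract concrete, the following is a minimal sketch of how per-column entropy and sequence coverage could be computed for a set of already aligned patterns. It is not the authors' implementation: the function names `column_entropies` and `coverage`, the use of Shannon entropy in bits, the substring-based definition of coverage, and the toy patterns and sequences are all assumptions made for illustration.

```python
import math
from collections import Counter


def column_entropies(aligned_patterns):
    """Shannon entropy (in bits) of each column of equal-length aligned patterns.

    Assumption: a low-entropy column is read as a conserved position and a
    high-entropy column as a variable position.
    """
    length = len(aligned_patterns[0])
    if any(len(p) != length for p in aligned_patterns):
        raise ValueError("all aligned patterns must have the same length")
    entropies = []
    for column in zip(*aligned_patterns):
        counts = Counter(column)
        total = len(column)
        # sum of p * log2(1/p) over the residues observed in this column
        h = sum((n / total) * math.log2(total / n) for n in counts.values())
        entropies.append(h)
    return entropies


def coverage(patterns, sequences):
    """Fraction of sequences containing at least one pattern as a substring.

    Assumption: this is one simple reading of "coverage"; the paper may
    define it differently.
    """
    if not sequences:
        return 0.0
    hits = sum(any(p in s for p in patterns) for s in sequences)
    return hits / len(sequences)


# Hypothetical toy data: four aligned patterns and four sequences.
cluster = ["CAACH", "CSQCH", "CAQCH", "CAACH"]
sequences = ["GDVEKGKKIFVQKCAQCHTVEK", "MKCAACHAAA", "GGGGGG", "XXCSQCHXX"]

entropies = column_entropies(cluster)
covered = coverage(cluster, sequences)
```

In this toy cluster the first, fourth, and fifth columns are fully conserved, so their entropy is zero, while the second and third columns carry the variation.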


Bibliographic details

Source: Computational Biology and Bioinformatics, IEEE/ACM Transactions on, 2014, no. 3, pp. 548-560 (13 pages)
Authors: Wong A.K.; Lee E.-S.A.
Affiliation: Department of Systems Design Engineering, University of Waterloo, Waterloo, Canada
Original format: PDF
Language: English
Keywords: Aligned pattern cluster; hierarchical clustering; protein function prediction; sequence pattern

Similar literature

1. Expression clustering reveals detailed co-expression patterns of functionally related proteins during B cell differentiation: a proteomic study using a combination of one-dimensional gel electrophoresis, LC-MS/MS, and stable isotope labeling by amino [J]. Romijn EP, Christis C, Wieffer M. Molecular & cellular proteomics: MCP. 2005, no. 9.
2. Unsupervised Clustering of Subcellular Protein Expression Patterns in High-Throughput Microscopy Images Reveals Protein Complexes and Functional Relationships between Proteins [J]. Louis-François Handfield, Yolanda T. Chong, Jibril Simmons. PLoS Computational Biology. 2013, no. 6.
3. Discovering Patterns From Sequences Using Pattern-Directed Aligned Pattern Clustering [J]. Antonio Sze-To, Andrew K. C. Wong. NanoBioscience, IEEE Transactions on. 2018, no. 3.
4. Identifying protein binding functionality of protein family sequences by Aligned Pattern clusters [C]. Lee En-Shiun Annie, Wong Andrew K. C. 2012 IEEE International Conference on Bioinformatics and Biomedicine. 2012.
5. Algorithms for aligning and clustering genomic sequences that contain duplications [D]. Hou, Minmei. 2007.
6. Unsupervised Clustering of Subcellular Protein Expression Patterns in High-Throughput Microscopy Images Reveals Protein Complexes and Functional Relationships between Proteins [O]. Louis-François Handfield, Yolanda T. Chong, Jibril Simmons. 2013.
7. Discovering Protein Functional Regions and Protein-Protein Interaction using Co-occurring Aligned Pattern Clusters [O]. Fung Sanderz. 2015.
