首页> 外文期刊>Journal of Molecular Biology >Characterization of novel proteins based on known protein structures.
【24h】

Characterization of novel proteins based on known protein structures.

机译:基于已知蛋白质结构的新型蛋白质的表征。

获取原文
获取原文并翻译 | 示例
           

摘要

The genome sciences face the challenge to characterize structure and function of a vast number of novel genes. Sequence search techniques are used to infer functional and structural information from similarities to experimentally characterized genes or proteins. The persistent goal is to refine these techniques and to develop alternative and complementary methods to increase the range of reliable inference.Here, we focus on the structural and functional assignments that can be inferred from the known three-dimensional structures of proteins. The study uses all structures in the Protein Data Bank that were known by the end of 1997. The protein structures released in 1998 were then characterized in terms of functional and structural similarity to the previously known structures, yielding an estimate of the maximum amount of information on novel protein sequences that can be obtained from inference techniques.The 147 globular proteins corresponding to 196 domains released in 1998 have no clear sequence similarity to previously known structures. However, 75 % of the domains have extensive structure similarity to previously known folds, and most importantly, in two out of three cases similarity in structure coincides with related function. In view of this analysis, full utilization of existing structure data bases would provide information for many new targets even if the relationship is not accessible from sequence information alone. Currently, the most sophisticated techniques detect of the order of one-third of these relationships. Copyright 2000 Academic Press.
机译:基因组科学面临着表征大量新基因的结构和功能的挑战。序列搜索技术用于根据与实验表征的基因或蛋白质的相似性推断功能和结构信息。持续的目标是完善这些技术并开发替代和补充方法以增加可靠推断的范围。在此,我们着重于可以从已知的蛋白质三维结构中推断出的结构和功能分配。这项研究使用了1997年底之前已知的蛋白质数据库中的所有结构。然后根据与先前已知结构的功能和结构相似性,对1998年发布的蛋白质结构进行了表征,从而估算出了最大的信息量。 1998年发布的196个结构域对应的147个球状蛋白与先前已知的结构没有明确的序列相似性。但是,75%的结构域与先前已知的折叠具有广泛的结构相似性,最重要的是,在三分之二的情况下,结构相似性与相关功能一致。鉴于这种分析,即使不能仅从序列信息访问该关系,对现有结构数据库的充分利用也将为许多新靶标提供信息。当前,最复杂的技术检测到这些关系的三分之一。版权所有2000学术出版社。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号