
SWISS-PROT and Its Computer-Annotated Supplement TREMBL: How to Produce High Quality Automatic Annotation


Abstract

SWISS-PROT (http://www.ebi.ac.uk/ebi_docs/swissprot_db/swisshome.html) is a protein sequence database with a high level of annotation, a high level of integration with other databases, and a minimal level of redundancy [1]. The ongoing genome sequencing projects have dramatically increased the number of known protein sequences. To make the sequence information available as quickly as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries derived from the translation of all coding sequences (CDS) in the EMBL database, except for CDS already included in SWISS-PROT. SWISS-PROT + TREMBL provides the scientific community with a comprehensive non-redundant protein sequence databank. However, there is a clear need for new techniques to enhance the production of SWISS-PROT + TREMBL to cope with the flood of sequence and functional data. To achieve this, we are currently developing new methods to accelerate sequence analysis, information acquisition and data integration. Central to this effort in the future will be EDIT to TREMBL (Environment for Distributed Information Transfer to TREMBL), a system that enables the investigation of different possibilities for sharing and deducing biological information. EDIT to TREMBL analyzes sequences by comparison with the biochemically characterized and well-annotated entries in SWISS-PROT in order to predict the functional properties of TREMBL entries in a standardized way.
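
The construction rule stated in the abstract (translate every CDS, then leave out what SWISS-PROT already contains) can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The names `translate` and `build_supplement` are hypothetical, the standard genetic code is assumed for every CDS, and exact protein-sequence identity is assumed as the test for "already included in SWISS-PROT"; the abstract does not say how that match is actually made.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# every CDS uses the standard code; real entries may use other tables).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def build_supplement(cds_records: dict, curated_proteins: set) -> dict:
    """Return translations of all CDS whose protein is not already curated.

    Hypothetical helper: exclusion by exact sequence identity is an
    assumption made only for this illustration.
    """
    supplement = {}
    for record_id, cds in cds_records.items():
        protein = translate(cds)
        if protein not in curated_proteins:
            supplement[record_id] = protein
    return supplement
```

For example, calling `build_supplement` with two toy CDS records and a curated set that contains the translation of the first one keeps only the second record in the supplement.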
