MisPred: a resource for identification of erroneous protein sequences in public databases

Alinda Nagy; László Patthy

首页> 外文期刊>Database >MisPred: a resource for identification of erroneous protein sequences in public databases

【24h】

MisPred: a resource for identification of erroneous protein sequences in public databases

机译：MisPred：在公共数据库中用于识别错误蛋白质序列的资源

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Correct prediction of the structure of protein-coding genes of higher eukaryotes is still a difficult task; therefore, public databases are heavily contaminated with mispredicted sequences. The high rate of misprediction has serious consequences because it significantly affects the conclusions that may be drawn from genome-scale sequence analyses of eukaryotic genomes. Here we present the MisPred database and computational pipeline that provide efficient means for the identification of erroneous sequences in public databases. The MisPred database contains a collection of abnormal, incomplete and mispredicted protein sequences from 19 metazoan species identified as erroneous by MisPred quality control tools in the UniProtKB/Swiss-Prot, UniProtKB/TrEMBL, NCBI/RefSeq and EnsEMBL databases. Major releases of the database are automatically generated and updated regularly. The database (http://www.mispred.com) is easily accessible through a simple web interface coupled to a powerful query engine and a standard web service. The content is completely or partially downloadable in a variety of formats. Database URL: http://www.mispred.com

机译：正确预测高级真核生物蛋白质编码基因的结构仍然是一项艰巨的任务。因此，公共数据库被错误预测的序列严重污染。错误预测的高比率具有严重的后果，因为它会严重影响可能从真核基因组的基因组规模序列分析得出的结论。在这里，我们介绍了MisPred数据库和计算管道，它们为识别公共数据库中的错误序列提供了有效的手段。 MisPred数据库包含UniProtKB / Swiss-Prot，UniProtKB / TrEMBL，NCBI / RefSeq和EnsEMBL数据库中被MisPred质量控制工具鉴定为错误的19种后生动物物种的异常，不完整和错误预测的蛋白质序列的集合。数据库的主要版本会自动生成并定期更新。可通过一个简单的Web界面轻松访问该数据库（http://www.mispred.com），该界面与功能强大的查询引擎和标准的Web服务结合在一起。内容可以全部或部分以各种格式下载。数据库URL：http://www.mispred.com

著录项

来源
《Database》 |2013年第40期|共页
作者
Alinda Nagy; László Patthy;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类数学;
关键词

相似文献

外文文献
中文文献
专利

1. FixPred: a resource for correction of erroneous protein sequences [J] . Alinda Nagy, László Patthy Database . 2014,第0期

机译：FixPred：纠正错误蛋白质序列的资源
2. PFDB: a generic protein family database integrating the CATH domain structure database with sequence based protein family resources [J] . Shepherd AJ, Martin NJ, Johnson RG, Bioinformatics . 2002,第12期

机译：PFDB：通用蛋白家族数据库，将CATH结构域数据库与基于序列的蛋白家族资源整合在一起
3. The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants [J] . Shu Ouyang, C. Robin Buell Nucleic Acids Research . 2004,第1期

机译：TIGR植物重复数据库：用于识别植物中重复序列的集体资源
4. Species delineation by ribosomal protein sequences, and identification of bacteria by MALDI to DNA databases. [C] . Kenneth C. Parker ASMS Conference on Mass Spectrometry and Allied Topics . 2016

机译：用核糖体蛋白序列的物种描绘，并通过MALDI对DNA数据库的细菌鉴定。
5. The Role of Mechanical Loading, Bone Morphogenetic Proteins and Erroneous Differentiation of Tendon-derived Stem Cells in the Pathogenesis of Patellar Tendinopathy: A Potential Mechanism for the Chondro-ossification and Failed Healing in Patellar Tendinopathy [D] . Rui, Yunfeng 2011

机译：机械加载，骨形态发生蛋白和肌腱衍生干细胞的错误分化在髌骨肌腱病的发病机制中的作用：髌骨腹膜病患中的软骨化和愈合失败的潜在机制
6. MisPred: a resource for identification of erroneous protein sequences in public databases [O] . Alinda Nagy, László Patthy 2013

机译：MisPred：在公共数据库中用于识别错误蛋白质序列的资源
7. MisPred: a resource for identification of erroneous protein sequences in public databases. [O] . Nagy Alinda, Patthy László 2013

机译：MisPred：在公共数据库中用于识别错误蛋白质序列的资源。

MisPred: a resource for identification of erroneous protein sequences in public databases

摘要

著录项

相似文献

相关主题

期刊订阅