Journal: Database

Curation accuracy of model organism databases

Abstract

Manual extraction of information from the biomedical literature, or biocuration, is the central methodology used to construct many biological databases. For example, the UniProt protein database, the EcoCyc Escherichia coli database and the Candida Genome Database (CGD) are all based on biocuration. Life science researchers use biological databases extensively: as online encyclopedias, as aids in interpreting new experimental data and as gold standards for developing new bioinformatics algorithms. Although manual curation has been assumed to be highly accurate, we are aware of only one previous study of biocuration accuracy. We assessed the accuracy of EcoCyc and CGD by manually selecting curated assertions within randomly chosen EcoCyc and CGD gene pages and then checking that the data in the referenced publications supported those assertions. A database assertion was considered to be in error if it could not be found in the publication cited for it. We identified 10 errors in the 633 facts that we validated across the two databases, for an overall error rate of 1.58%, and individual error rates of 1.82% for CGD and 1.40% for EcoCyc. These data suggest that manual curation of the experimental literature by PhD-level scientists is highly accurate. Database URL: http://ecocyc.org/, http://www.candidagenome.org//
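The overall error rate quoted in the abstract follows directly from the two counts it reports (10 errors among 633 validated facts). The sketch below reproduces that arithmetic in Python. The function names `error_rate` and `wilson_interval` are hypothetical, and the 95% Wilson score interval is an illustrative addition that the abstract does not report. The per-database counts behind the 1.82% and 1.40% figures are not given in the abstract, so they are not reconstructed here.

```python
from math import sqrt


def error_rate(errors: int, total: int) -> float:
    """Fraction of validated assertions that were found to be in error."""
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= errors <= total:
        raise ValueError("errors must lie between 0 and total")
    return errors / total


def wilson_interval(errors: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion.

    Assumption: this interval is not part of the source abstract; it is
    shown only to indicate the uncertainty around a rate estimated from
    a sample of this size. z = 1.96 corresponds to roughly 95% coverage.
    """
    p = error_rate(errors, total)
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half_width = z * sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return centre - half_width, centre + half_width


# Counts taken from the abstract: 10 errors in 633 validated facts.
overall = error_rate(10, 633)
low, high = wilson_interval(10, 633)

print(f"Overall error rate: {overall:.2%}")  # prints "Overall error rate: 1.58%"
print(f"Approximate 95% interval: {low:.2%} to {high:.2%}")
```

The Wilson interval is used here rather than the simpler normal approximation because the latter behaves poorly when the observed proportion is close to zero, as it is for these data.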
