首页> 外文学位 >Contextual analysis of variation and quality in human-curated gene ontology annotations.
【24h】

Contextual analysis of variation and quality in human-curated gene ontology annotations.

机译:人为管理的基因本体注释中变异和质量的上下文分析。

获取原文
获取原文并翻译 | 示例

摘要

Two prospective randomized controlled studies of scientific curators of model organism databases (MODs) were conducted using common document collections to investigate the origins, nature, and extent of variation in curators' Gene Ontology (GO) annotations. Additional contextual data about curators' backgrounds, experience, personal annotation behaviors, and work practices were also collected to provide additional means of explaining variation. A corpus of nearly 4,000 new GO annotations covering 5 organisms was generated by 31 curators and analyzed at the paper, instance, and GO element levels. Variation was observed by organism expertise, by group assignment, and between individual and consensus annotations. Years of GO curation experience was found to not be a predictor of annotation instance quantities. Five facets of GO annotation quality (Consistency, Specificity, Completeness, Validity, and Reliability) were evaluated for utility, and showed promise for use in training novice curators. Pairwise matching and comparison of instances was found to be difficult and atypical, limiting the usefulness of the quality measures. Content analysis was performed on more than 600 pages of curators' hand-annotated paper journal articles used in GO annotation, yielding six types of common notations.
机译:使用共同的文献收集对模型生物数据库(MODs)的科学策展人进行了两项前瞻性随机对照研究,以调查策展人基因本体论(GO)注释的起源,性质和变化程度。还收集了有关策展人的背景,经验,个人注释行为和工作实践的其他上下文数据,以提供解释变化的其他方式。 31位策展人生成了涵盖5种生物的近4,000个新的GO注释语料库,并在论文,实例和GO元素级别进行了分析。通过有机体专业知识,小组分配以及个人注释和共识注释之间的差异进行了观察。发现多年的GO策划经验不能预测注释实例的数量。对GO注释质量的五个方面(一致性,特异性,完整性,有效性和可靠性)进行了实用性评估,并显示了在培训新手策展人方面的前景。发现实例的成对匹配和比较是困难且非典型的,从而限制了质量度量的有用性。对GO注释中使用的600多页策展人的手工注释纸质期刊文章进行了内容分析,产生了六种常见注释。

著录项

  • 作者

    MacMullen, W. John.;

  • 作者单位

    The University of North Carolina at Chapel Hill.;

  • 授予单位 The University of North Carolina at Chapel Hill.;
  • 学科 Biology Bioinformatics.; Information Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 179 p.
  • 总页数 179
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 信息与知识传播;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号