...
首页> 外文期刊>ACS Omega >Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses
【24h】

Crowd-Sourced Chemistry: Considerations for Building a Standardized Database to Improve Omic Analyses

机译:人群来源的化学:建立标准化数据库以改善Omic分析的注意事项

获取原文
           

摘要

Mass spectrometry (MS) is used in multiple omics disciplines to generate large collections of data. This data enables advancements in biomedical research by providing global profiles of a given system. One of the main barriers to generating these profiles is the inability to accurately annotate omics data, especially small molecules. To complement pre-existing large databases that are not quite complete, research groups devote efforts to generating personal libraries to annotate their data. Scientific progress is impeded during the generation of these personal libraries because the data contained within them is often redundant and/or incompatible with other databases. To overcome these redundancies and incompatibilities, we propose that communal, crowd-sourced databases be curated in a standardized fashion. A small number of groups have shown this model is feasible and successful. While the needs of a specific field will dictate the functionality of a communal database, we discuss some features to consider during database development. Special emphasis is made on standardization of terminology, documentation, format, reference materials, and quality assurance practices. These standardization procedures enable a field to have higher confidence in the quality of the data within a given database. We also discuss the three conceptual pillars of database design as well as how crowd-sourcing is practiced. Generating open-source databases requires front-end effort, but the result is a well curated, high quality data set that all can use. Having a resource such as this fosters collaboration and scientific advancement.
机译:质谱(MS)用于多种组学学科,以生成大量数据。通过提供给定系统的全局概况,该数据可推动生物医学研究的发展。产生这些谱图的主要障碍之一是无法准确注释组学数据,尤其是小分子。为了补充不完整的现有大型数据库,研究小组致力于创建个人库来注释其数据。在这些个人图书馆的生成过程中,科学进展受到了阻碍,因为其中包含的数据通常是多余的和/或与其他数据库不兼容的。为了克服这些冗余和不兼容性,我们建议以标准化的方式来管理公共的,众包的数据库。少数小组表明此模型是可行且成功的。虽然特定领域的需求将决定公用数据库的功能,但我们讨论了在数据库开发过程中要考虑的一些功能。特别强调术语,文档,格式,参考资料和质量保证实践的标准化。这些标准化程序使字段对给定数据库内的数据质量具有更高的置信度。我们还将讨论数据库设计的三个概念支柱以及如何实施众包。生成开源数据库需要前端工作,但是结果是精心策划的高质量数据集,所有人都可以使用。拥有这样的资源可以促进协作和科学进步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号