首页> 外文期刊>Database >ORION-VIRCAT: a tool for mapping ICTV and NCBI taxonomies
【24h】

ORION-VIRCAT: a tool for mapping ICTV and NCBI taxonomies

机译:ORION-VIRCAT:绘制ICTV和NCBI分类标准的工具

获取原文
           

摘要

Viruses, viroids and prions are the smallest infectious biological entities that depend on their host for replication. The number of pathogenic viruses is considerably large and their impact in human global health is well documented. Currently, the International Committee on the Taxonomy of Viruses (ICTV) has classified ~4379 virus species while the National Center for Biotechnology Information Viral Genomes Resource (NCBI-VGR) database has mapped 617 705 proteins to eight large taxonomic groups. Despite these efforts, an automated approach for mapping the ICTV master list and its officially accepted virus naming to the NCBI-VGR's taxonomical classification is not available. Due to metagenomic sequencing, it is likely that the discovery and naming of new viral species will increase by at least ten fold. Unfortunately, existing viral databases are not adequately prepared to scale, maintain and annotate automatically ultra-high throughput sequences and place this information into specific taxonomic categories. ORION-VIRCAT is a scalable and interoperable object-relational database designed to serve as a resource for the integration and verification of taxonomical classifications generated by the ICTV and NCBI-VGR. The current release (v1.0) of ORION-VIRCAT is implemented in PostgreSQL and it has been extended to ORACLE, MySQL and SyBase. ORION-VIRCAT automatically mapped and joined 617 705 entries from the NCBI-VGR to the viral naming of the ICTV. This detailed analysis revealed that 399 095 entries from the NCBI-VGR can be mapped to the ICTV classification and that one Order, 10 families, 35 genera and 503 species listed in the ICTV disagree with the the NCBI-VGR classification schema. Nevertheless, we were eable to correct several discrepancies mapping 234 000 additional entries. Database URL: http://www.orionbiosciences.com/research/orion-vircat.html
机译:病毒,类病毒和病毒是最小的传染性生物实体,它们依赖于其宿主进行复制。病原性病毒的数量非常多,并且已充分证明了它们对人类全球健康的影响。目前,国际病毒分类学委员会(ICTV)已分类了约4379种病毒,而国家生物技术信息病毒基因组资源中心(NCBI-VGR)数据库已将617705种蛋白质映射到8个主要的分类学组中。尽管做出了这些努力,但仍无法使用自动方法将ICTV主列表及其正式接受的病毒命名映射到NCBI-VGR的分类标准。由于宏基因组测序,新病毒物种的发现和命名可能会增加至少十倍。不幸的是,现有的病毒数据库没有充分准备好自动缩放,维护和注释超高通量序列,并将此信息放入特定的分类学类别。 ORION-VIRCAT是一个可伸缩且可互操作的对象关系数据库,旨在用作ICTV和NCBI-VGR生成的分类标准的集成和验证的资源。 ORION-VIRCAT的当前版本(v1.0)在PostgreSQL中实现,并且已扩展到ORACLE,MySQL和SyBase。 ORION-VIRCAT自动将617705个条目从NCBI-VGR映射并加入到ICTV的病毒命名中。这项详细的分析显示,可以将NCBI-VGR的399 095个条目映射到ICTV分类,并且ICTV中列出的1个科,10个科,35个属和503个物种与NCBI-VGR分类架构不同。不过,我们能够更正一些差异,从而映射了234 000个其他条目。数据库网址:http://www.orionbiosciences.com/research/orion-vircat.html

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号