MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and Its Application to SNOMED CT

Abstract

Non-lattice fragments are often indicative of structural anomalies in ontological systems and, as such, represent possible areas of focus for subsequent quality assurance work. However, extracting the non-lattice fragments of a large ontological system with a traditional sequential approach is computationally expensive, if not prohibitive. In this paper we present a general MapReduce pipeline, called MaPLE (MapReduce Pipeline for Lattice-based Evaluation), for extracting non-lattice fragments in large partially ordered sets, and we demonstrate its applicability to ontology quality assurance. Using MaPLE on a 30-node local Hadoop cloud, we systematically extracted the non-lattice fragments in 8 SNOMED CT versions from 2009 to 2014 (each containing over 300k concepts), with an average total computing time of less than 3 hours per version. With this dramatically reduced time, MaPLE makes it feasible not only to perform exhaustive structural analysis of large ontological hierarchies, but also to systematically track structural changes between versions. Our change analysis showed that the average change rates on the non-lattice pairs are up to 38.6 times higher than the change rates of the background structure (concept nodes). This demonstrates that fragments around non-lattice pairs exhibit significantly higher rates of change in the process of ontological evolution.
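
The abstract does not spell out what a non-lattice pair is or how MaPLE finds one. The sketch below is an illustration only, not the MaPLE pipeline: it assumes the common definition that two incomparable concepts form a non-lattice pair when they share more than one maximal common descendant, and it runs sequentially on a tiny hand-made hierarchy. The names `descendants`, `maximal_elements`, `non_lattice_pairs`, and the example graph are all hypothetical. The per-pair check is independent of every other pair, which is the kind of work a MapReduce job could distribute across nodes.

```python
from itertools import combinations


def descendants(children, node, cache):
    """Return the set of all strict descendants of `node` in a DAG."""
    if node not in cache:
        found = set()
        for child in children.get(node, ()):
            found.add(child)
            found |= descendants(children, child, cache)
        cache[node] = found
    return cache[node]


def maximal_elements(nodes, children, cache):
    """Keep the nodes that are not a strict descendant of another node in `nodes`."""
    return {
        n for n in nodes
        if not any(n in descendants(children, m, cache) for m in nodes if m != n)
    }


def non_lattice_pairs(children):
    """List (a, b, maximal common descendants) for every non-lattice pair.

    Assumed definition: a and b are incomparable and have more than one
    maximal common descendant.
    """
    cache = {}
    nodes = set(children) | {c for cs in children.values() for c in cs}
    pairs = []
    for a, b in combinations(sorted(nodes), 2):
        da = descendants(children, a, cache)
        db = descendants(children, b, cache)
        if a in db or b in da:
            continue  # comparable pairs always have a unique greatest lower bound
        maximal = maximal_elements(da & db, children, cache)
        if len(maximal) > 1:
            pairs.append((a, b, sorted(maximal)))
    return pairs


# Hypothetical hierarchy: A and B both subsume C and D, which are unrelated.
example = {"root": ["A", "B"], "A": ["C", "D"], "B": ["C", "D"], "C": [], "D": []}
print(non_lattice_pairs(example))
```

This brute-force version examines every pair of nodes, so its cost grows quadratically with the number of concepts. That growth is why a sequential approach becomes impractical at the scale of SNOMED CT and why a distributed pipeline is attractive.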
