An efficient path computing model for measuring semantic similarity using edge and density

Zhu Xinhua; Li Fei; Chen Hongchao; Peng Qi

首页> 外文期刊>Knowledge and information systems >An efficient path computing model for measuring semantic similarity using edge and density

【24h】

An efficient path computing model for measuring semantic similarity using edge and density

机译：使用边缘和密度测量语义相似性的有效路径计算模型

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The shortest path between two concepts in a taxonomic ontology is commonly used to represent the semantic distance between concepts in edge-based semantic similarity measures. In the past, edge counting, which is simple and intuitive and has low computational complexity, was considered the default method for path computation. However, a large lexical taxonomy, such as WordNet, has irregular link densities between concepts due to its broad domain, but edge counting-based path computation is powerless for this non-uniformity problem. In this paper, we advocate that the path computation can be separated from edge-based similarity measures and can form various general computing models. Therefore, to solve the problem of the non-uniformity of concept density in a large taxonomic ontology, we propose a new path computing model based on the compensation of local area density of concepts, which is equal to the number of direct hyponyms of the subsumers for concepts in the shortest path. This path model considers the local area density of concepts as an extension of the edge counting-based path according to the information theory. This model is a general path computing model and can be applied in various edge-based similarity approaches. The experimental results show that the proposed path model improves the average optimal correlation between edge-based measures and human judgments on the Miller and Charles benchmark for WordNet from less than 0.79 to more than 0.86, on the Pedersenet al. benchmark (average of both Physician and Coder) for SNOMED-CT from less than 0.75 to more than 0.82, and it has a large advantage in efficiency compared with information content computation in a dynamic ontology, thereby successfully improving the edge-based similarity measure as an excellent method with high performance and high efficiency.

机译：分类本体本体中的两个概念之间的最短路径通常用于表示基于边缘的语义相似度措施的概念之间的语义距离。过去，边缘计数，简单且直观并具有低计算复杂性，被认为是路径计算的默认方法。然而，由于其宽域，诸如Wordnet等大型词汇分类，例如Wordnet，在概念之间具有不规则的链路密度，但是对于这种非均匀性问题，基于边缘计数的路径计算是无能为力的。在本文中，我们倡导路径计算可以与基于边缘的相似度测量分离，并且可以形成各种常规计算模型。因此，为了解决大型分类本体中的概念密度的不均匀性问题，我们提出了一种基于局域局部密度补偿的新路径计算模型，其等于Supumers的直接假设的数量对于最短路径的概念。该路径模型认为概念的局部密度作为基于边缘计数的路径的扩展，根据信息理论。该模型是一般路径计算模型，可以应用于各种基于边缘的相似性方法。实验结果表明，在Pedersenet Al上，所提出的路径模型提高了基于米勒和查理基准的边缘措施和人力判断之间的平均最佳相关性和用于Wordnet的Charles基准。基准（医生和编码器的平均值）对于小于0.75至大于0.82的SnoMed-CT，与动态本体中的信息内容计算相比，它具有很大的优势，从而成功地提高了基于边缘的相似度测量一种高性能和高效率的优异方法。

著录项

来源
《Knowledge and information systems》 |2018年第1期|共33页
作者
Zhu Xinhua; Li Fei; Chen Hongchao; Peng Qi;
展开▼
作者单位

Guangxi Normal Univ Guangxi Key Lab Multisource Informat Min &

Secur Guilin 541004 Peoples R China;

Guangxi Normal Univ Guangxi Key Lab Multisource Informat Min &

Secur Guilin 541004 Peoples R China;

Guangxi Normal Univ Guangxi Key Lab Multisource Informat Min &

Secur Guilin 541004 Peoples R China;

Guangxi Normal Univ Guangxi Key Lab Multisource Informat Min &

Secur Guilin 541004 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词
Path computing model; Semantic similarity; Local density; WordNet; SNOMED-CT;

机译：路径计算模型;语义相似性;局部密度;Wordnet;Snomed-CT;

相似文献

外文文献
中文文献
专利

1. An efficient path computing model for measuring semantic similarity using edge and density [J] . Zhu Xinhua, Li Fei, Chen Hongchao, Knowledge and information systems . 2018,第1期

机译：使用边缘和密度测量语义相似性的有效路径计算模型
2. DW-PathSim: a distributed computing model for topic-driven weighted meta-path-based similarity measure in a large-scale content-based heterogeneous information network [J] . Phuc Do, Phu Pham Journal of Information and Telecommunication . 2019,第1期

机译：DW-PathSim：大型基于内容的异构信息网络中主题驱动的加权基于元路径的相似性度量的分布式计算模型
3. Computing semantic similarity based on novel models of semantic representation using Wikipedia [J] . Qu Rong, Fang Yongyi, Bai Wen, Information Processing & Management . 2018,第6期

机译：使用Wikipedia基于新颖的语义表示模型计算语义相似度
4. Efficient Versus Accurate Algorithms for Computing a Semantic Logic-Based Similarity Measure [C] . Fatma Ezzahra Gmati, Salem Chakhar, Nadia Yacoubi Ayadi, International conference on industrial engineering and other applications of applied intelligent systems . 2018

机译：用于计算基于语义逻辑的相似性测度的高效与精确算法
5. Using semantic similarity measures in the biomedical domain for computing functional similarity between genes based on gene ontology [D] . Khabiri, Elham 2007

机译：在生物医学领域中使用语义相似性度量基于基因本体计算基因之间的功能相似性
6. U-path: An undirected path-based measure of semantic similarity [O] . Bridget T. McInnes, Ted Pedersen, Ying Liu, 2014

机译：U路径：一种基于路径的语义相似度度量
7. EFFICIENT PROTOCOLS FOR COMPUTING THE OPTIMAL SWAP EDGES OF A SHORTEST PATH TREE [O] . Nicola Santoro 2014

机译：用于计算最短路径树的最佳交换边缘的有效协议

An efficient path computing model for measuring semantic similarity using edge and density

摘要

著录项

相似文献

相关主题

期刊订阅