An Approach for Automatic Ontology Enrichment from Texts

Nassima Mellal; Tahar Guerram; Faiza Bouhalassa

摘要

The automatic ontology enrichment consists of automatic knowledge extraction from texts related to a domain of discourse in the aim to enrich automatically an initial ontology of the same domain. However, the passage, from a plain text to an enriched ontology requires a number of steps. In this paper, we present a three steps ontology enrichment approach. In the first step, we apply natural language processing techniques to obtain tagged sentences. The second step allows us to reduce each extracted sentence to an SVO (Subject, Verb, and Object) sentence, supposed to preserve main information carried by the original sentence(s) from which it is extracted. Finally, in the third step, we proceed to enrich an initial ontology built manually by adding extracted terms in the generated SVO as new concepts or instances of concepts and new relations. To validate our approach, we have used “Phytotherapy" domain because of the availability of related texts on the WWW and also because its usefulness for pharmaceutical industry. The first results obtained, after experiments on a set of different texts, testify the performance of the proposed approach.

机译：自动本体丰富由与话语领域相关的文本的自动知识提取，目的是自动丰富同一域的初始本体。然而，从纯文本到丰富的本体的段落需要许多步骤。在本文中，我们提出了三个步骤本体丰富方法。在第一步中，我们应用自然语言处理技术以获得标记的句子。第二步允许我们将每个提取的句子减少到SVO（主题，动词和对象）句子，该句子应该保留由提取的原始句子所携带的主信息。最后，在第三步中，我们通过将生成的SVO中的提取条款作为概念和新关系的新概念或实例添加了所提取的术语来进行手动提高手动构建的初始本体。为了验证我们的方法，我们使用了“植物疗法”域，因为WWW上的相关文本以及其对制药行业的有用性。在一组不同文本的实验后获得的第一个结果，证明了这种表现提出的方法。

An Approach for Automatic Ontology Enrichment from Texts

摘要

著录项

相关主题

期刊订阅