首页> 外文会议>International Conference on Advanced Data Mining and Applications >CCE: A Chinese Concept Encyclopedia Incorporating the Expert-Edited Chinese Concept Dictionary with Online Cyclopedias
【24h】

CCE: A Chinese Concept Encyclopedia Incorporating the Expert-Edited Chinese Concept Dictionary with Online Cyclopedias

机译:CCE:中国概念百科全书,纳入专家编辑的中国概念词典,在线cyclopedias

获取原文

摘要

Bag-of-words is the most common-used method in text mining tasks and many other applications. However, this method has some obvious shortcomings, such as ignoring semantic information. While in document analysis, semantic information always plays a more important role than individual words. To tackle this problem, we need to borrow semantic information from ontologies to learn the text information better. An expert-edited ontology is usually well structured and is more authoritative than an online cyclopedia. On the other hand, due to the costly editing, it is rather difficult for expert-edited ontologies to keep up with a deluge of new words. In this paper, we propose a method to construct a Chinese ontology to keep the carefully-designed structure of an expert-edited ontology, meanwhile embody new vocabulary from an online cyclopedia. We name the enhanced ontology as Chinese Concept Encyclopedia (CCE) and employ it in some text mining applications. The experimental results show that CCE outperforms the expert-edited ontology Chinese Concept Dictionary (CCD).
机译:文字袋是文本挖掘任务和许多其他应用中最常用的方法。然而,这种方法具有一些明显的缺点,例如忽略语义信息。虽然在文档分析中,语义信息总是比单个单词更重要的角色。为了解决这个问题,我们需要从本体中借用语义信息来更好地学习文本信息。专家编辑的本体通常是很好的结构化,而且比在线Cyperopedia更权威。另一方面,由于昂贵的编辑,专家编辑的本体是相当困难的,以跟上卓越的新词。在本文中,我们提出了一种构建中国本体论的方法,以保持专家编辑的本体论的仔细设计,同时体现了来自在线Cyperopedia的新词汇。我们将增强的本体列为中国概念百科全书(CCE)命名,并在一些文本挖掘应用程序中雇用。实验结果表明,CCE优于专家编辑的本体中文概念词典(CCD)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号