...
首页> 外文期刊>Journal of Advanced Computatioanl Intelligence and Intelligent Informatics >Interactive Document Clustering System Based on Coordinated Multiple Views
【24h】

Interactive Document Clustering System Based on Coordinated Multiple Views

机译:基于多视图协同的交互式文档聚类系统

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes an interactive document clustering system, which is designed based on the concept of CMV (coordinated multiple views). An interactive document clustering is used by a user to obtain a set of document groups from a document collection in interactive manner. It is expected to be useful for various tasks such as text mining and document retrieval. As the result of document clustering consists of multiple objects such as clusters (document groups), documents, and words, each of those should be presented to users in different ways. Based on this consideration, the proposed system employs multiple views, each of which is designed for specific object such as document and keyword. A prototype system is implemented on TETDM (Total Environment for Text Data Mining), which is one of environments for developing text data mining tools. As it can provide the mechanism of coordination between modules, we decided to use it for developing the prototype system. The proposed system classifies information to be presented into 4 levels: clusters, document, bag of words, and word, each of which is displayed with different views. Experimental results with test participants show the effectiveness of the proposed system.
机译:本文提出了一种基于CMV(协同多视图)概念设计的交互式文档聚类系统。用户使用交互式文档聚类以交互方式从文档集合中获取一组文档组。预期它对各种任务(例如文本挖掘和文档检索)很有用。由于文档聚类的结果包含多个对象,例如聚类(文档组),文档和单词,因此应以不同的方式将这些对象呈现给用户。基于此考虑,提出的系统采用了多个视图,每个视图都是针对特定对象(例如文档和关键字)设计的。在TETDM(文本数据挖掘的总体环境)上实现了原型系统,该环境是用于开发文本数据挖掘工具的环境之一。由于它可以提供模块之间的协调机制,因此我们决定将其用于开发原型系统。提议的系统将要呈现的信息分为四个级别:簇,文档,单词袋和单词,每个级别都以不同的视图显示。测试参与者的实验结果表明了该系统的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号