首页> 外文会议>International Conference on Imaging Science,Systems,and Technology CISST'99 Une 28-July 1, 1999 Las Vegas, Nevada, USA >Textual data mining through the synergistic combination of classifiers and linguistic processors
【24h】

Textual data mining through the synergistic combination of classifiers and linguistic processors

机译:通过分类器和语言处理器的协同组合进行文本数据挖掘

获取原文
获取原文并翻译 | 示例

摘要

Numerical data mining tools are generally quite robust but only provide coarse-granularity results; such tools can handle very large inputs. Computational linguistic tools are able to provide fine-granularity results but are less robust; such tools, often semi-automatic, usually handle relatively short inputs. A synergistic combination of both types of tools is the basis of our hybrid approach. First, a connectionist classifier is used to locate potentially interesting documents, or segments thereof. Second,the user selects segments that will be forwarded to the linguistic processor in order to semi-automatically analyse their textural data and extract relevant informaiton or knowledge elements. We present the main characteristics of our hybrid approach to textual data mining, plus a methodology by which it can be put to use. We alos report on the results of a first evaluation involving a corpus made up of two texts pertaining to two different domains.
机译:数值数据挖掘工具通常相当健壮,但只能提供粗粒度结果。这样的工具可以处理非常大的输入。计算语言工具能够提供细粒度的结果,但功能较差。这样的工具,通常是半自动的,通常处理相对较短的输入。两种工具的协同组合是我们混合方法的基础。首先,使用连接器分类器来定位潜在有趣的文档或其片段。其次,用户选择将被转发到语言处理器的段,以便半自动地分析其纹理数据并提取相关的信息或知识元素。我们介绍了文本数据挖掘混合方法的主要特征,以及可以使用它的方法。我们将报告涉及一个语料库的首次评估结果,该语料库由与两个不同领域相关的两个文本组成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号