首页> 外国专利> Method for performing effective drill-down operations in text corpus visualization and exploration using language model approaches for key phrase weighting

Method for performing effective drill-down operations in text corpus visualization and exploration using language model approaches for key phrase weighting

机译:在语言语料库可视化和探索中使用语言模型方法对关键短语加权进行有效的向下钻取操作的方法

摘要

The invention relates to a method and an apparatus for performing a drill-down operation on a text corpus comprising documents, using language models for key phrase weighting, said method comprising the steps of weighting key phrases occurring both in a foreground language model, which contains a selected document cluster of said text corpus, and in a background language model, which does not contain said selected document cluster, by calculating for each key phrase a key phrase weight comprising a ratio between the foreground weight of said key phrase and a background weight of said key phrase, and assigning documents of the foreground language model to cluster labels which are formed by key phrases having high calculated key phrase weights.
机译:本发明涉及一种方法和设备,其使用用于关键词短语加权的语言模型对包括文档的文本语料库执行向下钻取操作,所述方法包括对在前景语言模型中同时出现的关键词短语进行加权的步骤,该方法和设备包括在不包含所述选择的文档簇的背景语言模型中,通过为每个关键词计算关键词权重,所述关键词权重包括所述关键词的前景权重与背景权重之比,在不包含所述选择的文档群的背景语言模型中然后,将前景语言模型的文档分配给聚类标签,这些聚类标签是由具有较高计算关键字权重的关键字构成的。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号