【24h】

Multi-document Summarization Based on BE-Vector Clustering

机译:基于BE-Vector聚类的多文档摘要

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a novel multi-document summarization strategy based on Basic Element (BE) vector clustering. In this strategy, sentences are represented by BE vectors instead of word or term vectors before clustering. BE is a head-modifier-relation triple representation of sentence content, and it is more precise to use BE as semantic unit than to use word. The BE-vector clustering is realized by adopting the k-means clustering method, and a novel clustering analysis method is employed to automatically detect the number of clusters, K. The experimental results indicate a superiority of the proposed strategy over the traditional summarization strategy based on word vector clustering. The summaries generated by the proposed strategy achieve a ROUGE-1 score of 0.37291 that is better than those generated by traditional strategy (at 0.36936) on DUC04 task-2.
机译:在本文中,我们提出了一种基于基本元素(BE)向量聚类的新颖的多文档摘要策略。在这种策略中,在聚类之前,句子由BE向量代替单词或术语向量表示。 BE是句子内容的头-修饰语-关系三元表示,并且使用BE作为语义单元比使用单词更精确。通过采用k均值聚类方法实现BE向量聚类,并采用一种新颖的聚类分析方法自动检测聚类数K。实验结果表明,该方法优于传统的基于聚类策略的聚类方法。在词向量聚类上。提议的策略生成的摘要的ROUGE-1得分为0.37291,比DUC04任务2的传统策略生成的摘要(0.36936)要好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号