首页> 外文会议>Twentieth International Joint Conference on Artificial Intelligence(IJCAI-07) >Manifold-Ranking Based Topic-Focused Multi-Document Summarization
【24h】

Manifold-Ranking Based Topic-Focused Multi-Document Summarization

机译:基于歧管排序的主题多文档摘要

获取原文

摘要

Topic-focused multi-document summarization aims to produce a summary biased to a given topic or user profile. This paper presents a novel extractive approach based on manifold-ranking of sentences to this summarization task. The manifold-ranking process can naturally make full use of both the relationships among all the sentences in the documents'and the relationships between the given topic and the sentences. The ranking score is obtained for each sentence in the manifold-ranking process to denote the biased information richness of the sentence. Then the greedy algorithm is employed to impose diversity penalty on each sentence. The summary is produced by choosing the sentences with both high biased information richness and high information novelty. Experiments on DUC2003 and DUC2005 are performed and the ROUGE evaluation results show that the proposed approach can significantly outperform existing approaches of the top performing systems in DUC tasks and baseline approaches.
机译:以主题为中心的多文档摘要旨在生成偏向给定主题或用户个人资料的摘要。本文提出了一种新颖的提取方法,该方法基于句子的多级排序来完成该摘要任务。自然排序过程可以自然地充分利用文档中所有句子之间的关系以及给定主题与句子之间的关系。在多级排序过程中为每个句子获得排名分数,以表示该句子的信息丰富度。然后采用贪婪算法对每个句子施加分集惩罚。通过选择具有高偏见的信息丰富度和高的信息新颖性的句子来产生摘要。在DUC2003和DUC2005上进行了实验,ROUGE评估结果表明,该方法在DUC任务和基线方法中可以明显优于性能最高的系统的现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号