首页> 外文会议>Mexican International Conference on Artificial Intelligence >Intra-document and Inter-document Redundancy in Multi-document Summarization
【24h】

Intra-document and Inter-document Redundancy in Multi-document Summarization

机译:多文件摘要中的文档内和文档间冗余

获取原文

摘要

Multi-document summarization differs from single-document summarization in excessive redundancy of mentions of some events or ideas. We show how the amount of redundancy in a document collection can be used for assigning importance to sentences in multi-document extractive summarization: for instance, an idea could be important if it is redundant across documents because of its popularity; on the other hand, an idea could be important if it is not redundant across documents because of its novelty. We propose an unsupervised graph-based technique that, based on proper similarity measures, allows us to experiment with intra-document and inter-document redundancy. Our experiments on DUC corpora show promising results.
机译:多文件摘要与单一文件摘要不同,在一些事件或想法的过度冗余中的过度冗余。我们展示了文档集合中的冗余量如何用于为多文件提取摘要中的句子分配重要性:例如,如果由于其受欢迎程度,如果由于文档冗余,则一个想法可能很重要;另一方面,如果由于新颖性,如果由于其新颖而横跨文档而不是多余的想法,这可能很重要。我们提出了一种无监督的基于图形的技术,基于适当的相似性措施,允许我们在文档内和文档间冗余中进行实验。我们对DUC Corea的实验显示了有希望的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号