Cross-Document Summarization by Concept Classification


Abstract

In this paper we describe XDoX, a cross-document summarizer designed specifically to summarize large document sets (50-500 documents or more). Such sets are typically obtained from routing or filtering systems run against a continuous stream of data, such as a newswire. XDoX works by identifying the most salient themes within the set, at a level of granularity regulated by the user, and composing an extraction summary that reflects these main themes. In the current version, XDoX is not optimized to produce a summary based on a few unrelated documents; indeed, such summaries are best obtained simply by concatenating summaries of the individual documents. We show examples of summaries obtained in our tests as well as from our participation in the first Document Understanding Conference (DUC).
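
The abstract gives only the outline of the approach: group passages into themes, then extract text that reflects the main themes. The Python sketch below illustrates that general idea and should not be read as the XDoX algorithm, whose details are not given here. It assumes word n-gram counts as passage vectors, cosine similarity between passages, and a greedy single-pass clustering. The function names and the `threshold` and `max_themes` parameters are hypothetical; `threshold` is a stand-in for the user-regulated granularity.

```python
from collections import Counter
import math
import re


def ngram_counts(passage, n_max=2):
    """Count word n-grams (unigrams up to n_max-grams) in a passage."""
    words = re.findall(r"[a-z0-9]+", passage.lower())
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(words) - n + 1):
            counts[" ".join(words[i:i + n])] += 1
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_passages(passages, threshold=0.3):
    """Greedy single-pass clustering (an assumption, not the paper's method).

    Each passage joins the first cluster whose seed passage is at least
    `threshold` similar; otherwise it starts a new cluster. A higher
    threshold yields more, finer-grained themes.
    """
    vectors = [ngram_counts(p) for p in passages]
    clusters = []  # lists of passage indices; element 0 is the seed
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def summarize(passages, max_themes=2, threshold=0.3):
    """Return the seed passage of each of the largest clusters, in source order."""
    clusters = cluster_passages(passages, threshold)
    top = sorted(clusters, key=len, reverse=True)[:max_themes]
    return [passages[c[0]] for c in sorted(top, key=lambda c: c[0])]


passages = [
    "The earthquake struck the coastal city at dawn.",
    "Stock markets fell sharply on Monday.",
    "An earthquake struck the coastal city early in the morning.",
    "Markets fell sharply as stock prices dropped on Monday.",
    "A local bakery won a pastry award.",
]
for line in summarize(passages):
    print(line)
```

On this toy input the two earthquake passages and the two market passages form the two largest clusters, so the summary consists of one passage from each, and the unrelated bakery passage is left out.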


Bibliographic record

Source
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval | 2002 | 8 pages
Authors
Hilda Hardy; Nobuyuki Shimizu; Tomek Strzalkowski; Liu Ting; G. Bowden Wise; Xinyang Zhang
Original format: PDF
Chinese Library Classification: Information retrieval
Keywords
summary; clustering; n-grams; multi-document summarization; passage similarity; term weights


Similar literature

1. NATSUM: Narrative abstractive summarization through cross-document timeline generation [J]. Barros Cristina, Lloret Elena, Saquete Estela. Information Processing & Management, 2019, No. 5.
2. Multi document summarization based on news components using fuzzy cross-document relations [J]. Yogan Jaya Kumar, Naomie Salim, Albaraa Abuobieda. Applied Soft Computing, 2014.
3. Cross-Document Summarization by Concept Classification [C]. Hilda Hardy, Nobuyuki Shimizu, Tomek Strzalkowski. The Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Aug 11-15, 2002, Tampere, Finland. 2002.
4. Using Classification for Analysis of Multi-Modal Video Summarization [D]. Wells, Brendan. 2020.
5. A Tool for Abstracting Relevant Classes of Concepts: the Common Ancestry Summarizer [O]. Ying Tao, Eneida A. Mendonça, Yves A. Lussier.
6. Exploiting Cross-Document Relations for Multi-document Evolving Summarization [O]. Afantenos, Stergos D., Doura, Irene, Kapellou, Eleni. 2004.
7. CONCEPTS AND TECHNIQUES FOR SUMMARIZING DEFENSE SYSTEM COSTS [R]. J. W. Noah. 1965.
