A Framework for Identifying Textual Redundancy

机译：识别文本冗余的框架

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The task of identifying redundant information in documents that are generated from multiple sources provides a significant challenge for summarization and QA systems. Traditional clustering techniques detect redundancy at the sentential level and do not guarantee the preservation of all information within the document. We discuss an algorithm that generates a novel graph-based representation for a document and then utilizes a set cover approximation algorithm to remove redundant text from it. Our experiments show that this approach offers a significant performance advantage over clustering when evaluated over an annotated dataset.

机译：识别从多个来源生成的文档中的冗余信息的任务为摘要和QA系统提出了重大挑战。传统的群集技术在句子级别检测冗余，并且不能保证在文档中保留所有信息。我们讨论了一种算法，该算法为文档生成基于图形的新颖表示形式，然后利用集合覆盖率近似算法从中删除多余的文本。我们的实验表明，对带注释的数据集进行评估时，该方法比聚类具有明显的性能优势。

著录项

来源
《22nd International Conference on Computational Linguistics》|2008年|873-880|共8页
会议地点 Manchester(GB);Manchester(GB)
作者
Kapil Thadani; Kathleen McKeown;
展开▼
作者单位

Department of Computer Science, Columbia University, New York, NY USA;

Department of Computer Science, Columbia University, New York, NY USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Richness, redundancy or relational salience? A comparison of the effect of textual and aural feedback modes on knowledge elaboration in higher education students' work [J] . Alan Gleaves, Caroline Walker Computers & education . 2013,第mara期

机译：丰富性，冗余性或关系显着性？文本和听觉反馈方式对高校学生工作中知识阐述的影响比较
2. Identifying duplicate functionality in textual use cases by aligning semantic actions [J] . Rago Alejandro, Marcos Claudia, Diaz-Pace J. Andres Software and systems modeling . 2016,第2期

机译：通过对齐语义动作来识别文本用例中的重复功能
3. A Linguistic Approach to Identify the Affective Dimension Expressed in Textual Messages [J] . Sandro Jose Rigo, Isa Mara da Rosa Alves, Jorge Luis Victoria Barbosa International Journal of Information and Communication Technology Education: An Official Pubblication of the Information Resources Management Association . 2015,第1期

机译：识别短信中情感维度的语言学方法
4. A Framework for Identifying Textual Redundancy [C] . International Conference on Computational Linguistics . 2008

机译：识别文本冗余的框架
5. Index compression and redundancy elimination in large textual collections. [D] . Yan, Hao. 2010

机译：大型文本集合中的索引压缩和冗余消除。
6. Development and Validation of Online Textual Pediatrician-Parent Communication Instrument Based on the SEGUE Framework [O] . Yuqi Xiong, Dan Wang, Haihong Chen, 2006

机译：基于SEGUE框架的在线文本儿科医生—父母沟通工具的开发与验证
7. A Framework for Identifying Textual Redundancy [O] . Kapil Thadani, Kathleen Mckeown 2011

机译：识别文本冗余的框架

A Framework for Identifying Textual Redundancy

摘要

著录项

相似文献

相关主题

期刊订阅