HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization

Abstract

Neural extractive summarization models usually employ a hierarchical encoder for document encoding, and they are trained using sentence-level labels that are created heuristically with rule-based methods. Training the hierarchical encoder with these inaccurate labels is challenging. Inspired by recent work on pre-training Transformer sentence encoders (Devlin et al., 2018), we propose HIBERT (as shorthand for HIerarchical Bidirectional Encoder Representations from Transformers) for document encoding, along with a method to pre-train it using unlabeled data. We apply the pre-trained HIBERT to our summarization model, and it outperforms its randomly initialized counterpart by 1.25 ROUGE on the CNN/Dailymail dataset and by 2.0 ROUGE on a version of the New York Times dataset. We also achieve state-of-the-art performance on these two datasets.
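To make the architecture described above concrete, the sketch below shows a hierarchical bidirectional Transformer encoder with a sentence-level extraction head in PyTorch: a sentence-level Transformer encodes the tokens of each sentence, its pooled outputs are fed to a document-level Transformer, and a linear layer scores each sentence for extraction. This is a minimal illustration, not the authors' released implementation; module names, mean pooling, and hyperparameters are assumptions.

```python
# Minimal sketch of a hierarchical bidirectional Transformer encoder for
# extractive summarization (illustrative only; not the HIBERT release code).
import torch
import torch.nn as nn


class HierarchicalEncoder(nn.Module):
    """Sentence-level Transformer feeding a document-level Transformer."""

    def __init__(self, vocab_size, d_model=512, nhead=8, num_layers=6):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        sent_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        doc_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.sent_encoder = nn.TransformerEncoder(sent_layer, num_layers)
        self.doc_encoder = nn.TransformerEncoder(doc_layer, num_layers)
        self.classifier = nn.Linear(d_model, 1)  # per-sentence extraction score

    def forward(self, token_ids):
        # token_ids: (batch, num_sents, num_tokens); padding masks omitted for brevity.
        b, s, t = token_ids.shape
        tokens = self.embed(token_ids.view(b * s, t))       # encode tokens of each sentence
        sent_repr = self.sent_encoder(tokens).mean(dim=1)   # pool tokens into sentence vectors
        sent_repr = sent_repr.view(b, s, -1)
        doc_repr = self.doc_encoder(sent_repr)               # contextualize sentences in the document
        return self.classifier(doc_repr).squeeze(-1)         # (batch, num_sents) extraction logits


# Usage: score a toy batch of 2 documents, each with 4 sentences of 10 tokens.
model = HierarchicalEncoder(vocab_size=30000)
logits = model(torch.randint(0, 30000, (2, 4, 10)))
print(logits.shape)  # torch.Size([2, 4])
```

In the paper's setup, the sentence-level labels used to supervise such extraction scores are created heuristically, which is why pre-training the hierarchical encoder on unlabeled documents before fine-tuning is beneficial.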
