TalkSumm: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks

机译：讨论：基于会议谈判的科学论文摘要数据集和可扩展注释方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Currently, no large-scale training data is available for the task of scientific paper summarization. In this paper, we propose a novel method that automatically generates summaries for scientific papers, by utilizing videos of talks at scientific conferences. We hypothesize that such talks constitute a coherent and concise description of the papers' content, and can form the basis for good summaries. We collected 1716 papers and their corresponding videos, and created a dataset of paper summaries. A model trained on this dataset achieves similar performance as models trained on a dataset of summaries created manually. In addition, we validated the quality of our summaries by human experts.

机译：目前，没有大规模的培训数据可用于科学论文摘要的任务。在本文中，我们提出了一种新的方法，通过使用科学会议的谈判视频来自动为科学论文产生摘要。我们假设此类谈判构成了对论文内容的一致性和简明的描述，并且可以为良好的摘要构成基础。我们收集了1716篇论文及其相应的视频，并创建了纸张摘要数据集。在此数据集上培训的模型实现了类似的性能，因为模型在手动创建的摘要数据集上培训。此外，我们验证了人类专家摘要的质量。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2019年|cxxxiv p. 1980-2638|共7页
会议地点
作者
Guy Lev; Michal Shmueli-Scheuer; Jonathan Herzig; Achiya Jerbi; David Konopnicki;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Performance of single and multi-atlas based automated landmarking methods compared to expert annotations in volumetric microCT datasets of mouse mandibles [J] . Ryan Young, A. Murat Maga Frontiers in zoology . 2015,第1期

机译：基于单架和多标准的自动化地标方法的性能与鼠标颌骨体积微区数据集中的专家注释相比
2. QBSUM: A large-scale query-based document summarization dataset from real-world applications [J] . Mingjun Zhao, Shengli Yan, Bang Liu, Computer speech and language . 2021,第Mara期

机译：qbsum：真实世界应用程序的基于大规模的查询文件摘要数据集
3. METHODS FOR SUMMARIZING RADIOCARBON DATASETS [J] . Ramsey Christopher Bronk Radiocarbon . 2017,第6期

机译：汇总radiocarbon数据集的方法
4. TalkSumm: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks [C] . Guy Lev, Michal Shmueli-Scheuer, Jonathan Herzig, Annual meeting of the Association for Computational Linguistics . 2019

机译：TalkSumm：一种基于会议演讲的科学论文摘要的数据集和可扩展注释方法
5. Query-Driven Analysis and Visualization for Large-Scale Scientific Dataset using Geometry Summarization and Bitmap Indexing [D] . Wei, Tzu-Hsuan 2017

机译：使用几何汇总和位图索引的大规模科学数据集的查询驱动分析和可视化
6. Performance of single and multi-atlas based automated landmarking methods compared to expert annotations in volumetric microCT datasets of mouse mandibles [O] . Ryan Young, A. Murat Maga 2015

机译：与小鼠下颌骨体积microCT数据集中的专家注释相比基于单图谱和多图谱的自动界标方法的性能
7. TalkSumm: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks [O] . Guy Lev, Michal Shmueli-Scheuer, Jonathan Herzig, 2019

机译：讨论：基于会议谈判的科学论文摘要数据集和可扩展注释方法

TalkSumm: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks

摘要

著录项

相似文献

相关主题

期刊订阅