TalkSumm: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks

Abstract

Currently, no large-scale training data is available for the task of scientific paper summarization. In this paper, we propose a novel method that automatically generates summaries for scientific papers, by utilizing videos of talks at scientific conferences. We hypothesize that such talks constitute a coherent and concise description of the papers' content, and can form the basis for good summaries. We collected 1716 papers and their corresponding videos, and created a dataset of paper summaries. A model trained on this dataset achieves performance similar to that of models trained on a dataset of manually created summaries. In addition, human experts validated the quality of our summaries.

Bibliographic record

Source
Annual Meeting of the Association for Computational Linguistics | 2019 | pp. 2125-2131 | 7 pages
Authors
Guy Lev; Michal Shmueli-Scheuer; Jonathan Herzig; Achiya Jerbi; David Konopnicki
Original format: PDF
