East Indonesia Conference on Computer and Information Technology

Indonesian Abstractive Summarization using Pre-trained Model


Abstract

Automatic text summarization systems are increasingly needed to cope with the information explosion caused by the growth of the internet. Since Indonesian is still considered an under-resourced language, we take advantage of pre-trained language models to perform abstractive summarization. This paper investigates BERT's performance on Indonesian articles by comparing several BERT pre-trained models and evaluating the results with ROUGE scores. Our experiments show that an English pre-trained model can produce a good summary of Indonesian text, but an Indonesian pre-trained model is more effective. The default training setup, using only the abstractive objective, outperforms two-stage fine-tuning, in which an extractive model must be trained in advance. We also found many meaningless words in the generated summaries. These findings are the result of a preliminary study toward improving Indonesian abstractive summarization models.
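The abstract reports evaluation with ROUGE scores. As a minimal illustrative sketch (not the authors' evaluation code, which is not shown here), ROUGE-N recall counts the n-grams of the reference summary that also appear in the candidate summary; the example sentences below are invented for demonstration:

```python
from collections import Counter

def ngrams(tokens, n):
    """Return a multiset (Counter) of n-grams from a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def rouge_n(candidate, reference, n=1):
    """ROUGE-N recall: overlapping n-grams / total n-grams in the reference."""
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    overlap = sum((cand & ref).values())  # clipped n-gram overlap
    total = sum(ref.values())
    return overlap / total if total else 0.0

# Hypothetical reference and system summaries (Indonesian word order differs)
reference = "model pretrained bahasa indonesia menghasilkan ringkasan"
candidate = "model pretrained menghasilkan ringkasan bahasa indonesia"

print(rouge_n(candidate, reference, n=1))  # 1.0: every unigram is covered
print(rouge_n(candidate, reference, n=2))  # 0.6: only 3 of 5 bigrams match
```

Note that ROUGE-1 is insensitive to word order, which is why bigram and longer variants (ROUGE-2, ROUGE-L) are usually reported alongside it; published comparisons typically use a standard implementation rather than a hand-rolled one like this.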

