How multilingual is Multilingual BERT?

Abstract

In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2019) as a single language model pre-trained from monolingual corpora in 104 languages, is surprisingly good at zero-shot cross-lingual model transfer, in which task-specific annotations in one language are used to fine-tune the model for evaluation in another language. To understand why, we present a large number of probing experiments, showing that transfer is possible even to languages in different scripts, that transfer works best between typologically similar languages, that monolingual corpora can train models for code-switching, and that the model can find translation pairs. From these results, we can conclude that M-BERT does create multilingual representations, but that these representations exhibit systematic deficiencies affecting certain language pairs.
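The zero-shot transfer setup described in the abstract can be illustrated with a minimal sketch, assuming the Hugging Face transformers and datasets libraries and XNLI as a stand-in classification task; these library and task choices are illustrative and not taken from the paper, whose own experiments use NER and part-of-speech tagging. The model is fine-tuned on English annotations only and then evaluated on a language whose labels it never sees during fine-tuning.

# Minimal sketch of zero-shot cross-lingual transfer with M-BERT
# (assumed setup: Hugging Face transformers + datasets, XNLI task).
import numpy as np
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "bert-base-multilingual-cased"  # M-BERT checkpoint released by Devlin et al. (2019)
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=3)

def encode(batch):
    # Tokenize premise/hypothesis pairs with M-BERT's shared multilingual vocabulary.
    return tokenizer(batch["premise"], batch["hypothesis"],
                     truncation=True, padding="max_length", max_length=128)

def accuracy(eval_pred):
    logits, labels = eval_pred
    return {"accuracy": (np.argmax(logits, axis=-1) == labels).mean()}

# Fine-tune on English annotations only ...
train_en = load_dataset("xnli", "en", split="train").map(encode, batched=True)
# ... and evaluate zero-shot on Spanish, whose labels are never seen during fine-tuning.
test_es = load_dataset("xnli", "es", split="test").map(encode, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="mbert-xnli-en",
                           num_train_epochs=1,
                           per_device_train_batch_size=32),
    train_dataset=train_en,
    eval_dataset=test_es,
    compute_metrics=accuracy,
)
trainer.train()
print(trainer.evaluate())  # zero-shot accuracy on Spanish reflects cross-lingual transfer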
