
How multilingual is Multilingual BERT?

Abstract

In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2019) as a single language model pre-trained on monolingual corpora in 104 languages, is surprisingly good at zero-shot cross-lingual model transfer, in which task-specific annotations in one language are used to fine-tune the model for evaluation in another language. To understand why, we present a large number of probing experiments, showing that transfer is possible even to languages in different scripts, that transfer works best between typologically similar languages, that monolingual corpora can train models for code-switching, and that the model can find translation pairs. From these results, we can conclude that M-BERT does create multilingual representations, but that these representations exhibit systematic deficiencies affecting certain language pairs.
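To make the zero-shot transfer setup concrete, here is a minimal sketch using the public "bert-base-multilingual-cased" checkpoint via the Hugging Face transformers library. It fine-tunes M-BERT on a toy English sentence-classification task and then evaluates on a Spanish sentence that contributed no training labels. The tiny English/Spanish examples and the binary classification task are illustrative assumptions, not the paper's actual benchmarks or data.

```python
# Minimal sketch of zero-shot cross-lingual transfer with M-BERT.
# Assumes: transformers + torch installed; toy placeholder data.
import torch
from torch.optim import AdamW
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "bert-base-multilingual-cased"  # the public M-BERT checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=2)

# Task-specific annotations in one language (here: English) ...
train_texts = ["I loved this film.", "This film was terrible."]
train_labels = torch.tensor([1, 0])

# ... are used to fine-tune the shared multilingual encoder (one toy step).
optimizer = AdamW(model.parameters(), lr=2e-5)
model.train()
batch = tokenizer(train_texts, padding=True, return_tensors="pt")
loss = model(**batch, labels=train_labels).loss
loss.backward()
optimizer.step()

# Evaluation happens in another language (here: Spanish) whose labels were
# never seen during fine-tuning: zero-shot cross-lingual transfer.
model.eval()
test_batch = tokenizer(["Esta película fue horrible."], return_tensors="pt")
with torch.no_grad():
    pred = model(**test_batch).logits.argmax(dim=-1)
print(pred)  # predicted class index for the Spanish sentence
```

In a real experiment the fine-tuning loop would iterate over a full labeled dataset in the source language, and evaluation would be run over a labeled test set in each target language to measure how accuracy varies with script and typological similarity.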
