Annual Meeting of the Association for Computational Linguistics

Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation

Abstract

Pretrained contextual and non-contextual subword embeddings have become available in over 250 languages, allowing massively multilingual NLP. However, while there is no dearth of pretrained embeddings, the distinct lack of systematic evaluations makes it difficult for practitioners to choose between them. In this work, we conduct an extensive evaluation comparing non-contextual subword embeddings, namely FastText and BPEmb, and a contextual representation method, namely BERT, on multilingual named entity recognition and part-of-speech tagging. We find that overall, a combination of BERT, BPEmb, and character representations works well across languages and tasks. A more detailed analysis reveals different strengths and weaknesses: Multilingual BERT performs well in medium- to high-resource languages, but is outperformed by non-contextual subword embeddings in a low-resource setting.
