Computer Speech and Language

Multilingual and unsupervised subword modeling for zero-resource languages



Abstract

Subword modeling for zero-resource languages aims to learn low-level representations of speech audio without using transcriptions or other resources from the target language (such as text corpora or pronunciation dictionaries). A good representation should capture phonetic content and abstract away from other types of variability, such as speaker differences and channel noise. Previous work in this area has primarily focused on unsupervised learning from target language data only, and has been evaluated only intrinsically. Here we directly compare multiple methods, including some that use only target language speech data and some that use transcribed speech from other (non-target) languages, and we evaluate using two intrinsic measures as well as on a downstream unsupervised word segmentation and clustering task. We find that combining two existing target-language-only methods yields better features than either method alone. Nevertheless, even better results are obtained by extracting target language bottleneck features using a model trained on other languages. Cross-lingual training using just one other language is enough to provide this benefit, but multilingual training helps even more. In addition to these results, which hold across both intrinsic measures and the extrinsic task, we discuss the qualitative differences between the different types of learned features.
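The abstract does not include code, but the cross-lingual bottleneck-feature idea it describes can be illustrated with a minimal PyTorch sketch: a supervised acoustic model with a narrow bottleneck layer is trained on frame-level phone labels from a transcribed (non-target) language, and the bottleneck activations are then extracted as features for untranscribed target-language speech. The architecture, input features (39-dim MFCCs), bottleneck width (40), phone inventory size, and the random dummy data below are all illustrative assumptions, not the authors' actual setup.

```python
import torch
import torch.nn as nn

class BottleneckNet(nn.Module):
    """Feed-forward acoustic model with a narrow bottleneck layer (sketch)."""
    def __init__(self, n_input=39, n_hidden=512, n_bottleneck=40, n_phones=120):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(n_input, n_hidden), nn.ReLU(),
            nn.Linear(n_hidden, n_hidden), nn.ReLU(),
            nn.Linear(n_hidden, n_bottleneck),   # narrow bottleneck layer
        )
        self.classifier = nn.Sequential(
            nn.ReLU(),
            nn.Linear(n_bottleneck, n_phones),   # phone posteriors, used only in training
        )

    def forward(self, x):
        return self.classifier(self.encoder(x))

    def extract_bnf(self, x):
        # After supervised training on other languages, only the encoder is
        # applied to target-language frames; its output is the bottleneck feature.
        with torch.no_grad():
            return self.encoder(x)

# Supervised training on a transcribed non-target language
# (random tensors stand in for real MFCC frames and phone labels).
model = BottleneckNet()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
frames = torch.randn(256, 39)            # e.g. MFCCs with deltas
labels = torch.randint(0, 120, (256,))   # frame-level phone labels
for _ in range(5):
    opt.zero_grad()
    loss_fn(model(frames), labels).backward()
    opt.step()

# Zero-resource use: extract 40-dim bottleneck features for target speech.
target_frames = torch.randn(100, 39)
bnf = model.extract_bnf(target_frames)   # shape: (100, 40)
print(bnf.shape)
```

For multilingual training, the same sketch would share the encoder across several transcribed languages (for example, with one classifier head per language), which matches the abstract's finding that multilingual training helps more than training on a single other language.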


