IEEE International Conference on Acoustics, Speech and Signal Processing

Multilingual Acoustic Word Embedding Models for Processing Zero-resource Languages



Abstract

Acoustic word embeddings are fixed-dimensional representations of variable-length speech segments. In settings where unlabelled speech is the only available resource, such embeddings can be used in "zero-resource" speech search, indexing and discovery systems. Here we propose to train a single supervised embedding model on labelled data from multiple well-resourced languages and then apply it to unseen zero-resource languages. For this transfer learning approach, we consider two multilingual recurrent neural network models: a discriminative classifier trained on the joint vocabularies of all training languages, and a correspondence autoencoder trained to reconstruct word pairs. We test these using a word discrimination task on six target zero-resource languages. When trained on seven well-resourced languages, both models perform similarly and outperform unsupervised models trained on the zero-resource languages. With just a single training language, the second model works better, but performance depends more on the particular training–testing language pair.
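The central mechanism the abstract describes, a recurrent encoder that maps a variable-length speech segment to a fixed-dimensional vector which can then be compared by a simple distance for word discrimination, can be sketched in plain Python. This is an illustrative toy with random, untrained weights, not the paper's trained multilingual models; `FEAT_DIM`, `EMBED_DIM`, and the synthetic frame features are placeholder assumptions.

```python
import math
import random

random.seed(0)

FEAT_DIM = 13    # acoustic features per frame (e.g. MFCCs; an assumption)
EMBED_DIM = 8    # fixed embedding dimensionality

# Random recurrent weights stand in for a trained multilingual model.
W_in = [[random.gauss(0, 0.1) for _ in range(FEAT_DIM)] for _ in range(EMBED_DIM)]
W_rec = [[random.gauss(0, 0.1) for _ in range(EMBED_DIM)] for _ in range(EMBED_DIM)]

def embed(frames):
    """Map a variable-length sequence of feature frames to a
    fixed-dimensional vector: the RNN's final hidden state."""
    h = [0.0] * EMBED_DIM
    for x in frames:
        h = [math.tanh(sum(W_in[i][j] * x[j] for j in range(FEAT_DIM)) +
                       sum(W_rec[i][j] * h[j] for j in range(EMBED_DIM)))
             for i in range(EMBED_DIM)]
    return h

def cosine(a, b):
    """Cosine similarity between two embeddings."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Two "spoken words" of different lengths map to same-size embeddings,
# so they can be compared directly, as in the word discrimination task.
seg_a = [[random.gauss(0, 1) for _ in range(FEAT_DIM)] for _ in range(50)]
seg_b = [[random.gauss(0, 1) for _ in range(FEAT_DIM)] for _ in range(32)]

e_a, e_b = embed(seg_a), embed(seg_b)
assert len(e_a) == len(e_b) == EMBED_DIM
print(cosine(e_a, e_b))
```

In the paper's setting, the encoder's weights would come either from the multilingual discriminative classifier or from the correspondence autoencoder; only the trained encoder is needed at test time on a zero-resource language.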