Modelling the uncertainty in recovering articulation from acoustics

Korin Richmond; Simon King; Paul Taylor

Abstract

This paper presents an experimental comparison of the performance of the multilayer perceptron (MLP) with that of the mixture density network (MDN) for an acoustic-to-articulatory mapping task. A corpus of acoustic-articulatory data recorded by electromagnetic articulography (EMA) for a single speaker was used as training and test data for this purpose. In theory, the MDN is able to provide a richer, more flexible description of the target variables in response to a given input vector than the least-squares trained MLP. Our results show that the mean likelihoods of the target articulatory parameters for an unseen test set were indeed consistently higher with the MDN than with the MLP. The increase ranged from approximately 3% to 22%, depending on the articulatory channel in question. On the basis of these results, we argue that using a more flexible description of the target domain, such as that offered by the MDN, can prove beneficial when modelling the acoustic-to-articulatory mapping.
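The contrast drawn in the abstract can be illustrated with a small numerical sketch. It is not the authors' implementation: the abstract gives no network sizes, component counts, or data values, so the target positions, mixture parameters, and function names below are all invented for illustration. The sketch assumes a single acoustic frame that is compatible with two distinct articulator positions. It then compares the mean likelihood under a single Gaussian centred on the least-squares output (the MLP view) with that under a two-component Gaussian mixture (the MDN view).

```python
import math

def gaussian_pdf(t, mean, std):
    """Density of a 1-D Gaussian at target t."""
    z = (t - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def mixture_pdf(t, weights, means, stds):
    """Density of a 1-D Gaussian mixture at target t (weights sum to 1)."""
    return sum(w * gaussian_pdf(t, m, s) for w, m, s in zip(weights, means, stds))

def softmax(logits):
    """Map unconstrained network outputs to mixture weights that sum to 1."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical data: two articulator positions (arbitrary units) that are
# both plausible for the same acoustic input. Values are invented.
targets = [-1.0, 1.0]

# MLP view: least-squares training pulls the output towards the conditional
# mean of the targets; a single Gaussian with the residual spread around
# that mean gives the likelihood of each target.
mlp_mean = sum(targets) / len(targets)
mlp_std = math.sqrt(sum((t - mlp_mean) ** 2 for t in targets) / len(targets))
mlp_likelihood = sum(gaussian_pdf(t, mlp_mean, mlp_std) for t in targets) / len(targets)

# MDN view: the network outputs mixture parameters, so one component can sit
# on each plausible position. The component means and widths are assumed here.
weights = softmax([0.0, 0.0])  # equal logits give equal mixture weights
mdn_likelihood = sum(
    mixture_pdf(t, weights, [-1.0, 1.0], [0.2, 0.2]) for t in targets
) / len(targets)

print("mean likelihood, single Gaussian (MLP view):", mlp_likelihood)
print("mean likelihood, Gaussian mixture (MDN view):", mdn_likelihood)
```

In this toy case the mixture assigns a higher mean likelihood to the targets because the single Gaussian has to spread its mass across the gap between the two positions. The size of the gap depends entirely on the invented values and says nothing about the 3% to 22% increases reported in the abstract.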

Bibliographic details

Source
Computer Speech and Language | 2003, Issue 3 | pp. 153-172 | 20 pages
Authors
Korin Richmond; Simon King; Paul Taylor
Affiliation
Centre for Speech Technology Research, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9LW, UK
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
