Enhanced Tree Clustering with Single Pronunciation Dictionary for Conversational Speech Recognition

机译：增强的树群与单一发音字典为会话语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Modeling pronunciation variation is key for recognizing conversational speech. Rather than being limited to dictionary modeling, we argue that triphone clustering is an integral part of pronunciation modeling. We propose a new approach called enhanced tree clustering. This approach, in contrast to traditional decision tree based state tying, allows parameter sharing across phonemes. We show that accurate pronunciation modeling can be achieved through efficient parameter sharing in the acoustic model. Combined with a single pronunciation dictionary, a 1.8% absolute word error rate improvement is achieved on Switchboard, a large vocabulary conversational speech recognition task.

机译：建模发音变化是识别会话语音的关键。我们认为Triphone群集是语言的一个不可或缺的发音部分。我们提出了一种称为增强树聚类的新方法。与基于传统的决策树的状态相比，这种方法涉及跨越音素的参数共享。我们表明，通过声学模型中的有效参数共享，可以实现准确的发音建模。结合单个发音词典，在交换机上实现了1.8％的绝对字错误率改进，大词汇表会话语音识别任务。

著录项

来源
《European Conference on Speech Communication and Technology - EUROSPEECH》|2003年||共4页
会议地点
作者
Hua Yu; Tanja Schultz; International Speech Communication Association(ISCA);
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动信息理论;
关键词

相似文献

外文文献
中文文献
专利

1. Pronunciation change in conversational speech and its implications for automatic speech recognition [J] . Murat Sarclar, Sanjeev Khudanpur Computer speech and language . 2004,第4期

机译：会话语音中的语音变化及其对自动语音识别的影响
2. Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition [J] . Xue J., Zhao Y. IEEE transactions on audio, speech and language processing . 2008,第3期

机译：会话语音识别中语音建模的语音决策树随机森林
3. Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition [J] . Baby Deepak, Virtanen Tuomas, Gemmeke Jort F., Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2015,第11期

机译：耦合字典用于基于示例的语音增强和自动语音识别
4. Enhanced Tree Clustering with Single Pronunciation Dictionary for Conversational Speech Recognition [C] . Hua Yu, Tanja Schultz, International Speech Communication Association(ISCA) European Conference on Speech Communication and Technology - EUROSPEECH . 2003

机译：增强的树群与单一发音字典为会话语音识别
5. Pronunciation modeling for conversational speech recognition. [D] . Saraclar, Murat. 2001

机译：用于对话语音识别的语音建模。
6. A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech [O] . Jodi Kodish-Wachs, Emin Agassi, Patrick Kenny III, 2018

机译：当代自动语音识别引擎用于对话式临床语音的系统比较
7. Pronunciation Modelling For Conversational Speech Recognition: A Status Report From WS97 [O] . B. Byrne, M. Finke, S. Khudanpur, 1997

机译：会话语音识别的发音建模：来自Ws97的状态报告

Enhanced Tree Clustering with Single Pronunciation Dictionary for Conversational Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅