首页> 外文会议>European Conference on Speech Communication and Technology - EUROSPEECH >Enhanced Tree Clustering with Single Pronunciation Dictionary for Conversational Speech Recognition
【24h】

Enhanced Tree Clustering with Single Pronunciation Dictionary for Conversational Speech Recognition

机译:增强的树群与单一发音字典为会话语音识别

获取原文

摘要

Modeling pronunciation variation is key for recognizing conversational speech. Rather than being limited to dictionary modeling, we argue that triphone clustering is an integral part of pronunciation modeling. We propose a new approach called enhanced tree clustering. This approach, in contrast to traditional decision tree based state tying, allows parameter sharing across phonemes. We show that accurate pronunciation modeling can be achieved through efficient parameter sharing in the acoustic model. Combined with a single pronunciation dictionary, a 1.8% absolute word error rate improvement is achieved on Switchboard, a large vocabulary conversational speech recognition task.
机译:建模发音变化是识别会话语音的关键。我们认为Triphone群集是语言的一个不可或缺的发音部分。我们提出了一种称为增强树聚类的新方法。与基于传统的决策树的状态相比,这种方法涉及跨越音素的参数共享。我们表明,通过声学模型中的有效参数共享,可以实现准确的发音建模。结合单个发音词典,在交换机上实现了1.8%的绝对字错误率改进,大词汇表会话语音识别任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号