语音关键词识别中基于MLP帧级子词后验概率的置信度方法

李文昕; 屈丹; 李弼程; 刘崧

首页> 中文期刊> 《信号处理》 >语音关键词识别中基于MLP帧级子词后验概率的置信度方法

语音关键词识别中基于MLP帧级子词后验概率的置信度方法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

As the confidence measures in the scheme of Hidden Markov Model ( HMM) in keyword spotting system have some shortcomings, a confidence measure based on frame-level sub-word posterior probability of Multi-layer Perception ( MLP) is presented in this paper. Conventionally, the confidence is calculated from the acoustic and language model scores computed by the recogniser of HMM model, which makes some incorrect assumptions, such as the frame-wise and possibly component-wise independence of acoustic features, and a finite number of Gaussian mixtures. The proposed confidence measure is directly calculated from the frame-level sub-word posterior probabilities produced by a MLP network. The confidence estimation is completely separated from the keyword spotting and they use two different models. With this separation, decision making can be addressed with more reliable confidence and multiple confidence features can be integrated to improve the decision quality. The experimental results show that the proposed approach in this paper is better than the mainstream confidence measures in the framework of HMM model and they have good complement, when combining with the mainstream confidence measures in the scheme of HMM model, the Equal Error Rate ( EER) of keyword spotting system a-chievesll.5% relative improvement.%针对关键词检测系统中HMM模型框架下置信度计算存在的不足,本文提出了基于MLP帧级子词后验概率的置信度方法.与HMM模型框架下利用声学模型得分与语言模型得分进行置信度计算不同的是,该方法在MLP模型框架下直接将其输出的每帧语音类别的后验概率用于关键词置信度的计算,克服了HMM建模时假设每帧语音的声学特征相互独立以及对状态建模时采用有限混元的高斯分布的不足.关键词检出和置信度确认使用两套不同的模型结构,是两个完全独立的过程,便于融合其他的置信度特征.实验结果表明,本文提出的方法优于HMM框架下主流的置信度计算方法,且与其具有较好的互补性.因此本文将两种不同框架下不同的置信度方法进行融合,系统的等错误率(EER)相对提高了11.5％.

著录项

来源
《信号处理》 |2012年第7期|1051-1056|共6页
作者
李文昕; 屈丹; 李弼程; 刘崧;
展开▼
作者单位

解放军信息工程大学信息工程学院,河南郑州450002;

解放军信息工程大学信息工程学院,河南郑州450002;

解放军信息工程大学信息工程学院,河南郑州450002;

恒生数字有限公司,浙江杭州310012;

展开▼
原文格式 PDF
正文语种 chi
中图分类 TP391.42;
关键词
关键词检出; 置信度计算; 多层感知器; 后验概率;

相似文献

中文文献
外文文献
专利

1. 基于音素后验概率的样例语音关键词检测方法 [J] . 张卫强 ,宋贝利 ,蔡猛 . 天津大学学报 . 2015,第009期
2. 语音关键词检测系统中基于时长和边界信息的置信度 [J] . 李文昕 ,屈丹 ,李弼程 . 应用科学学报 . 2012,第006期
3. 语音关键词检测中置信测度方法研究综述 [J] . 李海洋 ,韩纪庆 ,郑贵滨 . 智能计算机与应用 . 2014,第002期
4. 语音关键词检测中置信测度方法研究综述 [J] . 李海洋 ,韩纪庆 ,郑贵滨 . 智能计算机与应用 . 2014,第002期
5. 基于SVM的置信度综合方法在语音识别中的应用 [J] . 黄石磊 ,匡镜明 ,谢湘 . 北京理工大学学报 . 2007,第3期
6. 基于关键词的句法分析及在连续语音识别中的应用 [C] . 俞一彪 ,顾晓东 ,赵鹤鸣 . 第九届全国信号处理学术年会 . 1999
7. 汉语语音关键词检测中置信测度研究 [A] . 李海洋 . 2014

语音关键词识别中基于MLP帧级子词后验概率的置信度方法

摘要

著录项

相似文献

相关主题

期刊订阅