Robust speech recognition apparatus and method for Bayesian feature enhancement using independent vector analysis and reverberation parameter reestimation

Abstract

The present invention relates to a speech recognition apparatus that performs Bayesian feature enhancement using independent vector analysis and re-estimated reverberation filter parameters, and to a method thereof. The speech recognition method includes the steps of: (a) applying a short-time Fourier transform to multiple speech signals input from the outside and outputting the resulting signal for each frequency band; (b) estimating IVA noise signals and IVA target speech signals by performing independent vector analysis (IVA) on the signals in the frequency bands; (c) extracting speech features, for hidden Markov model (HMM) based Bayesian feature enhancement (BFE), from the IVA target speech signals estimated by the IVA; (d) scaling the IVA noise signals estimated by the IVA using the IVA target speech signals, and extracting noise features from the scaled IVA noise signals; (e) estimating initial speech signals by enhancing the speech features through HMM-based BFE using the speech features and the initial values of the reverberation filter parameters; (f) re-estimating the reverberation filter parameters using the estimated initial speech signals and the noise features; and (g) estimating the final speech signals by enhancing the speech features using the re-estimated reverberation filter parameters.
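
Step (a), the per-channel transform into frequency bands, can be illustrated with a minimal short-time Fourier transform sketch. This is not the patent's implementation: the function name `stft_multichannel`, the Hann window, the frame length of 512 samples, and the hop of 128 samples are all assumptions chosen for illustration, and the later steps (IVA, BFE, reverberation parameter re-estimation) are not covered.

```python
import numpy as np


def stft_multichannel(x, frame_len=512, hop=128):
    """Short-time Fourier transform of a multichannel signal.

    x: array of shape (channels, samples).
    Returns a complex array of shape (channels, frames, frame_len // 2 + 1).
    Frame length, hop size, and the Hann window are illustrative assumptions.
    """
    x = np.atleast_2d(np.asarray(x, dtype=float))
    n_frames = 1 + (x.shape[1] - frame_len) // hop
    if n_frames < 1:
        raise ValueError("signal is shorter than one frame")
    window = np.hanning(frame_len)
    # Row i of idx holds the sample indices belonging to frame i.
    idx = hop * np.arange(n_frames)[:, None] + np.arange(frame_len)[None, :]
    frames = x[:, idx] * window  # shape: (channels, frames, frame_len)
    # Real-input FFT along the time axis of each frame.
    return np.fft.rfft(frames, axis=-1)


# Two-microphone toy input: a 1 kHz tone sampled at 16 kHz, plus a scaled copy.
fs = 16000
t = np.arange(fs) / fs
tone = np.sin(2 * np.pi * 1000 * t)
X = stft_multichannel(np.stack([tone, 0.5 * tone]))
```

Each `X[c, :, k]` is then the time sequence of frequency band `k` for microphone `c`, which is the form of input that a frequency-domain separation stage such as IVA operates on.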

Bibliographic data

  • Publication number: KR101802444B1

  • Publication date: 2017-11-29

  • Original format: PDF

  • Applicant/patentee: SOGANG UNIVERSITY RESEARCH FOUNDATION

  • Application number: KR20160089966

  • Inventors: PARK HYUNG MIN; CHO JI WON

  • Filing date: 2016-07-15

  • Classification: G10L15/20; G10L15/14; G10L19/02; G10L19/035; G10L21/0208

  • Country: KR

  • Added to database: 2022-08-21 12:41:34
