ADAPTIVE SPATIAL VAD AND TIME-FREQUENCY MASK ESTIMATION FOR HIGHLY NON-STATIONARY NOISE SOURCES

Abstract

Systems and methods include: a first voice activity detector operable to detect speech in a frame of a multichannel audio input signal and output a speech determination; a constrained minimum variance adaptive filter operable to receive the multichannel audio input signal and the speech determination and to minimize the signal variance at the filter output, thereby producing an equalized target speech signal; a mask estimator operable to receive the equalized target speech signal and the speech determination and to generate a spectral-temporal mask that discriminates target speech from noise and interfering speech; and a second voice activity detector operable to detect voice in a frame of the speech-discriminated signal. An audio input sensor array includes a plurality of microphones, each generating one channel of the multichannel audio input signal. A sub-band analysis module is operable to decompose each channel into a plurality of frequency sub-bands.
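To make the filtering step in the abstract more concrete, the following is a minimal sketch of a generic constrained minimum variance filter for a single frequency sub-band and two microphones. It is not taken from the patent. The function names, the constraint `w^H c = 1` with a known target direction `c`, the diagonal loading term, and the choice to adapt only on frames flagged as non-speech are all assumptions made for illustration; the abstract does not specify the constraint or which frames drive adaptation.

```python
def spatial_covariance(frames, speech_flags):
    """Average 2x2 covariance R = E[x x^H] of one sub-band.

    frames: list of (x0, x1) complex sub-band samples, one pair per frame.
    speech_flags: per-frame speech determination from a VAD.
    Assumption: only non-speech frames are used for adaptation.
    Returns (r00, r01, r11); r10 is the conjugate of r01.
    """
    r00 = r11 = 0.0
    r01 = 0j
    count = 0
    for (x0, x1), is_speech in zip(frames, speech_flags):
        if is_speech:
            continue
        r00 += abs(x0) ** 2
        r11 += abs(x1) ** 2
        r01 += x0 * x1.conjugate()
        count += 1
    if count == 0:
        raise ValueError("no non-speech frames to adapt on")
    return r00 / count, r01 / count, r11 / count


def min_variance_weights(cov, c, loading=1e-6):
    """Weights minimizing w^H R w subject to w^H c = 1.

    Closed form: w = R^-1 c / (c^H R^-1 c), with the 2x2 inverse written out.
    `loading` is hypothetical diagonal loading that keeps R invertible.
    """
    r00, r01, r11 = cov
    r00 += loading
    r11 += loading
    det = r00 * r11 - abs(r01) ** 2
    c0, c1 = c
    # u = R^-1 c
    u0 = (r11 * c0 - r01 * c1) / det
    u1 = (r00 * c1 - r01.conjugate() * c0) / det
    denom = c0.conjugate() * u0 + c1.conjugate() * u1  # c^H R^-1 c
    return u0 / denom, u1 / denom


def apply_filter(w, x):
    """Filter output y = w^H x for one frame of one sub-band."""
    return w[0].conjugate() * x[0] + w[1].conjugate() * x[1]


# Illustrative directions (made up): a coherent noise source and a target.
noise_dir = (1 + 0j, 1j)
target_dir = (1 + 0j, 1 + 0j)
frames = [(n * noise_dir[0], n * noise_dir[1])
          for n in (1 + 0j, -1j, 0.5 + 0.5j, -1 + 0j)]
flags = [False] * len(frames)  # pretend the VAD found no speech
w = min_variance_weights(spatial_covariance(frames, flags), target_dir)
```

With these weights, `apply_filter(w, target_dir)` stays at unit gain while `apply_filter(w, noise_dir)` is driven close to zero, which is the sense in which the output variance is minimized under the constraint. A full system would repeat this per sub-band and update the covariance recursively over time.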
Bibliographic data

  • Publication number: US2020219530A1

    Patent type:

  • Publication date: 2020-07-09

    Original format: PDF

  • Applicant/Assignee: SYNAPTICS INCORPORATED

    Application number: US202016735575

  • Inventors: FRANCESCO NESTA; ALIREZA MASNADI-SHIRAZI

    Filing date: 2020-01-06

  • Classification: G10L25/84; H04R1/40; H04R3; H04R5/027; G10L25/18; G10L15/16; G10L25/21; G10L15/22; G06N3/08

  • Country: US

  • Indexed on: 2022-08-21 11:20:00
