Conference: 2016 IEEE International Workshop on Acoustic Signal Enhancement
Voice activity detection based on statistical likelihood ratio with adaptive thresholding

Abstract

The statistical likelihood ratio test is a widely used voice activity detection (VAD) method in which the likelihood ratio of the current temporal frame is compared with a threshold. A fixed threshold is usually used, but it is not suitable across different types of noise. In this paper, an adaptive threshold is proposed as a function of the local statistics of the likelihood ratio. This threshold acts as an upper bound on the likelihood ratio of non-speech frames while remaining generally lower than the likelihood ratio of speech frames. As a result, a high non-speech hit rate can be achieved while keeping the speech hit rate as high as possible.
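
The abstract does not give the exact form of the adaptive threshold, so the following is only a minimal sketch of the general idea, not the paper's formula. It assumes the threshold is the local mean plus `k` local standard deviations of the likelihood ratio over recent frames judged to be non-speech, and that the first `init_frames` frames contain noise only. The function name `adaptive_lr_vad` and the parameters `window`, `k`, `init_frames`, and `min_std` are hypothetical choices made for illustration.

```python
from collections import deque
from statistics import mean, pstdev


def adaptive_lr_vad(llr, window=30, k=3.0, init_frames=10, min_std=1e-3):
    """Label each frame as speech (True) or non-speech (False).

    llr: per-frame (log-)likelihood ratios, assumed to be computed elsewhere.
    The threshold rule (mean + k * std of recent non-speech frames) is an
    illustrative assumption, not the rule proposed in the paper.
    """
    history = deque(maxlen=window)  # recent ratios from frames judged non-speech
    decisions, thresholds = [], []
    for i, value in enumerate(llr):
        if i < init_frames:
            # Assumption: the leading frames are noise only and seed the statistics.
            history.append(value)
            decisions.append(False)
            thresholds.append(None)
            continue
        mu = mean(history)
        sigma = max(pstdev(history), min_std)  # floor avoids a zero-width band
        threshold = mu + k * sigma             # approximate upper bound for noise
        is_speech = value > threshold
        if not is_speech:
            # Only non-speech frames update the local statistics, so the
            # threshold tracks the noise level rather than the speech level.
            history.append(value)
        decisions.append(is_speech)
        thresholds.append(threshold)
    return decisions, thresholds


# Usage: ten noise-like frames, two high-ratio frames, then noise again.
ratios = [0.1, 0.2, 0.15, 0.1, 0.2] * 2 + [5.0, 6.0, 0.15, 0.1]
decisions, thresholds = adaptive_lr_vad(ratios)
print(decisions[10:])
```

Updating the statistics only on frames classified as non-speech is one way to keep the threshold above the noise-frame ratios while staying below the speech-frame ratios; other local statistics (for example a median or a minimum tracker) could serve the same purpose.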