首页> 外文会议>IEEE International Symposium on Robot and Human Interactive Communication >Single-channel particular voice activity detection for monitoring the violence situations
【24h】

Single-channel particular voice activity detection for monitoring the violence situations

机译:单通道特定语音活动检测以监视暴力情况

获取原文

摘要

The proposed algorithm in this paper is capable of classifying not only unusual speech when people get anger, surprised, or excited but also unusual noise such as clashing, hitting, or clapping in real-time without depending on particular speaker voices or utterances. Also, it does not require a prior learning process to construct acoustic models. This algorithm, therefore, allows a surveillance camera system to effectively monitor quarrel or violent situations regardless of object shields and light conditions. To realize our approach, we analyze the variance and change of spectral densities and pitches when unusual speech and noise occur. We then propose new methods (SEBNI, USDF, and UNDF) to classify unusual sounds in real-time. Moreover, to improve performance, we apply a noise suppression system based on MMSE-STSA and a statistic model-based VAD to our algorithm in order to extract reliable voice features and segment only voice-related periods in noisy environments. We confirm that our proposed method achieves an 87% accuracy performance for classifying unusual speech.
机译:本文中提出的算法不仅能够对人们发怒,惊讶或兴奋时的异常语音进行分类,而且还能够实时分类异常的噪声(例如碰撞,敲打或拍手),而无需依赖于特定的说话者语音或发声。而且,它不需要事先学习过程即可构建声学模型。因此,该算法允许监视摄像头系统有效监视争吵或暴力情况,而不管物体的防护罩和光线条件如何。为了实现我们的方法,我们分析了发生异常语音和噪声时频谱密度和音高的变化和变化。然后,我们提出了新的方法(SEBNI,USDF和UNDF)来实时对异常声音进行分类。此外,为了提高性能,我们将基于MMSE-STSA的噪声抑制系统和基于统计模型的VAD应用于我们的算法,以提取可靠的语音特征并仅在嘈杂的环境中分割与语音相关的时段。我们确认,我们提出的方法可以对不正常语音进行分类,达到87%的准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号