Journal: Computer Vision and Image Understanding

Multi-modal human aggression detection


Abstract

This paper presents a smart surveillance system named CASSANDRA, aimed at detecting instances of aggressive human behavior in public environments. A distinguishing aspect of CASSANDRA is the exploitation of complementary audio and video cues to disambiguate scene activity in real-life environments. From the video side, the system uses overlapping cameras to track persons in 3D and to extract features regarding the limb motion relative to the torso. From the audio side, it classifies instances of speech, screaming, singing, and kicking-object. The audio and video cues are fused with contextual cues (interaction, auxiliary objects); a Dynamic Bayesian Network (DBN) produces an estimate of the ambient aggression level. Our prototype system is validated on a realistic set of scenarios performed by professional actors at an actual train station to ensure a realistic audio and video noise setting.
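
The fusion step described in the abstract can be illustrated with a toy example. The sketch below is not the CASSANDRA model: it is a minimal discrete Bayes filter (the simplest form of a Dynamic Bayesian Network) over a hypothetical three-level aggression state, assuming the audio and video cues are conditionally independent given that state. The state names, the "calm"/"agitated" video cue, and every probability are invented for illustration; only the four audio classes come from the abstract.

```python
# Illustrative only: all states and probabilities below are made-up
# assumptions, not values or structure taken from the CASSANDRA paper.
STATES = ["low", "medium", "high"]

# P(state_t | state_{t-1}): the aggression level tends to persist over time.
TRANSITION = {
    "low":    {"low": 0.90, "medium": 0.09, "high": 0.01},
    "medium": {"low": 0.15, "medium": 0.70, "high": 0.15},
    "high":   {"low": 0.05, "medium": 0.25, "high": 0.70},
}

# P(audio cue | state), using the audio classes named in the abstract.
AUDIO_LIKELIHOOD = {
    "low":    {"speech": 0.70, "singing": 0.20, "screaming": 0.05, "kicking-object": 0.05},
    "medium": {"speech": 0.50, "singing": 0.10, "screaming": 0.25, "kicking-object": 0.15},
    "high":   {"speech": 0.20, "singing": 0.05, "screaming": 0.45, "kicking-object": 0.30},
}

# P(video cue | state): a hypothetical two-way discretisation of limb motion.
VIDEO_LIKELIHOOD = {
    "low":    {"calm": 0.80, "agitated": 0.20},
    "medium": {"calm": 0.50, "agitated": 0.50},
    "high":   {"calm": 0.20, "agitated": 0.80},
}


def step(belief, audio, video):
    """One filtering step: predict with the transition model, then
    weight by both cue likelihoods and renormalise."""
    predicted = {
        s: sum(belief[p] * TRANSITION[p][s] for p in STATES) for s in STATES
    }
    unnormalised = {
        s: predicted[s] * AUDIO_LIKELIHOOD[s][audio] * VIDEO_LIKELIHOOD[s][video]
        for s in STATES
    }
    total = sum(unnormalised.values())
    return {s: v / total for s, v in unnormalised.items()}


def filter_sequence(observations, prior=None):
    """Run the filter over (audio, video) pairs; return the belief per step."""
    belief = dict(prior) if prior else {s: 1.0 / len(STATES) for s in STATES}
    history = []
    for audio, video in observations:
        belief = step(belief, audio, video)
        history.append(belief)
    return history


if __name__ == "__main__":
    observed = [("speech", "calm"), ("screaming", "agitated"), ("kicking-object", "agitated")]
    for t, belief in enumerate(filter_sequence(observed)):
        print(t, {s: round(p, 3) for s, p in belief.items()})
```

In this toy setting an ambiguous audio cue such as "speech" is pulled toward a low or high estimate by the accompanying video cue, which is the kind of disambiguation the abstract attributes to combining modalities. The real system also includes contextual cues (interaction, auxiliary objects), which this sketch omits.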