首页> 外文期刊>Machine Vision and Applications >Human interaction categorization by using audio-visual cues
【24h】

Human interaction categorization by using audio-visual cues

机译:通过视听提示进行人机交互分类

获取原文
获取原文并翻译 | 示例
           

摘要

Human Interaction Recognition (HIR) in uncontrolled TV video material is a very challenging problem because of the huge intra-class variability of the classes (due to large differences in the way actions are performed, lighting conditions and camera viewpoints, amongst others) as well as the existing small inter-class variability (e.g., the visual difference between hug and kiss is very subtle). Most of previous works have been focused only on visual information (i.e., image signal), thus missing an important source of information present in human interactions: the audio. So far, such approaches have not shown to be discriminative enough. This work proposes the use of Audio-Visual Bag of Words (AVB0W) as a more powerful mechanism to approach the HIR problem than the traditional Visual Bag of Words (VBOW). We show in this paper that the combined use of video and audio information yields to better classification results than video alone. Our approach has been validated in the challenging TVHID dataset showing that the proposed AVB0W provides statistically significant improvements over the VB0W employed in the related literature.
机译:由于类的类内差异很大(由于动作方式,照明条件和摄像机视点等存在很大差异),因此不受控制的电视视频材料中的人机交互识别(HIR)也非常具有挑战性因为现有的类别间差异很小(例如,拥抱和亲吻之间的视觉差异非常微妙)。以前的大多数作品都只专注于视觉信息(即图像信号),因此缺少了人类互动中存在的重要信息来源:音频。到目前为止,此类方法尚未显示出足够的区分性。这项工作建议使用视听单词袋(AVB0W)作为比传统的视觉单词袋(VBOW)更强大的机制来解决HIR问题。我们在本文中表明,视频和音频信息的组合使用比单独使用视频可获得更好的分类结果。我们的方法已经在具有挑战性的TVHID数据集中得到了验证,表明所提出的AVB0W与相关文献中采用的VB0W相比,具有统计学上的显着改进。

著录项

  • 来源
    《Machine Vision and Applications》 |2014年第1期|71-84|共14页
  • 作者单位

    Department of Computing and Numerical Analysis, Maimonides Institute for Biomedical Research (IMIBIC), University of Cordoba, 14071 Cordoba, Spain;

    Department of Computing and Numerical Analysis, Maimonides Institute for Biomedical Research (IMIBIC), University of Cordoba, 14071 Cordoba, Spain;

    Department of Computing and Numerical Analysis, Maimonides Institute for Biomedical Research (IMIBIC), University of Cordoba, 14071 Cordoba, Spain;

    Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Human interactions; Audio; Video; BOW;

    机译:人际互动;音频;视频;弓;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号