IEEE Conference on Computer Vision and Pattern Recognition Workshops
Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video

Abstract

This paper is part of a larger effort to detect manipulations of video by searching for and combining the evidence of multiple types of inconsistencies between the audio and visual channels. Here, we focus on inconsistencies between the type of scenes detected in the audio and visual modalities (e.g., audio indoor, small room versus visual outdoor, urban), and inconsistencies in speaker identity tracking over a video given audio speaker features and visual face features (e.g., a voice change, but no talking face change). The scene inconsistency task was complicated by mismatches in the categories used in current visual scene and audio scene collections. To deal with this, we employed a novel semantic mapping method. The speaker identity inconsistency process was challenged by the complexity of comparing face tracks and audio speech clusters, requiring a novel method of fusing these two sources. Our progress on both tasks was demonstrated on two collections of tampered videos.
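
The abstract's example of a speaker identity inconsistency ("a voice change, but no talking face change") can be illustrated with a minimal sketch. This is not the authors' fusion method, which the abstract does not describe. It assumes, purely for illustration, that the video has already been split into aligned segments and that each segment carries one audio speaker-cluster label and one talking-face track label. The function name and the label values are hypothetical.

```python
from typing import List


def voice_change_without_face_change(
    speaker_ids: List[str], face_ids: List[str]
) -> List[int]:
    """Return indices i where the audio speaker label changes between
    segment i-1 and segment i while the talking-face label stays the same.

    Assumption (not from the paper): both sequences are aligned, with
    one label per segment.
    """
    if len(speaker_ids) != len(face_ids):
        raise ValueError("label sequences must have the same length")

    flagged = []
    for i in range(1, len(speaker_ids)):
        voice_changed = speaker_ids[i] != speaker_ids[i - 1]
        face_changed = face_ids[i] != face_ids[i - 1]
        # A new voice attached to an unchanged face is the suspicious case.
        if voice_changed and not face_changed:
            flagged.append(i)
    return flagged


# Hypothetical per-segment labels for a five-segment clip.
speakers = ["spk0", "spk0", "spk1", "spk1", "spk2"]
faces = ["faceA", "faceA", "faceA", "faceB", "faceC"]
print(voice_change_without_face_change(speakers, faces))  # [2]
```

Only segment 2 is flagged here: the voice switches from spk0 to spk1 while faceA remains on screen. At segment 4 both the voice and the face change, so nothing is reported. A real system would also have to handle noisy clustering and off-screen speakers, which this sketch ignores.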