首页> 外文会议>European conference on computer vision >Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames
【24h】

Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames

机译:通过相互表决相关的Web图像和Web视频帧来进行Web监督的视频识别

获取原文

摘要

Video recognition usually requires a large amount of training samples, which are expensive to be collected. An alternative and cheap solution is to draw from the large-scale images and videos from the Web. With modern search engines, the top ranked images or videos are usually highly correlated to the query, implying the potential to harvest the labeling-free Web images and videos for video recognition. However, there are two key difficulties that prevent us from using the Web data directly. First, they are typically noisy and may be from a completely different domain from that of users' interest (e.g. cartoons). Second, Web videos are usually untrimmed and very lengthy, where some query-relevant frames are often hidden in between the irrelevant ones. A question thus naturally arises: to what extent can such noisy Web images and videos be utilized for labeling-free video recognition? In this paper, we propose a novel approach to mutually voting for relevant Web images and video frames, where two forces are balanced, i.e. aggressive matching and passive video frame selection. We validate our approach on three large-scale video recognition datasets.
机译:视频识别通常需要大量的训练样本,而这些样本的收集成本很高。另一种廉价的解决方案是从Web上提取大型图像和视频。使用现代搜索引擎,排名最高的图像或视频通常与查询高度相关,这意味着有可能收获无标签的Web图像和视频以进行视频识别。但是,有两个主要困难使我们无法直接使用Web数据。首先,它们通常很吵,可能来自与用户兴趣(例如卡通片)完全不同的域。其次,网络视频通常没有修饰且很冗长,其中一些与查询相关的帧通常隐藏在不相关的帧之间。因此自然产生了一个问题:在多大程度上可以将这种嘈杂的Web图像和视频用于无标签视频识别?在本文中,我们提出了一种新颖的方法来对相关Web图像和视频帧进行相互投票,这两种力量是均衡的,即积极匹配和被动视频帧选择。我们在三个大型视频识别数据集上验证了我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号