首页> 外文会议>European conference on computer vision >Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames

【24h】

Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames

机译：通过相互表决相关的Web图像和Web视频帧来进行Web监督的视频识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Video recognition usually requires a large amount of training samples, which are expensive to be collected. An alternative and cheap solution is to draw from the large-scale images and videos from the Web. With modern search engines, the top ranked images or videos are usually highly correlated to the query, implying the potential to harvest the labeling-free Web images and videos for video recognition. However, there are two key difficulties that prevent us from using the Web data directly. First, they are typically noisy and may be from a completely different domain from that of users' interest (e.g. cartoons). Second, Web videos are usually untrimmed and very lengthy, where some query-relevant frames are often hidden in between the irrelevant ones. A question thus naturally arises: to what extent can such noisy Web images and videos be utilized for labeling-free video recognition? In this paper, we propose a novel approach to mutually voting for relevant Web images and video frames, where two forces are balanced, i.e. aggressive matching and passive video frame selection. We validate our approach on three large-scale video recognition datasets.

机译：视频识别通常需要大量的训练样本，而这些样本的收集成本很高。另一种廉价的解决方案是从Web上提取大型图像和视频。使用现代搜索引擎，排名最高的图像或视频通常与查询高度相关，这意味着有可能收获无标签的Web图像和视频以进行视频识别。但是，有两个主要困难使我们无法直接使用Web数据。首先，它们通常很吵，可能来自与用户兴趣（例如卡通片）完全不同的域。其次，网络视频通常没有修饰且很冗长，其中一些与查询相关的帧通常隐藏在不相关的帧之间。因此自然产生了一个问题：在多大程度上可以将这种嘈杂的Web图像和视频用于无标签视频识别？在本文中，我们提出了一种新颖的方法来对相关Web图像和视频帧进行相互投票，这两种力量是均衡的，即积极匹配和被动视频帧选择。我们在三个大型视频识别数据集上验证了我们的方法。

著录项

来源
《European conference on computer vision》|2016年|849-866|共18页
会议地点
作者
Chuang Gan; Chen Sun; Lixin Duan; Boqing Gong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Localizing relevant frames in web videos using topic model and relevance filtering [J] . Haojie Li, Lei Yi, Bin Liu, Machine Vision and Applications . 2014,第7期

机译：使用主题模型和相关性过滤对网络视频中的相关帧进行本地化
2. Recognizing key segments of videos for video annotation by learning from web image sets [J] . Song Hao, Wu Xinxiao, Liang Wei, Multimedia Tools and Applications . 2017,第5期

机译：通过从Web图像集中学习识别视频的关键片段以进行视频注释
3. Localizing web videos using social images [J] . Cao Liujuan, Liu Xian-Ming, Liu Wei, Information Sciences: An International Journal . 2015,第Null期

机译：使用社交图像本地化网络视频
4. Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames [C] . Chuang Gan, Chen Sun, Lixin Duan, European Conference on Computer Vision . 2016

机译：通过相互投票对相关的网络图像和Web视频帧进行扫视监督的视频识别
5. Celebrating and Discussing the Queerly Masculine: Hollywood Superheroes Reimagined in Fan Videos on Chinese Barrage Video Websites. [D] . Gu, Jingyi. 2017

机译：庆祝和讨论酷男主义：在中国弹幕视频网站的粉丝视频中重新想象的好莱坞超级英雄。
6. Testing the Effects of the Addition of Videos to a Website Promoting Environmental Breast Cancer Risk Reduction Practices: Are Videos Worth It? [O] . Evan K. Perrault, Kami J. Silk -1

机译：测试将视频添加到网站上的效果以促进降低环境乳腺癌风险的做法：视频值得吗？
7. Omni-Sourced Webly-Supervised Learning for Video Recognition [O] . Haodong Duan, Yue Zhao, Yuanjun Xiong, 2020

机译：全源摩擦解视频识别学习

Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames

摘要

著录项

相似文献

相关主题

期刊订阅