Guess where? Actor-supervision for spatiotemporal action localization

Victor Escorcia; Cuong D. Dao; Mihir Jain; Bernard Ghanem; Cees Snoek

Abstract

This paper addresses the problem of spatiotemporal localization of actions in videos. Compared to leading approaches, which all learn to localize based on carefully annotated boxes on training video frames, we adhere to a solution only requiring video class labels. We introduce an actor-supervised architecture that exploits the inherent compositionality of actions in terms of actor transformations, to localize actions. We make two contributions. First, we propose actor proposals derived from a detector for human and non-human actors intended for images, which are linked over time by Siamese similarity matching to account for actor deformations. Second, we propose an actor-based attention mechanism enabling localization from action class labels and actor proposals. It exploits a new actor pooling operation and is end-to-end trainable. Experiments on four action datasets show actor supervision is state-of-the-art for action localization from video class labels and is even competitive to some box-supervised alternatives.
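The idea of localizing from video class labels alone through attention over actor proposals can be illustrated with a small sketch. This is not the authors' implementation: it assumes each linked actor proposal (a "tube") has already been reduced to a fixed-length feature vector, and the names `attention_pool`, `w_att`, and `w_cls` are hypothetical. The attention weights are trained only through the video-level class prediction, and at test time the tube with the highest weight serves as the localization guess.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention_pool(tube_features, w_att, w_cls):
    """Pool per-tube features into one video-level prediction.

    tube_features: (num_tubes, dim) array, one row per actor tube (assumed given).
    w_att: (dim,) hypothetical attention parameters.
    w_cls: (dim, num_classes) hypothetical classifier parameters.
    Returns (class_probs, attention).
    """
    att_logits = tube_features @ w_att             # one score per tube
    attention = softmax(att_logits, axis=0)        # weights over tubes, sum to 1
    video_feature = attention @ tube_features      # attention-weighted average, (dim,)
    class_probs = softmax(video_feature @ w_cls)   # video-level class distribution
    return class_probs, attention

# Toy example with random features and parameters (illustrative only).
rng = np.random.default_rng(0)
tube_features = rng.normal(size=(5, 8))   # 5 actor tubes, 8-dim features
w_att = rng.normal(size=8)
w_cls = rng.normal(size=(8, 3))           # 3 action classes

class_probs, attention = attention_pool(tube_features, w_att, w_cls)
best_tube = int(np.argmax(attention))     # tube chosen as the localization
```

In a trainable version, a cross-entropy loss on `class_probs` against the video label would update both parameter sets, so no box annotation is needed during training.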

Bibliographic details

Source
Computer Vision and Image Understanding, 2020, No. 3, pp. 102886.1-102886.11 (11 pages)
Authors
Victor Escorcia; Cuong D. Dao; Mihir Jain; Bernard Ghanem; Cees Snoek
Author affiliations

King Abdullah University of Science and Technology Thuwal 23955 Saudi Arabia;

Qualcomm AI Research Qualcomm Technologies Netherlands B.V. 1098 XH Amsterdam Netherlands;

University of Amsterdam 1012 WX Amsterdam Netherlands;

Original format: PDF
Language: English
Keywords
Actor-supervision; Spatiotemporal action localization; Action understanding; Video analysis; Weakly-supervised;

