...
首页> 外文期刊>Computer vision and image understanding >Guess where? Actor-supervision for spatiotemporal action localization
【24h】

Guess where? Actor-supervision for spatiotemporal action localization

机译:猜猜是哪儿?时空动作定位的演员监督

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses the problem of spatiotemporal localization of actions in videos. Compared to leading approaches, which all learn to localize based on carefully annotated boxes on training video frames, we adhere to a solution only requiring video class labels. We introduce an actor-supervised architecture that exploits the inherent compositionality of actions in terms of actor transformations, to localize actions. We make two contributions. First, we propose actor proposals derived from a detector for human and non-human actors intended for images, which are linked over time by Siamese similarity matching to account for actor deformations. Second, we propose an actor-based attention mechanism enabling localization from action class labels and actor proposals. It exploits a new actor pooling operation and is end-to-end trainable. Experiments on four action datasets show actor supervision is state-of-the-art for action localization from video class labels and is even competitive to some box-supervised alternatives.
机译:本文解决了视频中动作的时空定位问题。与领先的方法相比,所有方法都基于训练视频帧上经过仔细注释的框来进行本地化,相比之下,我们坚持只需要视频类别标签的解决方案。我们介绍了一个参与者监督的体系结构,该体系结构根据参与者的转化来利用行为的固有组合性来对行为进行本地化。我们做出两个贡献。首先,我们提出从提议用于图像的人类和非人类演员检测器中得出的演员提议,这些提议随时间通过暹罗相似匹配进行链接,以解决演员的变形。其次,我们提出了一种基于行为者的注意力机制,可以从动作类标签和行为者建议中进行本地化。它利用了新的演员集合操作,并且可以进行端到端的培训。在四个动作数据集上进行的实验表明,演员监督是从视频类别标签进行动作定位的最新技术,甚至比某些盒式监督的替代方法更具竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号