首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition Workshops >Temporal Domain Neural Encoder for Video Representation Learning
【24h】

Temporal Domain Neural Encoder for Video Representation Learning

机译:用于视频表示学习的时间域神经编码器

获取原文

摘要

We address the challenge of learning good video repre-sentations by explicitly modeling the relationship between visual concepts in time space. We propose a novel Temporal Preserving Recurrent Neural Network (TPRNN) that extracts and encodes visual dynamics with frame-level features as input. The proposed network architecture captures temporal dynamics by keeping track of the ordinal relationship of co-occurring visual concepts, and constructs video representations with their temporal order patterns. The resultant video representations effectively encode temporal information of dynamic patterns, which makes them more discriminative to human actions performed with different sequences of action patterns. We evaluate the proposed model on several real video datasets, and the results show that it successfully outperforms the baseline models. In particular, we observe significant improvement on action classes that can only be distinguished by capturing the temporal orders of action patterns.
机译:通过显式建模在时间空间中的视觉概念之间的关系来解决学习良好视频代表代表的挑战。我们提出了一种新的时间保留复发性神经网络(TPRNN),其提取并用帧级别特征提取和编码视觉动态。所提出的网络架构通过跟踪共同发生的视觉概念的序序关系来捕获时间动态,并用其时间顺序模式构造视频表示。所得到的视频表示有效地编码动态模式的时间信息,这使得它们更辨别以不同的作用模式的不同序列执行的人类动作。我们在几个真实视频数据集中评估所提出的模型,结果表明它成功地优于基线模型。特别是,我们遵守对行动类的重大改进,只能通过捕获行动模式的时间顺序来区分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号