Temporal Domain Neural Encoder for Video Representation Learning

机译：用于视频表示学习的时间域神经编码器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the challenge of learning good video repre-sentations by explicitly modeling the relationship between visual concepts in time space. We propose a novel Temporal Preserving Recurrent Neural Network (TPRNN) that extracts and encodes visual dynamics with frame-level features as input. The proposed network architecture captures temporal dynamics by keeping track of the ordinal relationship of co-occurring visual concepts, and constructs video representations with their temporal order patterns. The resultant video representations effectively encode temporal information of dynamic patterns, which makes them more discriminative to human actions performed with different sequences of action patterns. We evaluate the proposed model on several real video datasets, and the results show that it successfully outperforms the baseline models. In particular, we observe significant improvement on action classes that can only be distinguished by capturing the temporal orders of action patterns.

机译：通过显式建模在时间空间中的视觉概念之间的关系来解决学习良好视频代表代表的挑战。我们提出了一种新的时间保留复发性神经网络（TPRNN），其提取并用帧级别特征提取和编码视觉动态。所提出的网络架构通过跟踪共同发生的视觉概念的序序关系来捕获时间动态，并用其时间顺序模式构造视频表示。所得到的视频表示有效地编码动态模式的时间信息，这使得它们更辨别以不同的作用模式的不同序列执行的人类动作。我们在几个真实视频数据集中评估所提出的模型，结果表明它成功地优于基线模型。特别是，我们遵守对行动类的重大改进，只能通过捕获行动模式的时间顺序来区分。

著录项

来源
《IEEE Conference on Computer Vision and Pattern Recognition Workshops》|2017年|1561-2336p|共8页
会议地点
作者
Hao Hu; Zhaowen Wang; Joon-Young Lee; Zhe Lin; Guo-Jun Qi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词

相似文献

外文文献
中文文献
专利

1. Encoding a temporally structured stimulus with a temporally structured neural representation. [J] . Brown SL, Joseph J, Stopfer M Nature neuroscience . 2005,第11期

机译：用时间结构化的神经表示对时间结构化的刺激进行编码。
2. Domain-invariant representation learning using an unsupervised domain adversarial adaptation deep neural network [J] . Jia Xibin, Jin Ya, Su Xing, Neurocomputing . 2019,第AUGa25期

机译：使用无监督领域对抗性适应性深度神经网络的领域不变表示学习
3. Domain-invariant representation learning using an unsupervised domain adversarial adaptation deep neural network [J] . Jia Xibin, Jin Ya, Su Xing, Neurocomputing . 2019,第Auga25期

机译：域名不变的表示学习使用无监督的域对抗性适应深神经网络
4. Temporal Domain Neural Encoder for Video Representation Learning [C] . Hao Hu, Zhaowen Wang, Joon-Young Lee, IEEE Conference on Computer Vision and Pattern Recognition Workshops . 2017

机译：用于视频表示学习的时域神经编码器
5. Image-set, Temporal and Spatiotemporal Representations of Videos for Recognizing, Localizing and Quantifying Actions [D] . Xiang, Xiang. 2018

机译：用于识别，定位和量化动作的视频的图像集，时间和时空表示
6. Neural Encoding and Representation of Time for Sensorimotor Control and Learning [O] . Ramesh Balasubramaniam, Saskia Haegens, Mehrdad Jazayeri, 2021

机译：传感器控制和学习时间的神经编码和时间
7. Online spatio-temporal pattern recognition with evolving spiking neural networks utilising address event representation, rank order, and temporal spike learning [O] . Dhoble, K, Nuntalid, N, Indiveri, G, 2012

机译：利用地址事件表示，等级顺序和时间峰值学习的不断发展的尖峰神经网络进行在线时空模式识别

Temporal Domain Neural Encoder for Video Representation Learning

摘要

著录项

相似文献

相关主题

期刊订阅