IEEE Transactions on Multimedia

Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification


Abstract

Video-based person re-identification (re-id) is a central application in surveillance systems and a significant security concern. Matching persons across disjoint camera views from their video fragments is inherently challenging due to large visual variations and uncontrolled frame rates. Two steps are crucial to person re-id: discriminative feature learning and metric learning. However, existing approaches consider the two steps independently and do not make full use of the temporal and spatial information in the videos. In this paper, we propose a Siamese attention architecture that jointly learns spatiotemporal video representations and their similarity metrics. The network extracts local convolutional features from regions of each frame and enhances their discriminative capability by focusing on distinct regions when measuring the similarity with another pedestrian video. The attention mechanism is embedded into spatial gated recurrent units to selectively propagate relevant features and memorize their spatial dependencies through the network. The model essentially learns which parts (where) from which frames (when) are relevant and distinctive for matching persons, and attaches higher importance to them. The proposed Siamese model is end-to-end trainable, jointly learning comparable hidden representations for paired pedestrian videos and their similarity value. Extensive experiments on three benchmark datasets demonstrate the effectiveness of each component of the proposed deep network, which outperforms state-of-the-art methods.
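The core "where-and-when" idea — pooling each video's frame features with attention conditioned on the other video before comparing them — can be illustrated with a minimal, dependency-free sketch. All function names, the dot-product scoring, and the mean-feature query are illustrative assumptions for exposition; the paper's actual model uses convolutional features and spatial gated recurrent units, which this sketch does not reproduce.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def mean_feature(frames):
    """Simple summary of a video: the mean of its frame features."""
    n = len(frames)
    return [sum(f[i] for f in frames) / n for i in range(len(frames[0]))]

def attend(frames, query):
    """Temporal attention ('when to look'): weight each frame by its
    affinity to the other video's summary feature, then pool."""
    weights = softmax([dot(f, query) for f in frames])
    dim = len(frames[0])
    return [sum(w * f[i] for w, f in zip(weights, frames)) for i in range(dim)]

def siamese_distance(video_a, video_b):
    """Cross-attended pooled features compared with Euclidean distance.
    Each video is pooled conditioned on the other, mirroring the Siamese
    setup in which attention depends on the pair being compared."""
    qa, qb = mean_feature(video_a), mean_feature(video_b)
    fa = attend(video_a, qb)  # pool A, attending with respect to B
    fb = attend(video_b, qa)  # pool B, attending with respect to A
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))
```

Because the pooling of one video is conditioned on the other, the same video yields different pooled representations in different comparisons — the essence of pair-dependent attention, as opposed to pooling each video once in isolation.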
