Journal of Visual Communication & Image Representation

Video action recognition based on visual rhythm representation



Abstract

Advances in video acquisition and storage technologies have created a great demand for the automatic recognition of actions. The use of cameras for security and surveillance purposes has applications in several scenarios, such as airports, parks, banks, stations, roads, hospitals, supermarkets, industrial facilities, stadiums, and schools. An inherent difficulty of the problem is the complexity of the scene under usual recording conditions, which may contain complex background and motion, multiple people in the scene, interactions with other actors or objects, and camera motion. The most recent databases are built primarily from recordings shared on YouTube and from movie snippets, settings in which these obstacles are not controlled. Another difficulty is the impact of the temporal dimension, since it expands the size of the data, increasing computational cost and storage space. In this work, we present a methodology for volume description using the Visual Rhythm (VR) representation. This technique reshapes the original video volume into an image, on which two-dimensional descriptors are computed. We investigated different strategies for constructing the representation by combining configurations in several image domains and traversing directions of the video frames. From this, we propose two feature extraction methods, Naive Visual Rhythm (Naive VR) and Visual Rhythm Trajectory Descriptor (VRTD). The first approach is the straightforward application of the technique to the original video volume, forming a holistic descriptor that treats action events as patterns and shapes in the visual rhythm image. The second variation focuses on the analysis of small neighborhoods obtained from the dense trajectory extraction process, which allows the algorithm to capture details missed by the global description. We tested our methods on eight public databases: one of hand gestures (SKIG), two in first person (DogCentric and JPL), and five in third person (Weizmann, KTH, MuHAVi, UCF11 and HMDB51). The results show that the developed techniques are able to extract motion elements along with shape and appearance information, achieving accuracy rates competitive with state-of-the-art action recognition approaches. (c) 2020 Elsevier Inc. All rights reserved.
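To make the construction of a Visual Rhythm image concrete, the sketch below (not the authors' code) shows the basic idea described in the abstract: each frame contributes one 1-D pixel line, and the lines are stacked over time into a single 2-D image on which ordinary 2-D descriptors can then be computed. The function name, the use of OpenCV/NumPy, and the choice of the central row or column as the sampled line are illustrative assumptions.

```python
import cv2
import numpy as np

def visual_rhythm(video_path: str, direction: str = "horizontal") -> np.ndarray:
    """Stack one pixel line per frame into a 2-D visual rhythm image."""
    cap = cv2.VideoCapture(video_path)
    slices = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        if direction == "horizontal":
            slices.append(gray[gray.shape[0] // 2, :])   # central row of the frame
        else:
            slices.append(gray[:, gray.shape[1] // 2])   # central column of the frame
    cap.release()
    return np.stack(slices, axis=0)                      # one row per frame: time runs vertically

# Example (paths and descriptor choice are placeholders):
# rhythm = visual_rhythm("some_video.avi")
# hog = cv2.HOGDescriptor()
# features = hog.compute(cv2.resize(rhythm, (64, 128)))  # any 2-D descriptor can be applied
```

In this reading, the Naive VR method would describe the whole rhythm image holistically, while VRTD would instead describe small neighborhoods around dense trajectories; both operate on 2-D data rather than on the full video volume.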
