Spatial-temporal histograms of gradients and HOD-VLAD encoding for human action recognition

机译：梯度时空直方图和人类行为识别的HOD-VLAD编码

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic human action recognition is a core functionality of systems for video surveillance and human object interaction. In the whole recognition system, feature description and encoding represent two crucial key steps. In order to construct a powerful action recognition framework it is important that the two steps must provide reliable performance. In this paper, we proposed a new human action feature descriptor which is called spatial-temporal histograms of gradients (SPHOG). SPHOG is based on the spatial and temporal derivation signal, which extracts the gradient changes between consecutive frames. Compare to the traditional descriptors histograms of optical flow, our proposed SPHOG costs less computation resource. Vector of Locally Aggregated Descriptors (VLAD), which is a popular encoding approach for Bag-of-Feature representation. There is a main drawback of VLAD that it only considers the difference between local descriptor and their centroids. In order to resolve the weakness, we proposed a improved VLAD method called HOD-VLAD, which complementary the distribution information of local descriptors by computing a weight histograms of distance. We validated our proposed algorithm for human action recognition on three public available datasets KTH, UCF Sports and HMDB51. The evaluation experiment results indicate that the proposed descriptor and encoding method can improve the efficiency of human action recognition and the recognition accuracy.

机译：自动人体动作识别是用于视频监视和人体交互的系统的核心功能。在整个识别系统中，特征描述和编码代表两个关键的关键步骤。为了构建功能强大的动作识别框架，重要的是两个步骤必须提供可靠的性能。在本文中，我们提出了一种新的人类动作特征描述符，称为梯度时空直方图（SPHOG）。 SPHOG基于空间和时间推导信号，该信号提取连续帧之间的梯度变化。与传统的光流描述符直方图相比，我们提出的SPHOG花费更少的计算资源。局部聚集描述符向量（VLAD），这是功能包表示的一种流行编码方法。 VLAD的主要缺点是仅考虑局部描述符及其质心之间的差异。为了解决该缺点，我们提出了一种改进的VLAD方法，称为HOD-VLAD，该方法通过计算距离的权重直方图来补充局部描述符的分布信息。我们在三个公共可用数据集KTH，UCF Sports和HMDB51上验证了我们提出的用于人类动作识别的算法。评估实验结果表明，所提出的描述符和编码方法可以提高人体动作识别的效率和识别精度。

著录项

来源
《2017 International Conference on Security, Pattern Analysis, and Cybernetics》|2017年|678-683|共6页
会议地点 Shenzhen(CN)
作者
Bo Lin; Bin Fang;
展开▼
作者单位

Department of Computer Science, Chongqing University, Chongqing, China;

Department of Computer Science, Chongqing University, Chongqing, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Histograms; Encoding; Visualization; Feature extraction; Pattern recognition; Security; Pattern analysis;

机译：直方图;编码;可视化;特征提取;模式识别;安全性;模式分析;;

相似文献

外文文献
中文文献
专利

1. A new spatial-temporal histograms of gradients descriptor and HOD-VLAD encoding for human action recognition [J] . Lin Bo, Fang Bin International Journal of Wavelets, Multiresolution and Information Processing . 2019,第2期

机译：一种新的渐变描述符和人类行动识别编码的新空间 - 时间表和Hod-V层编码
2. A spatial-temporal framework based on histogram of gradients and optical flow for facial expression recognition in video sequences [J] . Fan Xijian, Tjahjadi Tardi Pattern Recognition: The Journal of the Pattern Recognition Society . 2015,第11期

机译：基于梯度直方图和光流的时空框架用于视频序列中的面部表情识别
3. Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information [J] . Duta Ionut C., Uijlings Jasper R. R., Ionescu Bogdan, Multimedia Tools and Applications . 2017,第21期

机译：使用运动梯度直方图和VLAD以及描述符形状信息进行有效的人类动作识别
4. Spatial-temporal histograms of gradients and HOD-VLAD encoding for human action recognition [C] . Bo Lin, Bin Fang International Conference on Security, Pattern Analysis, and Cybernetics . 2017

机译：人体行动识别的梯度和Hod-V层编码的空间 - 时间曲线图
5. The influence of signal level and temporary noise induced hearing loss on estimates of the post-stimulus time histogram and single fiber action potential derived from human compound action potentials. [D] . Lichtenhan, Jeffery T. 2007

机译：信号水平和暂时性噪声诱发的听力损失对刺激后时间直方图和源自人类复合动作电位的单纤维动作电位的估计值的影响。
6. Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences [O] . Chirag I. Patel, Dileep Labana, Sharnil Pandya, 2020

机译：基于面向梯度的特征融合的直方图用于行动视频序列中的人体行动识别
7. A spatial-temporal framework based on histogram of gradients and optical flow for facial expression recognition in video sequences [O] . Fan, Xijian, Tjahjadi, Tardi 2015

机译：基于梯度直方图和光流的时空框架用于视频序列中的面部表情识别

Spatial-temporal histograms of gradients and HOD-VLAD encoding for human action recognition

摘要

著录项

相似文献

相关主题

期刊订阅