Image Processing, IET

Dynamic gesture recognition based on feature fusion network and variant ConvLSTM



Abstract

Gesture is a natural form of human communication, and it is of great significance in human-computer interaction. In dynamic gesture recognition methods based on deep learning, the key is to obtain comprehensive gesture feature information. Aiming at the problems of inadequate extraction of spatiotemporal features and loss of feature information in current dynamic gesture recognition, a new gesture recognition architecture is proposed, which combines a feature fusion network with a variant convolutional long short-term memory (ConvLSTM). The architecture extracts spatiotemporal feature information at the local, global and deep levels, and uses feature fusion to alleviate the loss of feature information. First, local spatiotemporal feature information is extracted from the video sequence by a 3D residual network based on channel feature fusion. Then the authors use the variant ConvLSTM to learn the global spatiotemporal information of the dynamic gesture, introducing an attention mechanism that modifies the gate structure of the ConvLSTM. Finally, a multi-feature fusion depthwise separable network is used to learn higher-level features, including depth feature information. The proposed approach obtains very competitive performance on the Jester dataset, with a classification accuracy of 95.59%, and achieves state-of-the-art performance with 99.65% accuracy on the SKIG (Sheffield Kinect Gesture) dataset.
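For reference, the standard ConvLSTM cell on which the variant builds is well established; a sketch of its update equations (without peephole connections), where $*$ denotes convolution, $\circ$ the Hadamard product, $X_t$ the input frame features, and $H_t$, $C_t$ the hidden and cell states:

```latex
\begin{aligned}
i_t &= \sigma\!\left(W_{xi} * X_t + W_{hi} * H_{t-1} + b_i\right) \\
f_t &= \sigma\!\left(W_{xf} * X_t + W_{hf} * H_{t-1} + b_f\right) \\
o_t &= \sigma\!\left(W_{xo} * X_t + W_{ho} * H_{t-1} + b_o\right) \\
C_t &= f_t \circ C_{t-1} + i_t \circ \tanh\!\left(W_{xc} * X_t + W_{hc} * H_{t-1} + b_c\right) \\
H_t &= o_t \circ \tanh\!\left(C_t\right)
\end{aligned}
```

The abstract does not specify the exact form of the authors' attention-modified gates; one common pattern, stated here only as an assumption, is to reweight $X_t$ and $H_{t-1}$ with a learned attention map before the gate convolutions, so the gates focus on the spatial regions containing the hand.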
