Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

Multi-View and Multi-Modal Action Recognition with Learned Fusion


Abstract

In this paper, we study a multi-modal and multi-view action recognition system based on deep-learning techniques. We extend the Temporal Segment Network with an additional data fusion stage that combines information from different sources. In this research, we use multiple types of information from different modalities, such as RGB, depth, and infrared data, to detect predefined human actions. We test various combinations of these data sources to examine their impact on the final detection accuracy. We design three information fusion methods to generate the final decision, the most interesting of which is our Learned Fusion Net. It turns out that the learned fusion structure achieves the best results but requires more training.
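The abstract does not spell out the three fusion methods, but a common reading is late fusion of the per-stream class scores produced by each modality (RGB, depth, infrared). The sketch below illustrates, under that assumption, two fixed fusion rules (averaging and element-wise maximum) alongside a minimal learned fusion in NumPy: a linear layer over the concatenated modality scores whose weights must be trained, which is consistent with the paper's remark that the learned variant needs more training. All names and the specific layer design here are hypothetical, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical per-modality class scores for one clip: RGB, depth, infrared.
NUM_CLASSES = 5
scores = [rng.normal(size=NUM_CLASSES) for _ in range(3)]

def average_fusion(modality_scores):
    """Fixed late fusion: average the per-modality class scores."""
    return np.mean(modality_scores, axis=0)

def max_fusion(modality_scores):
    """Fixed late fusion: take the element-wise maximum score."""
    return np.max(modality_scores, axis=0)

class LearnedFusion:
    """Learned late fusion (assumed design): a single linear layer over the
    concatenated modality scores. Unlike the fixed rules above, W and b are
    free parameters that must be trained, hence the extra training cost."""

    def __init__(self, num_modalities, num_classes, seed=0):
        r = np.random.default_rng(seed)
        self.W = r.normal(scale=0.01,
                          size=(num_classes, num_modalities * num_classes))
        self.b = np.zeros(num_classes)

    def __call__(self, modality_scores):
        x = np.concatenate(modality_scores)   # stack all modality scores
        return self.W @ x + self.b            # fused class scores

fused_avg = average_fusion(scores)
fused_max = max_fusion(scores)
fused_learned = LearnedFusion(num_modalities=3, num_classes=NUM_CLASSES)(scores)
print(fused_avg.shape, fused_max.shape, fused_learned.shape)
```

With trained weights, `LearnedFusion` can down-weight an unreliable modality per class, which averaging cannot do; that flexibility is the usual motivation for learning the fusion stage.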
