IEEE Transactions on Visualization and Computer Graphics

VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification



Abstract

Multi-view deep neural networks are perhaps the most successful approach to 3D shape classification. However, fusing multi-view features by max or average pooling lacks a view selection mechanism, limiting its application in, e.g., multi-view active object recognition by a robot. This paper presents VERAM, a view-enhanced recurrent attention model capable of actively selecting a sequence of views for highly accurate 3D shape classification. VERAM addresses an important issue commonly found in existing attention-based models: the unbalanced training of the subnetworks responsible for next-view estimation and shape classification. The classification subnetwork is easily overfitted while the view estimation subnetwork is usually poorly trained, leading to suboptimal classification performance. This is surmounted by three essential view-enhancement strategies: 1) enhancing the information flow of gradient backpropagation for the view estimation subnetwork, 2) devising a highly informative reward function for the reinforcement training of view estimation, and 3) formulating a novel loss function that explicitly circumvents view duplication. Taking grayscale images as input and AlexNet as the CNN architecture, VERAM with 9 views achieves instance-level and class-level accuracies of 95.5 and 95.3 percent on ModelNet10, and 93.7 and 92.1 percent on ModelNet40, both state-of-the-art under the same number of views.
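The abstract outlines the overall architecture: a CNN encodes each attended view, a recurrent unit aggregates evidence across glimpses, and two subnetworks predict the shape class and the next view, trained with a classification loss, a reinforcement reward, and a term discouraging view duplication. The sketch below illustrates this kind of recurrent view-attention loop in PyTorch; it is not the authors' implementation, and the tiny stand-in CNN (the paper uses AlexNet), the correctness-based reward, and the simple overlap penalty are illustrative assumptions only.

```python
# Minimal sketch of a recurrent view-attention loop, assuming PyTorch.
# Not the authors' code: module names, layer sizes, the reward, and the
# duplication penalty are stand-ins chosen for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F


class RecurrentViewAttention(nn.Module):
    def __init__(self, num_views=12, num_classes=40, feat_dim=256):
        super().__init__()
        self.feat_dim = feat_dim
        # Per-view encoder for grayscale renderings (stand-in for AlexNet).
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=5, stride=2, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4), nn.Flatten(),
            nn.Linear(16 * 4 * 4, feat_dim), nn.ReLU(),
        )
        self.rnn = nn.GRUCell(feat_dim, feat_dim)
        self.classifier = nn.Linear(feat_dim, num_classes)  # shape classification subnetwork
        self.view_head = nn.Linear(feat_dim, num_views)     # next-view estimation subnetwork

    def forward(self, views, num_glimpses=3):
        # views: (batch, num_views, 1, H, W) grayscale renderings of each shape
        batch = views.size(0)
        h = views.new_zeros(batch, self.feat_dim)
        view_idx = torch.zeros(batch, dtype=torch.long, device=views.device)
        log_probs, view_dists = [], []
        for _ in range(num_glimpses):
            glimpse = views[torch.arange(batch), view_idx]   # attend to the current view
            h = self.rnn(self.encoder(glimpse), h)           # aggregate evidence over glimpses
            view_logits = self.view_head(h)
            dist = torch.distributions.Categorical(logits=view_logits)
            view_idx = dist.sample()                         # stochastic next-view selection
            log_probs.append(dist.log_prob(view_idx))
            view_dists.append(F.softmax(view_logits, dim=-1))
        return self.classifier(h), torch.stack(log_probs), torch.stack(view_dists)


def training_loss(class_logits, labels, log_probs, view_dists, dup_weight=0.1):
    # Classification loss + a REINFORCE surrogate rewarding view sequences that
    # end in a correct prediction + a soft penalty on overlapping view
    # distributions across steps (a crude proxy for avoiding duplicated views).
    cls_loss = F.cross_entropy(class_logits, labels)
    reward = (class_logits.argmax(dim=-1) == labels).float()      # (batch,)
    reinforce = -(log_probs * reward.unsqueeze(0)).mean()
    overlap = class_logits.new_zeros(())
    steps = view_dists.size(0)
    for i in range(steps):
        for j in range(i + 1, steps):
            overlap = overlap + (view_dists[i] * view_dists[j]).sum(dim=-1).mean()
    return cls_loss + reinforce + dup_weight * overlap


# Toy run: 8 shapes, each rendered into 12 grayscale views of 64x64 pixels.
model = RecurrentViewAttention(num_views=12, num_classes=40)
views, labels = torch.randn(8, 12, 1, 64, 64), torch.randint(0, 40, (8,))
logits, log_probs, view_dists = model(views, num_glimpses=3)
training_loss(logits, labels, log_probs, view_dists).backward()
```

Sampling the next view from a categorical distribution keeps the non-differentiable view selection trainable through the policy-gradient term, mirroring the reinforcement training of view estimation described in the abstract.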
