High Performance Implementation of 3D Convolutional Neural Networks on a GPU

Abstract

Convolutional neural networks have proven highly successful in image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which involves 3D inputs and requires far more memory and computation. FFT-based methods can reduce the amount of computation, but generally at the cost of an increased memory requirement. The Winograd Minimal Filtering Algorithm (WMFA), on the other hand, reduces the number of operations required and thus speeds up the computation without increasing the required memory. This strategy has been shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks, apply it to a popular 3D convolutional neural network used for video classification, and compare it against cuDNN. For our highly optimized implementation of the algorithm, we observe a twofold speedup over the cuDNN version for most of the 3D convolution layers of our test network.
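
To make the operation-count argument concrete, here is a minimal NumPy sketch of Winograd minimal filtering for a single 3D tile. It assumes the common F(2,3) variant nested over three axes (a 2x2x2 output tile computed from a 4x4x4 input tile and a 3x3x3 filter); the abstract does not say which tile size the authors use, and this is not their GPU implementation. The function names are hypothetical. With these sizes the element-wise product takes 64 multiplications per tile, against 8 x 27 = 216 for direct computation.

```python
import numpy as np

# Standard F(2,3) Winograd transform matrices (1D case).
BT = np.array([[1, 0, -1, 0],
               [0, 1, 1, 0],
               [0, -1, 1, 0],
               [0, 1, 0, -1]], dtype=np.float64)   # input transform, 4x4
G = np.array([[1.0, 0.0, 0.0],
              [0.5, 0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0, 0.0, 1.0]], dtype=np.float64)  # filter transform, 4x3
AT = np.array([[1, 1, 1, 0],
               [0, 1, -1, -1]], dtype=np.float64)  # output transform, 2x4


def transform_3d(m, x):
    """Apply the matrix m along each of the three axes of x."""
    return np.einsum("ai,bj,ck,ijk->abc", m, m, m, x)


def winograd_3d_tile(d, g):
    """2x2x2 output tile from a 4x4x4 input tile d and a 3x3x3 filter g."""
    u = transform_3d(G, g)    # transformed filter, 4x4x4
    v = transform_3d(BT, d)   # transformed input, 4x4x4
    # The only data-dependent multiplications: 64 element-wise products.
    return transform_3d(AT, u * v)


def direct_3d_tile(d, g):
    """Reference result: sliding-window correlation, 216 multiplications."""
    out = np.zeros((2, 2, 2))
    for i in range(2):
        for j in range(2):
            for k in range(2):
                out[i, j, k] = np.sum(d[i:i + 3, j:j + 3, k:k + 3] * g)
    return out
```

In a full layer the filter transform can be computed once and reused across tiles, and the products are accumulated over input channels before the output transform; both are omitted here to keep the sketch short.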

Bibliographic details

Journal: Computational Intelligence and Neuroscience
Authors: Qiang Lan; Zelong Wang; Mei Wen; Chunyuan Zhang; Yijie Wang
Year (volume), issue: 2017 (2017), -1
Year: 2017
Pages: 8348671
Total pages: 8
Original format: PDF
CLC classification: Neuroscience

