首页> 外国专利> - RGB-D Multi-layer Residual Feature Fusion Network for Indoor Semantic Segmentation

- RGB-D Multi-layer Residual Feature Fusion Network for Indoor Semantic Segmentation

机译：-用于室内语义分割的RGB-D多层残差特征融合网络

页面导航

摘要
著录项
相似文献

摘要

The present invention relates to a network apparatus using one indoor color image and a depth image thereof to perform deep learning-based pixel unit semantic segmentation. According to the present invention, the network apparatus comprises: a first residual network module including a plurality of steps, which are connected from an upper step to a lower step and formed with a convolutional neural network (CNN) for each step, to extract gradual feature information from a color image; a second residual network module including a plurality of steps, which are connected from an upper step to a lower step and formed with a CNN for each step, to extract gradual feature information from a depth image; and a multimodal feature fusion network (MMFNet) module fusing the feature information extracted from each step of the first and second residual network modules. The MMFNet module is formed by sequentially connecting: a convolution block reducing dimensions of the feature information with respect to color and depth feature information extracted from a step corresponding to the first and second residual network modules for each step in order to smoothen a rapid increase of a parameter; two residual convolution units performing nonlinear modification for shape combination; and a convolution block adaptively combining feature information of different types and adjusting a scale of a feature value for addition. Moreover, scaled color feature information and scaled depth feature information are combined by addition. The present invention effectively extracts and combines various dimensions of feature information at the same time such that the feature information can be efficiently learned.

机译：网络设备技术领域本发明涉及一种使用一个室内彩色图像及其深度图像来执行基于深度学习的像素单元语义分割的网络设备。根据本发明，该网络设备包括：第一残余网络模块，其包括多个步骤，这些步骤从较高的步骤连接到较低的步骤并且针对每个步骤形成有卷积神经网络（CNN），以提取梯度彩色图像的特征信息;第二残留网络模块，其包括多个步骤，从上级步骤到下级步骤连接，并为每个步骤形成一个CNN，以从深度图像中提取渐变特征信息;多模式特征融合网络（MMFNet）模块，融合从第一和第二残余网络模块的每个步骤中提取的特征信息。 MMFNet模块是通过依次连接以下步骤形成的：卷积块，针对每个步骤从与第一和第二残留网络模块相对应的步骤中提取的颜色和深度特征信息方面，减少特征信息的维数，以平滑快速增加参数;两个残余卷积单元对形状组合进行非线性修改;卷积块自适应地组合不同类型的特征信息并调整特征值的大小以进行相加。此外，按比例将色彩特征信息和按比例深度特征信息相加。本发明有效地同时提取和组合特征信息的各个维度，从而可以有效地学习特征信息。

著录项

公开/公告号KR101970488B1

专利类型
公开/公告日2019-04-19

原文格式PDF
申请/专利权人 POSTECH ACADEMY-INDUSTRY FOUNDATION;
展开▼

申请/专利号KR20170182798
发明设计人 PARK SEONGJIN;HONG KISANG;LEE SEUNGYONG;
展开▼

申请日2017-12-28
分类号G06T7/10;G06T7/50;G06T7/90;
国家 KR
入库时间 2022-08-21 11:48:43

相似文献

专利
外文文献
中文文献