International Conference on Graphic and Image Processing

A Fusion Network for Semantic Segmentation Using RGB-D Data



Abstract

Semantic scene parsing is important in many intelligent fields, including perceptual robotics. Over the past few years, pixel-wise prediction tasks such as semantic segmentation on RGB images have been extensively studied and have reached remarkable parsing levels, thanks to convolutional neural networks (CNNs) and large scene datasets. With the development of stereo cameras and RGB-D sensors, additional depth information is expected to help improve accuracy. In this paper, we propose a semantic segmentation framework that incorporates RGB and complementary depth information. Motivated by the success of fully convolutional networks (FCNs) in semantic segmentation, we design a fully convolutional network consisting of two branches that extract features from RGB and depth data simultaneously and fuse them as the network goes deeper. Instead of aggregating multiple models, our goal is to utilize RGB and depth data more effectively in a single model. We evaluate our approach on the NYU-Depth V2 dataset, which consists of 1449 cluttered indoor scenes, and achieve results competitive with state-of-the-art methods.
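The abstract describes a two-branch FCN that fuses RGB and depth features progressively rather than ensembling separate models, but gives no layer configuration. The PyTorch sketch below is one plausible reading of that design, not the authors' exact architecture: the channel widths, element-wise-sum fusion, and bilinear upsampling decoder are all illustrative assumptions.

```python
# Minimal sketch (assumptions noted above) of a two-branch fully
# convolutional network that fuses depth features into the RGB stream
# at each encoder stage, i.e. "as the network goes deeper".
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    """Two 3x3 convolutions followed by 2x spatial downsampling."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.MaxPool2d(2),
    )

class FusionFCN(nn.Module):
    def __init__(self, num_classes, depth_channels=1):
        super().__init__()
        chans = [64, 128, 256]  # assumed widths, for illustration only
        self.rgb_blocks = nn.ModuleList()
        self.depth_blocks = nn.ModuleList()
        in_rgb, in_d = 3, depth_channels
        for c in chans:
            self.rgb_blocks.append(conv_block(in_rgb, c))
            self.depth_blocks.append(conv_block(in_d, c))
            in_rgb, in_d = c, c
        # 1x1 classifier over the fused features.
        self.classifier = nn.Conv2d(chans[-1], num_classes, 1)

    def forward(self, rgb, depth):
        x_rgb, x_d = rgb, depth
        for rgb_block, depth_block in zip(self.rgb_blocks, self.depth_blocks):
            x_rgb = rgb_block(x_rgb)
            x_d = depth_block(x_d)
            # Progressive fusion: merge the depth stream into the RGB
            # stream at every stage (element-wise sum is an assumption).
            x_rgb = x_rgb + x_d
        logits = self.classifier(x_rgb)
        # Upsample per-pixel predictions back to the input resolution.
        return nn.functional.interpolate(
            logits, size=rgb.shape[-2:], mode="bilinear", align_corners=False)

# Usage example: 40 classes, the common NYU-Depth V2 labeling protocol.
model = FusionFCN(num_classes=40)
out = model(torch.randn(1, 3, 240, 320), torch.randn(1, 1, 240, 320))
print(out.shape)  # torch.Size([1, 40, 240, 320])
```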
