IEEE Transactions on Industrial Informatics

Efficient Outdoor Video Semantic Segmentation Using Feedback-Based Fully Convolution Neural Network



Abstract

In this article, we focus on the problem of efficient semantic segmentation from sequential two-dimensional images, in which every pixel is classified into one of a set of classes for scene understanding. The problem is challenging because it involves constraints on both spatial and temporal consistency, which are difficult to determine explicitly as structural constraints. Traditionally, such a problem is tackled with a structured prediction method such as the conditional random field (CRF). However, pure CRF methods suffer from very high complexity in computing high-order potentials and from slow inference, which makes them unsuitable for efficient video segmentation in real scenarios. In this article, a novel feedback-based deep fully convolutional neural network (CNN) is proposed that inherently incorporates spatial context by appending an output feedback mechanism. The proposed method has the following contributions: 1) spatial context in images is easily captured through iterative feedback refinement, without an expensive postprocessing step such as CRF refinement; 2) it is easily integrated with generic deep CNN structures; and 3) the inference time is greatly reduced for efficient image segmentation. Compared to current state-of-the-art methods, our proposed method provides up to 14% better accuracy on the semantic segmentation task on the challenging CamVid and Cityscapes datasets, while taking up to a relative 980% shorter inference time. The proposed method also shows its effectiveness for the real-time road detection task in autonomous driving.
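To make the output-feedback idea described in the abstract concrete, the following is a minimal PyTorch sketch of one way such a network could be organized: the soft segmentation predicted in the previous pass is concatenated with the input image and the network is run again for a few refinement iterations. This is an illustrative assumption, not the authors' implementation; the class FeedbackFCN, its layer sizes, and the number of feedback iterations are all hypothetical.

```python
# Minimal sketch of a feedback-based fully convolutional segmentation network.
# Illustrative assumption of the general idea (previous output fed back and
# refined iteratively), not the authors' code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class FeedbackFCN(nn.Module):
    def __init__(self, in_channels=3, num_classes=12, feedback_iters=3):
        super().__init__()
        self.num_classes = num_classes
        self.feedback_iters = feedback_iters
        # Encoder takes the image plus the previous soft segmentation as input.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels + num_classes, 64, 3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 3, stride=2, padding=1),
            nn.ReLU(inplace=True),
        )
        # Decoder upsamples back to the input resolution and predicts class scores.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(64, num_classes, 4, stride=2, padding=1),
        )

    def forward(self, x):
        n, _, h, w = x.shape
        # Start with a uniform (uninformative) feedback map.
        feedback = torch.full((n, self.num_classes, h, w),
                              1.0 / self.num_classes, device=x.device)
        logits = None
        for _ in range(self.feedback_iters):
            # Concatenate the image with the previous soft prediction.
            inp = torch.cat([x, feedback], dim=1)
            logits = self.decoder(self.encoder(inp))
            # Feed the softened prediction back in for the next refinement pass.
            feedback = F.softmax(logits, dim=1)
        return logits


if __name__ == "__main__":
    model = FeedbackFCN(num_classes=12, feedback_iters=3)
    dummy = torch.randn(1, 3, 64, 64)   # e.g., a small CamVid-like crop
    out = model(dummy)
    print(out.shape)                    # torch.Size([1, 12, 64, 64])
```

In this sketch the feedback map starts as a uniform distribution over classes and is replaced by the softmax of the previous prediction at each pass, which is the mechanism through which earlier outputs inject spatial context into later refinements.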
