Motion-Based Occlusion-Aware Pixel Graph Network for Video Object Segmentation

Abstract

This paper proposes a dual-channel Graph Convolutional Network (GCN) for the Video Object Segmentation (VOS) task. The main contribution lies in formulating two pixel graphs based on raw RGB and optical-flow features. Spatial and temporal features are learned independently, making the network robust to various challenging scenarios in real-world videos. Additionally, a motion orientation-based aggregator scheme efficiently captures long-range dependencies among objects. This not only addresses the complex issue of modelling velocity differences among multiple objects moving in various directions, but also adapts to changes in object appearance caused by pose and scale deformations. An occlusion-aware attention mechanism is further employed to enable accurate segmentation when multiple objects exhibit temporal discontinuity in appearance due to occlusion. Performance analysis on the DAVIS-2016 and DAVIS-2017 datasets shows that the proposed method outperforms existing state-of-the-art techniques in foreground segmentation of objects in videos. Control experiments on the CamVid dataset demonstrate the model's generalising capability for scene segmentation.
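The dual-channel pixel-graph idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the graph construction (k-nearest-neighbour adjacency in feature space), feature dimensions, and random weights below are all illustrative assumptions; only the overall pattern — one graph convolution over an appearance (RGB) graph, one over a motion (optical-flow) graph, then fusion into per-pixel scores — follows the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setting (hypothetical): a 4x4 frame flattened into 16 pixel nodes.
N = 16
rgb_feats = rng.standard_normal((N, 3))    # per-pixel RGB features
flow_feats = rng.standard_normal((N, 2))   # per-pixel optical flow (u, v)

def knn_adjacency(X, k=4):
    """Connect each pixel node to its k nearest neighbours in feature space."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(d, np.inf)
    A = np.zeros((len(X), len(X)))
    for i in range(len(X)):
        A[i, np.argsort(d[i])[:k]] = 1.0
    return np.maximum(A, A.T)  # symmetrize

def normalize(A):
    """Symmetric GCN normalization: D^{-1/2} (A + I) D^{-1/2}."""
    A_hat = A + np.eye(len(A))
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_layer(X, A_norm, W):
    """One graph-convolution layer with ReLU."""
    return np.maximum(A_norm @ X @ W, 0.0)

# Channel 1: spatial (appearance) pixel graph from RGB features.
A_rgb = normalize(knn_adjacency(rgb_feats))
H_rgb = gcn_layer(rgb_feats, A_rgb, rng.standard_normal((3, 8)))

# Channel 2: temporal (motion) pixel graph from optical-flow features.
A_flow = normalize(knn_adjacency(flow_feats))
H_flow = gcn_layer(flow_feats, A_flow, rng.standard_normal((2, 8)))

# Fuse the two channels and score each pixel as foreground/background.
H = np.concatenate([H_rgb, H_flow], axis=1)    # (16, 16) fused embedding
logits = H @ rng.standard_normal((16, 1))
mask = (logits > 0).astype(int).reshape(4, 4)  # toy binary segmentation mask
```

Keeping the two graphs separate until the fusion step mirrors the abstract's point that spatial and temporal features are learned independently; the paper's motion orientation-based aggregator and occlusion-aware attention would replace the simple concatenation used here.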
