MultiNet++: Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning

机译：MultInet ++：多行学习的多流特征聚合和几何损耗策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-task learning is commonly used in autonomous driving for solving various visual perception tasks. It offers significant benefits in terms of both performance and computational complexity. Current work on multi-task learning networks focus on processing a single input image and there is no known implementation of multi-task learning handling a sequence of images. In this work, we propose a multi-stream multi-task network to take advantage of using feature representations from preceding frames in a video sequence for joint learning of segmentation, depth, and motion. The weights of the current and previous encoder are shared so that features computed in the previous frame can be leveraged without additional computation. In addition, we propose to use the geometric mean of task losses as a better alternative to the weighted average of task losses. The proposed loss function facilitates better handling of the difference in convergence rates of different tasks. Experimental results on KITTI, Cityscapes and SYNTHIA datasets demonstrate that the proposed strategies outperform various existing multi-task learning solutions.

机译：多任务学习通常用于自主驾驶，以解决各种视觉感知任务。它在性能和计算复杂性方面提供了显着的好处。当前关于多任务学习网络的工作专注于处理单个输入图像，并且没有已知实现多任务学习处理一系列图像的实现。在这项工作中，我们提出了一种多流多任务网络，以利用视频序列中的前面帧中的特征表示来利用用于共同学习分割，深度和运动的联合学习。共享当前和先前编码器的权重，从而可以利用在前一帧中计算的特征而无需额外计算。此外，我们建议使用任务损失的几何平均值作为任务损失的加权平均值的更好的替代方案。该损失功能有助于更好地处理不同任务的收敛率差异。基蒂，城市景观和合成足数据集的实验结果表明，拟议的策略优于各种现有的多任务学习解决方案。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops》|2019年|1 v.|共11页
会议地点
作者
Sumanth Chennupati; Ganesh Sistu; Senthil Yogamani; Samir A Rawashdeh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41;
关键词
Task analysis; Feature extraction; Decoding; Video sequences; Estimation; Complexity theory; Streaming media;

机译：任务分析;特征提取;解码;视频序列;估计;复杂性理论;流媒体;

相似文献

外文文献
中文文献
专利

1. 3MNet: Multi-task, multi-level and multi-channel feature aggregation network for salient object detection [J] . Xinghe Yan, Zhenxue Chen, Q. M. Jonathan Wu, Machine Vision and Applications . 2021,第2期

机译：3MNET：多任务，多级和多通道特征聚合网络，用于突出对象检测
2. An effective combination of loss gradients for multi-task learning applied on instance segmentation and depth estimation [J] . Angelica Tiemi Mizuno Nakamura, Valdir Grassi Jr., Denis Fernando Wolf Engineering Applications of Artificial Intelligence . 2021,第Apra期

机译：应用于实例分割和深度估计的多任务学习损失梯度的有效组合
3. Joint Decision of Anti-Spoofing and Automatic Speaker Verification by Multi-Task Learning With Contrastive Loss [J] . Li Jiakang, Sun Meng, Zhang Xiongwei, Quality Control, Transactions . 2020,第期

机译：多任务学习与对比损失的联合决定反欺骗和自动演讲者核查
4. MultiNet++: Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning [C] . Sumanth Chennupati, Ganesh Sistu, Senthil Yogamani, IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops . 2019

机译：MultiNet ++：多任务学习的多流特征聚合和几何损失策略
5. Ensemble feature selection for multi-stream automatic speech recognition. [D] . Gelbart, David. 2008

机译：集成特征选择，用于多流自动语音识别。
6. Crack Damage Detection Method via Multiple Visual Features and Efficient Multi-Task Learning Model [O] . Baoxian Wang, Weigang Zhao, Po Gao, 2018

机译：多种视觉特征和高效多任务学习模型的裂纹损伤检测方法
7. ACTNET: End-to-End Learning of Feature Activations and Multi-stream Aggregation for Effective Instance Image Retrieval [O] . Syed Sameed Husain, Eng-Jon Ong, Miroslaw Bober 2021

机译：ACTNET：有效实例检索的功能激活的端到端学习和多流聚合

MultiNet++: Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning

摘要

著录项

相似文献

相关主题

期刊订阅