Deep Mixture of Diverse Experts for Large-Scale Visual Recognition

Zhao Tianyi; Chen Qiuyu; Kuang Zhenzhong; Yu Jun; Zhang Wei; Fan Jianping


Abstract

In this paper, a deep mixture of diverse experts algorithm is developed to achieve more efficient learning of a huge (mixture) network for large-scale visual recognition applications. First, a two-layer ontology is constructed to assign large numbers of atomic object classes to a set of task groups according to the similarities of their learning complexities, where a certain degree of inter-group task overlap is allowed to enable sufficient inter-group message passing. Second, one base deep CNN with M + 1 outputs is learned for each task group to recognize its M atomic object classes and identify one special "not-in-group" class, where the network structure (number of layers and number of units in each layer) of a well-designed deep CNN (such as AlexNet, VGG, GoogLeNet, or ResNet) is directly used to configure each base deep CNN. To enhance the separability of the atomic object classes in the same task group, two approaches are developed to learn more discriminative base deep CNNs: (a) our deep multi-task learning algorithm, which can effectively exploit the inter-class visual similarities; (b) our two-layer network cascade approach, which can improve the accuracy rates for the hard object classes to a certain degree while effectively maintaining the high accuracy rates for the easy ones. Finally, all these complementary base deep CNNs with diverse but overlapping outputs are seamlessly combined to generate a mixture network with a larger output space for recognizing tens of thousands of atomic object classes. Our experimental results demonstrate that our deep mixture of diverse experts algorithm can achieve very competitive results on large-scale visual recognition.
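
The abstract does not say how the (M + 1)-way outputs of the base deep CNNs are fused, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes each expert emits M + 1 logits whose last entry is the "not-in-group" class, and that a class shared by several overlapping task groups is scored by averaging the probabilities from the experts that cover it. The names softmax, combine_experts, expert_logits, and expert_classes are hypothetical and chosen for illustration.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combine_experts(
    expert_logits: Sequence[Sequence[float]],
    expert_classes: Sequence[Sequence[int]],
    num_classes: int,
) -> List[float]:
    """Fuse per-group (M + 1)-way outputs into one score per global class.

    expert_logits[g] has len(expert_classes[g]) + 1 entries; the last entry
    is the "not-in-group" logit. Averaging over the experts that cover a
    class is an illustrative assumption, not the rule from the paper.
    """
    sums = [0.0] * num_classes
    counts = [0] * num_classes
    for logits, classes in zip(expert_logits, expert_classes):
        if len(logits) != len(classes) + 1:
            raise ValueError("each expert needs M + 1 logits for M classes")
        probs = softmax(logits)
        # Skip the final "not-in-group" probability: it belongs to no
        # global class, so the in-group scores of this expert sum to < 1.
        for class_id, p in zip(classes, probs[:-1]):
            sums[class_id] += p
            counts[class_id] += 1
    # Classes covered by no expert get a score of 0.0.
    return [s / c if c else 0.0 for s, c in zip(sums, counts)]


# Two overlapping task groups over 5 global classes; class 2 is shared.
groups = [[0, 1, 2], [2, 3, 4]]
logits = [
    [0.1, 0.2, 3.0, 0.0],  # expert 0: highest logit on its third slot (class 2)
    [2.5, 0.1, 0.0, 0.3],  # expert 1: highest logit on its first slot (class 2)
]
scores = combine_experts(logits, groups, num_classes=5)
predicted = max(range(len(scores)), key=scores.__getitem__)
```

In this toy setup both experts place most of their probability on the shared class, so it receives the highest fused score. A real system would replace the hand-written logits with the outputs of trained base CNNs, and could weight experts by their "not-in-group" confidence instead of averaging uniformly.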


Bibliographic details

Source
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019, No. 5, pp. 1072-1087 (16 pages)
Authors
Zhao Tianyi; Chen Qiuyu; Kuang Zhenzhong; Yu Jun; Zhang Wei; Fan Jianping;
Author affiliations

Univ North Carolina Charlotte Dept Comp Sci Charlotte NC 28223 USA;

Univ North Carolina Charlotte Dept Comp Sci Charlotte NC 28223 USA;

Hangzhou Dianzi Univ Sch Comp Sci Hangzhou 310018 Zhejiang Peoples R China|UNC Charlotte Charlotte NC 28223 USA;

Hangzhou Dianzi Univ Sch Comp Sci Hangzhou 310018 Zhejiang Peoples R China|UNC Charlotte Charlotte NC 28223 USA;

UNC Charlotte Charlotte NC 28223 USA|Fudan Univ Sch Comp Sci Shanghai 200433 Peoples R China;

Univ North Carolina Charlotte Dept Comp Sci Charlotte NC 28223 USA;

Indexing information
Original format: PDF
Language: English
Chinese Library Classification
Keywords
Deep mixture of diverse experts; base deep CNNs; mixture network; deep multi-task learning; large-scale visual recognition;


