首页> 外文会议>IEEE International Conference on Computer Vision >Cross-Domain Image Retrieval with a Dual Attribute-Aware Ranking Network
【24h】

Cross-Domain Image Retrieval with a Dual Attribute-Aware Ranking Network

机译:具有双重属性感知排名网络的跨域图像检索

获取原文

摘要

We address the problem of cross-domain image retrieval, considering the following practical application: given a user photo depicting a clothing image, our goal is to retrieve the same or attribute-similar clothing items from online shopping stores. This is a challenging problem due to the large discrepancy between online shopping images, usually taken in ideal lighting/pose/background conditions, and user photos captured in uncontrolled conditions. To address this problem, we propose a Dual Attribute-aware Ranking Network (DARN) for retrieval feature learning. More specifically, DARN consists of two sub-networks, one for each domain, whose retrieval feature representations are driven by semantic attribute learning. We show that this attribute-guided learning is a key factor for retrieval accuracy improvement. In addition, to further align with the nature of the retrieval problem, we impose a triplet visual similarity constraint for learning to rank across the two subnetworks. Another contribution of our work is a large-scale dataset which makes the network learning feasible. We exploit customer review websites to crawl a large set of online shopping images and corresponding offline user photos with fine-grained clothing attributes, i.e., around 450,000 online shopping images and about 90,000 exact offline counterpart images of those online ones. All these images are collected from real-world consumer websites reflecting the diversity of the data modality, which makes this dataset unique and rare in the academic community. We extensively evaluate the retrieval performance of networks in different configurations. The top-20 retrieval accuracy is doubled when using the proposed DARN other than the current popular solution using pre-trained CNN features only (0.570 vs. 0.268).
机译:考虑到以下实际应用,我们解决了跨域图像检索的问题:给定描述衣物图像的用户照片,我们的目标是从在线商店中检索相同或属性相似的衣物。由于通常在理想的照明/姿势/背景条件下拍摄的在线购物图像与在不受控制的条件下拍摄的用户照片之间存在巨大差异,因此这是一个具有挑战性的问题。为了解决此问题,我们提出了一种用于识别特征学习的双重属性感知排名网络(DARN)。更具体地说,DARN由两个子网组成,每个子网一个域,其检索特征表示由语义属性学习驱动。我们表明,这种以属性为导向的学习是提高检索准确性的关键因素。另外,为了进一步与检索问题的性质保持一致,我们为学习跨两个子网的等级设置了三元组视觉相似性约束。我们工作的另一个贡献是使网络学习变得可行的大规模数据集。我们利用客户评论网站来抓取大量具有精细服装属性的在线购物图像和相应的脱机用户照片,即大约450,000个在线购物图像和这些在线图像的大约90,000个确切的离线对应图像。所有这些图像都是从现实生活中的消费者网站收集的,反映了数据模态的多样性,这使得该数据集在学术界是独一无二且罕见的。我们广泛评估了不同配置下的网络检索性能。当使用建议的DARN而不是仅使用预训练的CNN功能的当前流行解决方案时,前20个检索精度将翻倍(0.570对0.268)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号