...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
【24h】

Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

机译:Refipe1M +:用于学习跨莫代尔嵌入式烹饪食谱和食物图像的数据集

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, we introduce Recipe 1M+, a new large-scale, structured corpus of over one million cooking recipes and 13 million food images. As the largest publicly available collection of recipe data, Recipes 1M+ affords the ability to train high-capacity models on aligned, multimodal data. Using these data, we train a neural network to learn a joint embedding of recipes and images that yields impressive results on an image-recipe retrieval task. Moreover, we demonstrate that regularization via the addition of a high-level classification objective both improves retrieval performance to rival that of humans and enables semantic vector arithmetic. We postulate that these embeddings will provide a basis for further exploration of the Recipes 1M+ dataset and food and cooking in general. Code, data and models are publicly available.(1)
机译:在本文中,我们介绍了1M +的食谱,新的大规模,结构化毒品用量超过100万烹饪食谱和1300万食物图像。作为最大的配方数据收集,食谱1M +提供了能够在对齐的多模式数据上培训高容量模型。使用这些数据,我们训练一个神经网络,学习联合嵌入食谱和图像,在图像配方检索任务上产生令人印象深刻的结果。此外,我们证明了通过增加高级分类目标的正规化,既可以提高对人类的竞争力的检索性能,并实现语义矢量算术。我们假设这些嵌入式将为进一步探索1M +数据集和食品以及烹饪提供基础。代码,数据和模型是公开可用的。(1)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号