Diabetes60 — Inferring Bread Units From Food Images Using Fully Convolutional Neural Networks

机译：Diabetes60 —使用完全卷积神经网络从食物图像推断面包单元

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose a challenging new computer vision task of inferring Bread Units (BUs) from food images. Assessing nutritional information and nutrient volume from a meal is an important task for diabetes patients. At the moment, diabetes patients learn the assessment of BUs on a scale of one to ten, by learning correspondence of BU and meals from textbooks. We introduce a large scale data set of around 9k different RGB-D images of 60 western dishes acquired using a Microsoft Kinect v2 sensor. We recruited 20 diabetes patients to give expert assessments of BU values to each dish based on several images. For this task, we set a challenging baseline using state-of-the-art CNNs and evaluated it against the performance of human annotators. In our work we present a CNN architecture to infer the depth from RGB-only food images to be used in BU regression such that the pipeline can operate on RGB data only and compare its performance to RGB-D input data. We show that our inferred depth maps from RGB images can replace RGB-D input data at high significance for the BU regression task. In its best configuration, our proposed method achieves a RMSE of 1.53 BUs using RGB and inferred depth. Considering the variability among the raters themselves of RMSE = 0.89, we can show that our baseline method with depth prediction can extract reasonable nutritional information from RGB image data only.

机译：在本文中，我们提出了一种挑战从食物图像推断面包单元（公共汽车）的新计算机视觉任务。评估膳食的营养信息和营养量是糖尿病患者的重要任务。目前，糖尿病患者通过从教科书的膳食的学习对应学习一至十的规模学习公共汽车的评估。我们介绍了使用Microsoft Kinect V2传感器获取的60个Western Dishes的大约9k不同RGB-D图像的大规模数据集。我们招募了20名糖尿病患者，基于几种图像对每个菜肴进行了专家评估。对于此任务，我们使用最先进的CNN设置了一个具有挑战性的基线，并评估了人类注释器的表现。在我们的工作中，我们提出了一种CNN架构，可从RGB的食物图像推断在BU回归中使用的深度，使得管道仅可以在RGB数据上运行并将其性能与RGB-D输入数据进行比较。我们表明，来自RGB图像的推断深度映射可以替换RGB-D输入数据对BU回归任务的高意义。在其最佳配置中，我们的提出方法使用RGB和推断深度实现了1.53总线的RMSE。考虑到RMSE = 0.89的评级人自己之间的变异性，我们可以表明我们的深度预测的基线方法只能从RGB图像数据中提取合理的营养信息。

著录项

来源
《IEEE International Conference on Computer Vision Workshops》|2017年|1526-1535|共10页
会议地点
作者
Patrick Ferdinand Christ; Sebastian Schlecht; Florian Ettlinger; Felix Grün; Christoph Heinle; Sunil Tatavatry; Seyed-Ahmad Ahmadi; Klaus Diepold; Bjoern H. Menze;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Diabetes; Three-dimensional displays; Cameras; Computer vision;

机译：糖尿病;三维显示;相机;计算机视觉;

相似文献

外文文献
中文文献
专利

1. Convolutional neural networks for relevance feedback in content based image retrieval A Content based image retrieval system that exploits convolutional neural networks both for feature extraction and for relevance feedback [J] . Lorenzo Putzu, Luca Piras, Giorgio Giacinto Multimedia Tools and Applications . 2020,第37a38期

机译：基于内容的图像检索的相关反馈的卷积神经网络基于内容的图像检索系统，用于利用特征提取和相关性反馈的卷积神经网络
2. Comparing fully convolutional networks, random forest, support vector machine, and patch-based deep convolutional neural networks for object-based wetland mapping using images from small unmanned aircraft system [J] . Liu Tao, Abd-Elrahman Amr, Morton Jon, GIScience & remote sensing . 2018,第2期

机译：比较全卷积网络，随机森林，支持向量机和基于补丁的深度卷积神经网络，使用来自小型无人机系统的图像进行基于对象的湿地映射
3. Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions [J] . Duc My Vo, Lee Sang-Woong Multimedia Tools and Applications . 2018,第14期

机译：使用具有多尺度图像和多尺度扩张卷积的全卷积神经网络进行语义图像分割
4. Diabetes60 - Inferring Bread Units From Food Images Using Fully Convolutional Neural Networks [C] . Patrick Ferdinand Christ, Sebastian Schlecht, Florian Ettlinger, IEEE International Conference on Computer Vision Workshops . 2017

机译：糖尿病60 - 使用完全卷积神经网络从食物图像推断面包单元
5. Combining Convolutional Neural Networks and Graph Neural Networks for Image Classification [D] . Trivedy, Vivek. 2021

机译：结合卷积神经网络和图形神经网络的图像分类
6. Inferring Drug-Related Diseases Based on Convolutional Neural Network and Gated Recurrent Unit [O] . Ping Xuan, Lianfeng Zhao, Tiangang Zhang, 2019

机译：基于卷积神经网络和门控循环单元的药物相关疾病推断
7. Inferring Emotion Tags from Object Images Using Convolutional Neural Network [O] . Anam Manzoor, Waqar Ahmad, Muhammad Ehatisham-ul-Haq, 2020

机译：使用卷积神经网络从物体图像推断情绪标签

Diabetes60 — Inferring Bread Units From Food Images Using Fully Convolutional Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅