首页> 外文学位 >Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning.

【24h】

Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning.

机译：走向场景理解：对象检测，分割和上下文推理。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Scene understanding is one of the holy grails of computer vision. Despite decades of research on scene understanding, it is still considered an unsolved problem. The difficulty arises mainly because of the huge space of possible images. We require models to capture this variability of scenes and their constituents (e.g., objects) given the limited memory resources. Additionally, we require efficient learning and inference techniques for our models to find the optimal solution in the enormous space of possible solutions.;In this thesis, we propose a set of novel techniques for object detection, segmentation, and contextual reasoning and take a further step towards the ultimate goal of holistic scene understanding. In particular, we propose a compositional method for representing objects and show inference can be performed for an exponential number of objects in linear time. Subsequently, we propose a series of discriminative learning methods for object detection and segmentation and show that our methods achieve the state-of-the-art performance on difficult benchmarks in the computer vision community. Finally, through a series of hybrid human-machine experiments, we try to identify bottlenecks in scene understanding to better guide future research efforts in this area.

机译：场景理解是计算机视觉的圣地之一。尽管对场景理解进行了数十年的研究，但仍被认为是尚未解决的问题。出现困难的主要原因是可能的图像空间巨大。在内存资源有限的情况下，我们需要使用模型来捕获场景及其组成部分（例如对象）的这种可变性。此外，我们还需要对模型进行有效的学习和推理，以在可能的解决方案的巨大空间中找到最佳解决方案。本文提出了一套用于对象检测，分割和上下文推理的新技术，并进一步进行了研究。向整体场景理解的最终目标迈进。特别地，我们提出了一种用于表示对象的合成方法，并表明可以在线性时间内对指数数量的对象执行推理。随后，我们提出了一系列用于对象检测和分割的判别性学习方法，并表明我们的方法在计算机视觉社区的困难基准上达到了最先进的性能。最后，通过一系列混合人机实验，我们尝试确定场景理解中的瓶颈，以更好地指导该领域的未来研究工作。

著录项

作者
Mottaghi, Roozbeh.;
展开▼
作者单位

University of California, Los Angeles.;

展开▼
授予单位 University of California, Los Angeles.;
学科 Computer Science.
学位 Ph.D.
年度 2013
页码 177 p.
总页数 177
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. 基于联合定位和上下文推理的深度密集描述方法 [J] . 孔锐, 谢玮中南大学学报（英文版） . 2021,第009期
2. Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation [J] . Gupta Saurabh, Arbelaez Pablo, Girshick Ross, International Journal of Computer Vision . 2015,第2期

机译：使用RGB-D图像了解室内场景：自底向上分割，对象检测和语义分割
3. Multiband Image Segmentation and Object Recognition for Understanding Road Scenes [J] . Kang Y., Yamaguchi K., Naito T., Intelligent Transportation Systems, IEEE Transactions on . 2011,第4期

机译：用于了解道路场景的多波段图像分割和目标识别
4. Bag of Contextual-Visual Words for Road Scene Object Detection From Mobile Laser Scanning Data [J] . Yongtao Yu, Jonathan Li, Haiyan Guan, IEEE Transactions on Intelligent Transportation Systems . 2016,第12期

机译：从移动激光扫描数据中检测道路场景目标的上下文视觉单词包
5. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation [C] . Yao Jian Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on . 2012

机译：整体描述场景：联合对象检测，场景分类和语义分割
6. A Quest for Visual Commonsense: Scene Understanding by Functional and Physical Reasoning. [D] . Zhao, Yibiao. 2015

机译：对视觉常识的追求：通过功能和物理推理进行场景理解。
7. Exploring the role of gaze behavior and object detection in scene understanding [O] . Kiwon Yun, Yifan Peng, Dimitris Samaras, 2013

机译：探索凝视行为和物体检测在场景理解中的作用
8. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation [O] . Jian Yao, Sanja Fidler, Raquel Urtasun 2013

机译：将场景描述为一个整体：联合对象检测，场景分类和语义分割

Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning.

摘要

著录项

相似文献

相关主题

期刊订阅