首页> 外文学位 >Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning.
【24h】

Towards Scene Understanding: Object Detection, Segmentation, and Contextual Reasoning.

机译:走向场景理解:对象检测,分割和上下文推理。

获取原文
获取原文并翻译 | 示例

摘要

Scene understanding is one of the holy grails of computer vision. Despite decades of research on scene understanding, it is still considered an unsolved problem. The difficulty arises mainly because of the huge space of possible images. We require models to capture this variability of scenes and their constituents (e.g., objects) given the limited memory resources. Additionally, we require efficient learning and inference techniques for our models to find the optimal solution in the enormous space of possible solutions.;In this thesis, we propose a set of novel techniques for object detection, segmentation, and contextual reasoning and take a further step towards the ultimate goal of holistic scene understanding. In particular, we propose a compositional method for representing objects and show inference can be performed for an exponential number of objects in linear time. Subsequently, we propose a series of discriminative learning methods for object detection and segmentation and show that our methods achieve the state-of-the-art performance on difficult benchmarks in the computer vision community. Finally, through a series of hybrid human-machine experiments, we try to identify bottlenecks in scene understanding to better guide future research efforts in this area.
机译:场景理解是计算机视觉的圣地之一。尽管对场景理解进行了数十年的研究,但仍被认为是尚未解决的问题。出现困难的主要原因是可能的图像空间巨大。在内存资源有限的情况下,我们需要使用模型来捕获场景及其组成部分(例如对象)的这种可变性。此外,我们还需要对模型进行有效的学习和推理,以在可能的解决方案的巨大空间中找到最佳解决方案。本文提出了一套用于对象检测,分割和上下文推理的新技术,并进一步进行了研究。向整体场景理解的最终目标迈进。特别地,我们提出了一种用于表示对象的合成方法,并表明可以在线性时间内对指数数量的对象执行推理。随后,我们提出了一系列用于对象检测和分割的判别性学习方法,并表明我们的方法在计算机视觉社区的困难基准上达到了最先进的性能。最后,通过一系列混合人机实验,我们尝试确定场景理解中的瓶颈,以更好地指导该领域的未来研究工作。

著录项

  • 作者

    Mottaghi, Roozbeh.;

  • 作者单位

    University of California, Los Angeles.;

  • 授予单位 University of California, Los Angeles.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 177 p.
  • 总页数 177
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号