首页> 外文学位 >Neighborhood-level learning techniques for nonparametric scene models.
【24h】

Neighborhood-level learning techniques for nonparametric scene models.

机译:非参数场景模型的邻域级学习技术。

获取原文
获取原文并翻译 | 示例

摘要

Scene model based segmentation of video into foreground and background structure has long been an important and ongoing research topic in image processing and computer vision. Segmentation of complex video scenes into binary foreground/background label images is often the first step in a wide range of video processing applications. Examples of common applications include surveillance, Traffic Monitoring, People Tracking, Activity Recognition, and Event Detection.;A wide range of scene modeling techniques have been proposed for identifying foreground pixels or regions in surveillance video. Broadly speaking, the purpose of a scene model is to characterize the distribution of features in an image block or pixel over time. In the majority of cases, the scene model is used to represent the distribution of background features (background modeling) and the distribution of foreground features is assumed to be uniform or Gaussian. In other cases, the model characterizes the distribution of foreground and background values and the segmentation is performed by maximum likelihood.;Pixel-level scene models characterize the distributions of spatiotemporally localized image features centered about each pixel location in video over time. Individual video frames are segmented into foreground and background regions based on a comparison between pixel-level features from within the frame under segmentation and the appropriate elements of the scene model at the corresponding pixel location. Prominent pixel level scene models include the Single Gaussian, Gaussian Mixture Model and Kernel Density Estimation.;Recently reported advancements in scene modeling techniques have been largely based on the exploitation of local coherency in natural imagery based on integration of neighborhood information among nonparametric pixel-level scene models. The earliest scene models inadvertently made use of neighborhood information because they modeled images at the block level. As the resolution of the scene models progressed, textural image features such as the spatial derivative, local binary pattern (LBP) or Wavelet coefficients were employed to provide neighborhood-level structural information in the pixel-level models. In the most recent case, Barnich and Van DroogenBroeck proposed the Visual Background Extractor (ViBe), where neighborhood-level information is incorporated into the scene model in the learning step. In ViBe, the learning function is distributed over a small region such that new background information is absorbed at both the pixel and neighborhood level.;In this dissertation, I present a nonparametric pixel level scene model based on several recently reported stochastic video segmentations algorithms. I propose new stochastic techniques for updating scene models over time that are focused on the incorporation of neighborhood-level features into the model learning process and demonstrate the effectiveness of the system on a wide range of challenging visual tasks. Specifically, I propose a model maintenance policy that is based on the replacement of outliers within each nonparametric pixel level model through kernel density estimation (KDE) and a neighborhood diffusion procedure where information sharing between adjacent models having significantly different shapes is discouraged. Quantitative results are compared using the well known percentage correct classification (PCC) and a new probability correct classification (PrCC) metric, where the underlying models are scrutinized prior to application of a final segmentation threshold. In all cases considered, the superiority of the proposed model with respect to the existing state-of-the-art techniques is well established.
机译:长期以来,基于场景模型的视频分割成前景和背景结构一直是图像处理和计算机视觉中一个重要且正在进行的研究课题。将复杂的视频场景分割成二进制前景/背景标签图像通常是广泛的视频处理应用程序中的第一步。常见的应用示例包括监视,流量监视,人员跟踪,活动识别和事件检测。已提出了各种各样的场景建模技术,用于识别监视视频中的前景像素或区域。广义而言,场景模型的目的是表征图像块或像素随时间变化的特征分布。在大多数情况下,场景模型用于表示背景特征的分布(背景建模),而前景特征的分布则假定为均匀或高斯分布。在其他情况下,该模型可以表征前景和背景值的分布,并且可以通过最大似然来进行分割。像素级场景模型可以表征以视频中每个像素位置为中心的时空局部图像特征随时间的分布。根据分割后的帧内像素级特征与相应像素位置处场景模型的适当元素之间的比较,将各个视频帧分割为前景和背景区域。突出的像素级场景模型包括单一高斯模型,高斯混合模型和核密度估计。最近报道的场景建模技术的进步主要是基于对自然图像中局部相干性的利用,其基于非参数像素级之间的邻域信息集成场景模型。最早的场景模型无意中使用了邻域信息,因为它们在块级别对图像进行了建模。随着场景模型分辨率的提高,采用了诸如空间导数,局部二值模式(LBP)或小波系数之类的纹理图像特征,以在像素级模型中提供邻域级结构信息。在最近的情况下,Barnich和Van DroogenBroeck提出了视觉背景提取器(ViBe),其中在学习步骤中将邻域级别的信息合并到场景模型中。在ViBe中,学习功能分布在一个很小的区域,因此新的背景信息在像素和邻域水平上都得到吸收。;本文,基于最近报道的随机视频分割算法,提出了一种非参数像素水平的场景模型。我提出了用于随时间更新场景模型的新随机技术,重点是将邻域级特征纳入模型学习过程中,并演示了该系统在各种具有挑战性的视觉任务中的有效性。具体而言,我提出了一种模型维护策略,该策略基于通过内核密度估计(KDE)和不鼓励相邻形状之间具有明显不同形状的相邻模型之间共享信息的邻域扩散过程来替换每个非参数像素级模型中的异常值。使用众所周知的百分比正确分类(PCC)和新的概率正确分类(PrCC)度量对定量结果进行比较,其中在应用最终分割阈值之前会仔细检查基础模型。在所有考虑的情况下,相对于现有的最新技术,所提出的模型的优越性已得到充分确立。

著录项

  • 作者

    Mould, Nicholas Allen.;

  • 作者单位

    The University of Oklahoma.;

  • 授予单位 The University of Oklahoma.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 182 p.
  • 总页数 182
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号