...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Structure Inference for Bayesian Multisensory Scene Understanding
【24h】

Structure Inference for Bayesian Multisensory Scene Understanding

机译:贝叶斯多感觉场景理解的结构推断

获取原文
获取原文并翻译 | 示例
           

摘要

We investigate a solution to the problem of multi-sensor scene understanding by formulating it in the framework of Bayesian model selection and structure inference. Humans robustly associate multimodal data as appropriate, but previous modelling work has focused largely on optimal fusion, leaving segregation unaccounted for and unexploited by machine perception systems. We illustrate a unifying, Bayesian solution to multi-sensor perception and tracking which accounts for both integration and segregation by explicit probabilistic reasoning about data association in a temporal context. Such explicit inference of multimodal data association is also of intrinsic interest for higher level understanding of multisensory data. We illustrate this using a probabilistic implementation of data association in a multi-party audio-visual scenario, where unsupervised learning and structure inference is used to automatically segment, associate and track individual subjects in audiovisual sequences. Indeed, the structure inference based framework introduced in this work provides the theoretical foundation needed to satisfactorily explain many confounding results in human psychophysics experiments involving multimodal cue integration and association.
机译:我们通过在贝叶斯模型选择和结构推断的框架中制定多传感器场景理解问题来研究解决方案。人类会适当地稳健地关联多模式数据,但是以前的建模工作主要集中在最佳融合上,而机器感知系统却无法解释和利用隔离。我们举例说明了一种针对多传感器感知和跟踪的统一贝叶斯解决方案,该解决方案通过在时间上下文中有关数据关联的显式概率推理来考虑集成和隔离。多模式数据关联的这种显式推断对于更高层次地理解多传感器数据也具有内在的兴趣。我们在多方视听场景中使用数据关联的概率实现方式对此进行了说明,其中无监督学习和结构推断用于自动分割,关联和跟踪视听序列中的各个主题。确实,这项工作中引入的基于结构推断的框架提供了令人满意地解释涉及多峰提示整合和关联的人类心理物理学实验中许多混淆结果所需的理论基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号