首页> 外文期刊>The Journal of Problem Solving >Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey
【24h】

Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey

机译:大型多媒体检索中的人机合作:一项调查

获取原文
           

摘要

Large-Scale Multimedia Retrieval(LSMR) is the task to fast analyze a large amount of multimedia data like images or videos and accurately find the ones relevant to a certain semantic meaning. Although LSMR has been investigated for more than two decades in the fields of multimedia processing and computer vision, a more interdisciplinary approach is necessary to develop an LSMR system that is really meaningful for humans. To this end, this paper aims to stimulate attention to the LSMR problem from diverse research fields. By explaining basic terminologies in LSMR, we first survey several representative methods in chronological order. This reveals that due to prioritizing the generality and scalability for large-scale data, recent methods interpret semantic meanings with a completely different mechanism from humans, though such humanlike mechanisms were used in classical heuristic-based methods. Based on this, we discuss human-machine cooperation, which incorporates knowledge about human interpretation into LSMR without sacrificing the generality and scalability. In particular, we present three approaches to human-machine cooperation (cognitive, ontological, and adaptive), which are attributed to cognitive science, ontology engineering, and metacognition, respectively. We hope that this paper will create a bridge to enable researchers in different fields to communicate about the LSMR problem and lead to a ground-breaking next generation of LSMR systems.
机译:大规模多媒体检索(LSMR)是一项任务,用于快速分析大量的多媒体数据,例如图像或视频,并准确地找到与某种语义相关的数据。尽管在多媒体处理和计算机视觉领域对LSMR进行了20多年的研究,但仍需要更多的跨学科方法来开发对人类真正有意义的LSMR系统。为此,本文旨在激发来自各个研究领域的LSMR问题的关注。通过解释LSMR中的基本术语,我们首先按时间顺序调查了几种代表性方法。这表明,由于优先考虑大规模数据的通用性和可伸缩性,尽管经典的启发式方法中使用了类似人的机制,但最近的方法以与人完全不同的机制来解释语义。基于此,我们讨论了人机协作,该协作将有关人机解释的知识整合到了LSMR中,而不会牺牲通用性和可伸缩性。特别是,我们介绍了三种人机协作方法(认知,本体和自适应),它们分别归因于认知科学,本体工程和元认知。我们希望本文能够为不同领域的研究人员就LSMR问题进行交流提供桥梁,并开创性的下一代LSMR系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号