Multimodal pattern matching for audio-visual query and retrieval


Abstract

A necessary capability for content-based retrieval is support for the query-by-example paradigm. In the past, there have been several attempts to use low-level features for video retrieval. None of these approaches, however, uses the multimedia information content of the video. We present an algorithm for matching multimodal (audio-visual) patterns for the purpose of content-based video retrieval. The novel ability of our approach to use the information content in multiple media, coupled with a strong emphasis on temporal similarity, differentiates it from the state of the art in content-based retrieval. At the core of the pattern matching scheme is a dynamic programming algorithm, which leads to a significant improvement in performance. By coupling the use of audio with video, this algorithm can be applied to grouping shots based on audio-visual similarity. This is much more effective in constructing scenes from shots than using visual content alone.
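The abstract does not spell out the dynamic programming recurrence, so the following is only a minimal sketch of one common way to compare two feature sequences with an emphasis on temporal similarity: a dynamic-time-warping style alignment. Everything here is an assumption for illustration and not the authors' method: the per-frame feature vectors (hypothetical `(visual, audio)` pairs), the Euclidean frame distance, and the function names `frame_distance`, `dtw_cost`, and `rank_clips`.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two per-frame feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_cost(query, candidate):
    """Minimum cumulative frame distance over monotonic temporal alignments."""
    n, m = len(query), len(candidate)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    # cost[i][j]: best cost of aligning query[:i] with candidate[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(query[i - 1], candidate[j - 1])
            # Extend the cheapest of: skip in candidate, skip in query, or match both.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def rank_clips(query, clips):
    """Return clip names sorted by ascending alignment cost to the query."""
    return sorted(clips, key=lambda name: dtw_cost(query, clips[name]))


# Hypothetical toy data: each frame is a (visual_feature, audio_feature) pair.
query = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
clips = {
    "same_tempo": [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)],
    "slower": [(0.0, 0.0), (0.0, 0.0), (1.0, 1.0), (1.0, 1.0), (2.0, 2.0)],
    "different": [(5.0, 5.0), (5.0, 5.0), (5.0, 5.0)],
}
ranking = rank_clips(query, clips)
```

In this toy setup the slowed-down clip aligns with the query at zero cost because the warping absorbs the repeated frames, while the unrelated clip ranks last. Stacking audio and visual features into one vector is only one possible way to combine modalities; the paper may weight or fuse them differently.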

Bibliographic details

Source
Conference on Storage and Retrieval for Media Databases 2001, Jan 24-26, 2001, San Jose, USA | 2001 | pp. 188-195 | 8 pages
Conference location: San Jose, CA (US)
Authors
Milind R. Naphade; Roy Wang; Thomas S. Huang
Author affiliation

Department of Electrical and Computer Engineering, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign

Original format: PDF
Language: English (eng)
Chinese Library Classification: Radio electronics, telecommunications technology
Keywords
video query by example; dynamic programming; local optimality; pattern matching
