CLOVIS: towards precision-oriented text-based video retrieval through the unification of automatically-extracted concepts and relations of the visual and audio/speech contents

M. Belkhatir


Abstract

Traditional multimedia (video) retrieval systems rely on the keyword-based approach to keep the search process fast, although this approach has several shortcomings and limitations related to the way users are able to formulate their information needs. Typical Web multimedia retrieval systems illustrate this paradigm: the result of a search consists of a collection of thousands of multimedia documents, many of which are irrelevant or not fully exploited by the typical user. Indeed, according to studies of user behavior, an individual is mostly interested in the first documents returned during a search session. A multimedia retrieval system should therefore model the multimedia content as precisely as possible, so that the first retrieved images are fully relevant to the user's information need. For this purpose, the keyword-based approach proves clearly insufficient. A high-level index and query language that addresses the combination of modalities within expressive frameworks for video indexing and retrieval is of huge importance, and is the only solution for achieving significant retrieval performance. This paper presents a multi-faceted conceptual framework integrating multiple characterizations of the visual and audio contents for automatic video retrieval. It relies on an expressive representation formalism handling high-level video descriptions and on a full-text query framework, in an attempt to take video indexing and retrieval beyond trivial low-level processes, keyword-annotation frameworks and state-of-the-art architectures that loosely couple visual and audio descriptions. Experiments on the multimedia topic search task of the TRECVID evaluation campaign validate our proposal.


Bibliographic details

Source
Journal of Intelligent Information Systems | 2010, Issue 2 | pp. 135-175 | 41 pages
Author
M. Belkhatir
Affiliation

Center for Multimedia Computing, Communications and Applications Research, Monash University, Sunway Campus, Sunway, Malaysia

Original format: PDF
Language: English
Keywords
video indexing and retrieval; visual/audio integration; conceptual graphs; large-scale experimental validation;


