Conference on multimedia storage and archiving systems

Video content parsing based on combined audio and visual information



Abstract

Previous research on audiovisual data segmentation and indexing has focused primarily on the pictorial part, often ignoring significant clues contained in the accompanying audio stream. A fully functional system for video content parsing can be achieved more successfully through a proper combination of audio and visual information. By investigating the data structure of different video types, we present tools for both audio and visual content analysis, together with a scheme for video segmentation and annotation. In the proposed system, video data are segmented into audio scenes and visual shots by detecting abrupt changes in audio and visual features, respectively. Each audio scene is then categorized and indexed as one of the basic audio types, while each visual shot is represented by keyframes and associated image features. An index table is then generated automatically for each video clip by integrating the outputs of the audio and visual analyses. The proposed system is shown to provide satisfactory video indexing results.
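The segmentation and indexing steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the L1 distance, the fixed thresholds, the middle-frame keyframe rule, the shared frame index for audio and video, and the names `detect_abrupt_changes`, `to_segments`, and `build_index_table` are all assumptions made for illustration. Audio classification is not implemented here; the labels "silence" and "speech" are hypothetical placeholders.

```python
from typing import Dict, List, Sequence, Tuple


def detect_abrupt_changes(features: Sequence[Sequence[float]],
                          threshold: float) -> List[int]:
    """Return each index i where the L1 distance between the feature
    vectors of frame i-1 and frame i exceeds the threshold (assumed rule)."""
    boundaries = []
    for i in range(1, len(features)):
        dist = sum(abs(a - b) for a, b in zip(features[i - 1], features[i]))
        if dist > threshold:
            boundaries.append(i)
    return boundaries


def to_segments(boundaries: List[int], n_frames: int) -> List[Tuple[int, int]]:
    """Turn boundary indices into half-open (start, end) segments."""
    starts = [0] + list(boundaries)
    ends = list(boundaries) + [n_frames]
    return list(zip(starts, ends))


def build_index_table(audio_scenes: List[Tuple[int, int, str]],
                      visual_shots: List[Tuple[int, int]]) -> List[Dict]:
    """For each visual shot, record a keyframe (middle frame, an assumption)
    and the labels of every audio scene that overlaps the shot."""
    table = []
    for s_start, s_end in visual_shots:
        labels = [label for a_start, a_end, label in audio_scenes
                  if a_start < s_end and s_start < a_end]
        table.append({"shot": (s_start, s_end),
                      "keyframe": (s_start + s_end) // 2,
                      "audio": labels})
    return table


# Toy data: six frames, two-bin color histograms and one audio energy value each.
visual = [[1.0, 0.0], [1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]
audio = [[0.1], [0.1], [0.8], [0.8], [0.8], [0.8]]

shots = to_segments(detect_abrupt_changes(visual, threshold=1.0), len(visual))
scenes = to_segments(detect_abrupt_changes(audio, threshold=0.5), len(audio))

# Hypothetical labels standing in for the audio-type classifier.
labeled_scenes = [(s, e, lab) for (s, e), lab in zip(scenes, ["silence", "speech"])]
index_table = build_index_table(labeled_scenes, shots)
for row in index_table:
    print(row)
```

On the toy data, the visual boundary falls at frame 3 and the audio boundary at frame 2, so the first shot overlaps both audio scenes while the second overlaps only one. A real system would work on time stamps rather than a shared frame index, since audio and video features are typically sampled at different rates.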
