Video content parsing based on combined audio and visual information

机译：基于组合的视听信息的视频内容解析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Abstract: While previous research on audiovisual data segmentation and indexing primarily focuses on the pictorial part, significant clues contained in the accompanying audio flow are often ignored. A fully functional system for video content parsing can be achieved more successfully through a proper combination of audio and visual information. By investigating the data structure of different video types, we present tools for both audio and visual content analysis and a scheme for video segmentation and annotation in this research. In the proposed system, video data are segmented into audio scenes and visual shots by detecting abrupt changes in audio and visual features, respectively. Then, the audio scene is categorized and indexed as one of the basic audio types while a visual shot is presented by keyframes and associate image features. An index table is then generated automatically for each video clip based on the integration of outputs from audio and visual analysis. It is shown that the proposed system provides satisfying video indexing results. !12

机译：摘要：虽然先前有关视听数据分割和索引的研究主要集中在图像部分，但伴随的音频流中包含的重要线索通常被忽略。通过适当地组合音频和视频信息，可以更成功地实现用于视频内容解析的功能齐全的系统。通过研究不同视频类型的数据结构，我们在本研究中提供了用于音频和视频内容分析的工具以及用于视频分割和注释的方案。在提出的系统中，通过分别检测音频和视觉特征的突然变化，将视频数据分割为音频场景和视觉镜头。然后，音频场景被分类和索引为基本音频类型之一，而关键帧和关联的图像特征则呈现了视觉镜头。然后，根据音频和视觉分析的输出结果，为每个视频剪辑自动生成一个索引表。结果表明，所提出的系统提供了令人满意的视频索引结果。！12

著录项

来源
《Conference on multimedia storage and archiving systems》|1999年|p.78-89|共12页
会议地点
作者
Tong Zhang; Univ. of Southern California; Los Angeles; CA; USA; C.-C. J. Kuo; Univ. of Southern California; Los Angeles; CA; USA.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Content-based video parsing and indexing based on audio-visualinteraction [J] . Tsekeridou S., Pitas I. IEEE Transactions on Circuits and Systems for Video Technology . 2001,第4期

机译：基于视听交互的基于内容的视频解析和索引
2. Audiovisual Integration With Segment Models For Tennis Video Parsing [J] . Manolis Delakis, Guillaume Gravier, Patrick Gros Computer vision and image understanding . 2008,第2期

机译：视听集成与用于网球视频解析的分段模型
3. Bitrate-Based No-Reference Video Quality Assessment Combining the Visual Perception of Video Contents [J] . Yao Juncai, Liu Guizhong IEEE Transactions on Broadcasting . 2019,第3期

机译：结合视频内容的视觉感知的基于比特率的无参考视频质量评估
4. Video content parsing based on combined audio and visual information [C] . Tong Zhang, C.-C. J. Kuo Conference on multimedia storage and archiving systems . 1999

机译：基于组合音频和视觉信息的视频内容解析
5. Automatic segmentation, indexing and retrieval of audiovisual data based on combined audio and visual content analysis. [D] . Zhang, Tong. 1999

机译：基于组合的视听内容分析，对视听数据进行自动分段，索引和检索。
6. Original research: Quantifying alcohol audio-visual content in UK broadcasts of the 2018 Formula 1 Championship: a content analysis and population exposure [O] . Alexander Barker, Magdalena Opazo-Breton, Emily Thomson, 2020

机译：原始研究：量化2018级锦标赛英国广播中的酒精视听内容：内容分析和人口曝光
7. Impairment-Factor-Based Audiovisual Quality Model for IPTV: Influence of Video Resolution, Degradation Type, and Content Type [O] . M. N. Garcia, R. Schleicher, A. Raake 2011

机译：IPTV基于减损因子的视听质量模型：视频分辨率，降级类型和内容类型的影响

Video content parsing based on combined audio and visual information

摘要

著录项

相似文献

相关主题

期刊订阅