首页> 外文会议>International conference on machine vision >Hierarchical VS Non-hierarchical Audio Indexation and Classification for Video Genres
【24h】

Hierarchical VS Non-hierarchical Audio Indexation and Classification for Video Genres

机译:视频流的分级VS非分级音频索引和分类

获取原文

摘要

In this paper, Support Vector Machines (SVMs) are used for segmenting and indexing video genres based on only audio features extracted at block level, which has a prominent asset by capturing local temporal information. The main contribution of our study is to show the wide effect on the classification accuracies while using an hierarchical categorization structure based on Mel Frequency Cepstral Coefficients (MFCC) audio descriptor. In fact, the classification consists in three common video genres: sports videos, music clips and news scenes. The sub-classification may divide each genre into several multi-speaker and multi-dialect sub-genres. The validation of this approach was carried out on over 360 minutes of video span yielding a classification accuracy of over 99%.
机译:在本文中,支持向量机(SVM)用于仅基于块级提取的音频特征对视频流派进行分段和索引,该特征通过捕获本地时间信息而具有显着的优势。我们的研究的主要贡献在于,在使用基于梅尔频率倒谱系数(MFCC)音频描述符的分层分类结构时,显示了对分类准确性的广泛影响。实际上,分类包括三种常见的视频类型:体育视频,音乐剪辑和新闻场景。子分类可以将每个流派划分为几个多扬声器和多方言子流派。这种方法的验证是在超过360分钟的视频跨度上进行的,分类精度超过99%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号