首页> 外文会议>Advances in Multimedia Information Processing - PCM 2008 >An Efficient Approach for Classification of Speech and Music
【24h】

An Efficient Approach for Classification of Speech and Music

机译:语音和音乐分类的一种有效方法

获取原文
获取原文并翻译 | 示例

摘要

A new method to classify an audio segment into speech and music related to the automatic transcription of broadcast news is presented. To discriminate between speech and music, sample entropy (SampEn), a time complexity measure, mainly operates as a feature. SampEn is a variant of the approximate entropy (ApEn) that measures the regularity of time series. The basic idea is to label a given audio into speech or music depending on its regularity. Based on the SampEn sequence calculated over a window, the regularity of a given audio stream is measured. The effectiveness of the proposed method is tested on experiments, including broadcast news shows from BBC radio stations, WBAI news, UN news and music genres with different temporal distributions. Results show the robustness of the proposed method achieving high discrimination accuracy for all tested experiments.
机译:提出了一种将音频片段分类为与广播新闻的自动转录相关的语音和音乐的新方法。为了区分语音和音乐,样本熵(SampEn)是一种时间复杂性度量,主要用作功能。 SampEn是近似熵(ApEn)的一种变体,用于测量时间序列的规律性。基本思想是根据给定音频的规律性将其标记为语音或音乐。基于在窗口上计算的SampEn序列,可以测量给定音频流的规律性。该方法的有效性在实验上得到了验证,包括来自BBC广播电台的广播新闻节目,WBAI新闻,联合国新闻以及具有不同时间分布的音乐流派。结果表明,所提出方法的鲁棒性在所有测试实验中均达到了高判别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号