Conference: German annual conference on acoustics; International conference on acoustics; NAG/DAGA 2009

Subband instantaneous-frequency analysis to determine masking with high temporal resolution for use in audio codecs


Abstract

We have shown that mono audio coding with auditory-system-like spectro-temporal block shaping achieves an average bitrate of 3.3 bits per sample (a compression rate of approximately 1/5). The instantaneous-frequency-based tonality estimation employs a high temporal resolution and can improve this result to an average bitrate of 3 bits per sample with no perceptual impact. If a minor perceptual impact (PEAQ quality index > -1) is allowed, the average bitrate can be decreased to 1.8 bits per sample, which is a compression rate of nearly 1/10. This is most remarkable, because spectral masking across frequency bands was not included in the masking model. Extending the model towards instantaneous-frequency-based spectral masking estimation will be part of subsequent studies and should decrease the bitrate further. Further studies with a larger database of audio samples are required to confirm the presented results.
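The compression rates quoted in the abstract can be checked with a short calculation. The sketch below assumes the uncompressed source is 16-bit PCM; the abstract does not state the source word length, so `PCM_BITS_PER_SAMPLE` and the helper `compression_rate` are illustrative assumptions and not part of the paper.

```python
# Assumption: the uncompressed signal uses 16 bits per sample (not stated in the abstract).
PCM_BITS_PER_SAMPLE = 16


def compression_rate(coded_bits_per_sample, source_bits_per_sample=PCM_BITS_PER_SAMPLE):
    """Return the coded size as a fraction of the source size."""
    return coded_bits_per_sample / source_bits_per_sample


# Average bitrates reported in the abstract, in bits per sample.
for bits in (3.3, 3.0, 1.8):
    rate = compression_rate(bits)
    print(f"{bits} bit/sample -> rate {rate:.3f} (about 1/{1 / rate:.1f})")
```

Under the 16-bit assumption, 3.3 bits per sample is a rate of about 0.21 and 1.8 bits per sample is about 0.11, which is consistent with the "approximately 1/5" and "nearly 1/10" figures in the abstract.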
