首页> 外文学位 >Low Delay, Low Complexity Multimode Tree Coding and Practical Rate Distortion Bounds for Speech.
【24h】

Low Delay, Low Complexity Multimode Tree Coding and Practical Rate Distortion Bounds for Speech.

机译:低延迟,低复杂度的多模树编码和语音的实际速率失真边界。

获取原文
获取原文并翻译 | 示例

摘要

A low-delay and low-complexity Multimode Tree coder with perceptual pre- and post-weighting and backward pitch prediction for both narrowband and wideband speech is developed. In addition, we develop composite source models for both narrowband and wideband speech, and apply these to classical rate distortion theory. Since classical rate distortion theory is based on MSE distortion, we generate mapping functions by calculating the MSE and PESQ/WPESQ pairs from ADPCM coders. As a result, the performance of a standardized speech codec can be compared with rate distortion bounds based on PESQ/WPESQ distortion.;In our experiments, the results show that perceptual pre- and post-weighting filters and backward pitch prediction does improve speech quality without increasing bit rate and delay for voiced speech. Compared with narrowband speech codecs, the worst-case complexity of the Multimode Tree coder is one-third of AMR-NB and one-eighth of G.728, and the delay of the Multimode Tree coder is a quarter of AMR-NB. Compared with wideband standardized speech codecs, the worst-case computational complexity of the Multimode Tree coder is one-third of AMR-WB and the delay of the Multimode Tree coder is half of AMR-WB and one-third of G.722.1.;In addition, composite source models for both narrowband and wideband speech are developed. In order to generate the mapping function for MSE and PESQ/WPESQ, we use G.726/G.727 for narrowband speech mapping and generate a wideband ADPCM coder based on G.726 and G.727 for wideband mapping. The rate distortion bounds calculated from composite source models based on MSE distortion are mapped to PESQ/WPESQ distortion by mapping functions. Therefore, the performance of standardized speech codecs is compared with the rate distortion bounds based on PESQ/WPESQ distortion.
机译:开发了一种低延迟,低复杂度的多模式树编码器,该编码器具有针对窄带和宽带语音的感知前加权和后加权以及后向音调预测。此外,我们开发了适用于窄带和宽带语音的复合源模型,并将其应用于经典速率失真理论。由于经典的速率失真理论基于MSE失真,因此我们通过从ADPCM编码器计算MSE和PESQ / WPESQ对来生成映射函数。结果,可以将标准化语音编解码器的性能与基于PESQ / WPESQ失真的速率失真范围进行比较。;在我们的实验中,结果表明,感知前加权和后加权滤波器以及后向音调预测确实可以改善语音质量而不会增加浊音的比特率和延迟。与窄带语音编解码器相比,多模树编码器的最坏情况复杂度是AMR-NB的三分之一,是G.728的八分之一,而多模树编码器的延迟是AMR-NB的四分之一。与宽带标准化语音编解码器相比,多模树编码器的最坏情况下的计算复杂度是AMR-WB的三分之一,而多模树编码器的延迟是AMR-WB的一半,而G.722.1则是三分之一。此外,还开发了适用于窄带和宽带语音的复合源模型。为了生成MSE和PESQ / WPESQ的映射功能,我们使用G.726 / G.727进行窄带语音映射,并生成基于G.726和G.727的宽带ADPCM编码器进行宽带映射。根据映射功能,将基于MSE失真的复合源模型计算出的速率失真范围映射到PESQ / WPESQ失真。因此,将标准化语音编解码器的性能与基于PESQ / WPESQ失真的速率失真范围进行比较。

著录项

  • 作者

    Li, Ying-Yi.;

  • 作者单位

    University of California, Santa Barbara.;

  • 授予单位 University of California, Santa Barbara.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 148 p.
  • 总页数 148
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号