Blind estimation of perceptual quality for modern speech communications.

Abstract

Modern speech communication technologies expose users to perceptual quality degradations that were not experienced earlier with conventional telephone systems. Since perceived speech quality is a major contributor to the end user's perception of quality of service, speech quality estimation has become an important research field. In this dissertation, perceptual quality estimators are proposed for several emerging speech communication applications, in particular for (i) wireless communications with noise suppression capabilities, (ii) wireless-VoIP communications, (iii) far-field hands-free speech communications, and (iv) text-to-speech systems.

First, a general-purpose speech quality estimator is proposed based on statistical models of normative speech behaviour and on innovative techniques to detect multiple signal distortions. The estimators do not depend on a clean reference signal hence are termed "blind." Quality meters are then distributed along the network chain to allow for both quality degradations and quality enhancements to be handled. In order to improve estimation performance for wireless communications, statistical models of noise-suppressed speech are also incorporated.

Next, a hybrid signal-and-link-parametric quality estimation paradigm is proposed for emerging wireless-VoIP communications. The algorithm uses VoIP connection parameters to estimate a base quality representative of the packet switching network. Signal-based distortions are then detected and quantified in order to adjust the base quality accordingly. The proposed hybrid methodology is shown to overcome the limitations of existing pure signal-based and pure link parametric algorithms.

Temporal dynamics information is then investigated for quality diagnosis for hands-free speech communications. A spectro-temporal signal representation, where speech and reverberation tail components are shown to be separable, is used for blind characterization of room acoustics. In particular, estimators of reverberation time, direct-to-reverberation energy ratio, and reverberant speech quality are developed.

Lastly, perceptual quality estimation for text-to-speech systems is addressed. Text- and speaker-independent hidden Markov models, trained on naturally produced speech, are used to capture normative spectral-temporal information. Deviations from the models, computed by means of a log-likelihood measure, are shown to be reliable indicators of multiple quality attributes including naturalness, fluency, and intelligibility.

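Bibliographic record

  • Author

    Falk, Tiago Henrique.

  • Author affiliation

    Queen's University (Canada).

  • Degree-granting institution: Queen's University (Canada).
  • Subject: Engineering Electronics and Electrical.
  • Degree: Ph.D.
  • Year: 2009
  • Pages: 211 p.
  • Total pages: 211
  • Original format: PDF
  • Language of text: eng
  • Chinese Library Classification: Radio electronics, telecommunications technology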
