首页> 外文学位 >Blind estimation of perceptual quality for modern speech communications.

【24h】

Blind estimation of perceptual quality for modern speech communications.

机译：盲目估计现代语音通信的感知质量。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Modern speech communication technologies expose users to perceptual quality degradations that were not experienced earlier with conventional telephone systems. Since perceived speech quality is a major contributor to the end user's perception of quality of service, speech quality estimation has become an important research field. In this dissertation, perceptual quality estimators are proposed for several emerging speech communication applications, in particular for (i) wireless communications with noise suppression capabilities, (ii) wireless-VoIP communications, (iii) far-field hands-free speech communications, and (iv) text-to-speech systems.;First, a general-purpose speech quality estimator is proposed based on statistical models of normative speech behaviour and on innovative techniques to detect multiple signal distortions. The estimators do not depend on a clean reference signal hence are termed "blind." Quality meters are then distributed along the network chain to allow for both quality degradations and quality enhancements to be handled. In order to improve estimation performance for wireless communications, statistical models of noise-suppressed speech are also incorporated.;Next, a hybrid signal-and-link-parametric quality estimation paradigm is proposed for emerging wireless-VoIP communications. The algorithm uses VoIP connection parameters to estimate a base quality representative of the packet switching network. Signal-based distortions are then detected and quantified in order to adjust the base quality accordingly. The proposed hybrid methodology is shown to overcome the limitations of existing pure signal-based and pure link parametric algorithms.;Temporal dynamics information is then investigated for quality diagnosis for hands-free speech communications. A spectro-temporal signal representation, where speech and reverberation tail components are shown to be separable, is used for blind characterization of room acoustics. In particular, estimators of reverberation time, direct-to-reverberation energy ratio, and reverberant speech quality are developed.;Lastly, perceptual quality estimation for text-to-speech systems is addressed. Text- and speaker-independent hidden Markov models, trained on naturally produced speech, are used to capture normative spectral-temporal information. Deviations from the models, computed by means of a log-likelihood measure, are shown to be reliable indicators of multiple quality attributes including naturalness, fluency, and intelligibility.

机译：现代语音通信技术使用户面临感知质量的下降，而传统电话系统则没有。由于感知的语音质量是最终用户对服务质量的感知的主要贡献者，因此语音质量估计已成为重要的研究领域。在本文中，提出了针对几种新兴语音通信应用的感知质量估计器，特别是（i）具有噪声抑制功能的无线通信，（ii）无线VoIP通信，（iii）远场免提语音通信，以及（iv）文本到语音系统。首先，基于规范语音行为的统计模型和创新的检测多信号失真的技术，提出了一种通用语音质量估计器。估计器不依赖于干净的参考信号，因此被称为“盲”。然后，质量计沿着网络链分布，以允许同时处理质量下降和质量增强。为了提高无线通信的估计性能，还引入了噪声抑制语音的统计模型。接下来，提出了一种用于新兴的无线VoIP通信的混合信号和链路参数质量估计范例。该算法使用VoIP连接参数来估计代表分组交换网络的基本质量。然后检测并量化基于信号的失真，以便相应地调整基本质量。证明了所提出的混合方法克服了现有的基于纯信号和纯链接参数算法的局限性；然后研究了时空信息以进行免提语音通信的质量诊断。频谱-时间信号表示（其中语音和混响尾声分量显示为可分离的）用于室内声学的盲目表征。特别是，开发了混响时间，直接混响能量比和混响语音质量的估计器。最后，提出了文本到语音系统的感知质量估计。在自然产生的语音上训练的独立于文本和说话者的隐式马尔可夫模型用于捕获规范的频谱时域信息。通过对数似然度量计算得出的与模型的偏差被证明是多种质量属性（包括自然度，流利度和清晰度）的可靠指标。

著录项

作者
Falk, Tiago Henrique.;
展开▼
作者单位

Queen's University (Canada).;

展开▼
授予单位 Queen's University (Canada).;
学科 Engineering Electronics and Electrical.
学位 Ph.D.
年度 2009
页码 211 p.
总页数 211
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Impact of Languages and Accent on Perceived Speech Quality Predicted by Perceptual Evaluation of Speech Quality (PESQ) and Perceptual Objective Listening Quality Assessment (POLQA): Case of Moore, Dioula, French and English [J] . Daouda Konane, Sibiri Tiemounou, Wend Yam Serge Boris Ouedraogo 应用科学（英文） . 2021,第012期

机译：Impact of Languages and Accent on Perceived Speech Quality Predicted by Perceptual Evaluation of Speech Quality (PESQ) and Perceptual Objective Listening Quality Assessment (POLQA): Case of Moore, Dioula, French and English
2. Benefits of perceptual speech quality metrics in modern cellular systems [J] . B. Rohani, B. Rohani, M. Caldera, Electronics Letters . 2006,第21期

机译：现代蜂窝系统中感知语音质量指标的好处
3. PRACTICAL ESTIMATION OF VOIP QOS BASED ON CORRELATED ANALYSIS OF TRANSPORT PROTOCOL IMPAIRMENTS AND PERCEPTUAL SPEECH QUALITY [J] . Nae more, journal of marine sciences . 2006,第5a6期

机译：基于运输协议减损和感知语音质量相关分析的VOIP QOS实用估计
4. Perceptual dimensions of speech sound quality in modern transmission systems [C] . Alexander Raake 6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16.-Oct.20 2000 Beijing International Convention Center,Beijing, China . 2000

机译：现代传输系统中语音质量的感知维度
5. Blind channel equalization and estimation for wireless communications. [D] . Zhuang, Xiangyang. 2000

机译：无线通信的盲信道均衡和估计。
6. Mental imagery of speech: linking motor and perceptual systems through internal simulation and estimation [O] . Xing Tian, David Poeppel 2012

机译：语音心理图像：通过内部模拟和估计将运动系统与感知系统联系起来
7. Blind Quality Estimation by Disentangling Perceptual and Noisy Features in High Dynamic Range Images [O] . Kottayil, N.K., Valenzise, Giuseppe, Dufaux, Frederic, 2018

机译：通过分解高动态范围图像中的感知和噪声特征进行盲质量估计

Blind estimation of perceptual quality for modern speech communications.

摘要

著录项

相似文献

相关主题

期刊订阅