Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

Cassia Valentini-Botinhao; Junichi Yamagishi; Simon King; Ranniery Maia

首页> 外文期刊>Computer speech and language >Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

【24h】

Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

机译：通过修改Mel倒谱系数以增加瞥见比例来增强HMM生成的语音在可加性噪声中的清晰度

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes speech intelligibility enhancement for Hidden Markov Model (HMM) generated synthetic speech in noise. We present a method for modifying the Mel cepstral coefficients generated by statistical parametric models that have been trained on plain speech. We update these coefficients such that the glimpse proportion - an objective measure of the intelligibility of speech in noise - increases, while keeping the speech energy fixed. An acoustic analysis reveals that the modified speech is boosted in the region 1-4kHz, particularly for vowels, nasals and approximants. Results from listening tests employing speech-shaped noise show that the modified speech is as intelligible as a synthetic voice trained on plain speech whose duration, Mel cepstral coefficients and excitation signal parameters have been adapted to Lombard speech from the same speaker. Our proposed method does not require these additional recordings of Lombard speech. In the presence of a competing talker, both modification and adaptation of spectral coefficients give more modest gains.

机译：本文介绍了在语音中针对隐马尔可夫模型（HMM）生成的合成语音的语音清晰度增强。我们提出了一种方法，用于修改由已经在普通语音上训练的统计参数模型生成的Mel倒谱系数。我们更新这些系数，以便在保持语音能量固定的前提下，瞥见比例（即语音在语音中的清晰度）的客观衡量指标得以提高。声学分析表明，修改后的语音会在1-4kHz的范围内增强，特别是对于元音，鼻音和近似音而言。使用语音形噪声的听力测试结果表明，修改后的语音与在普通语音上训练的合成语音一样可理解，其持续时间，梅尔倒谱系数和激励信号参数已适应来自同一说话者的伦巴德语音。我们提出的方法不需要这些额外的伦巴底语语音记录。在有竞争性发言者的情况下，频谱系数的修改和自适应都可提供更为适度的增益。

著录项

来源
《Computer speech and language》 |2014年第2期|665-686|共22页
作者
Cassia Valentini-Botinhao; Junichi Yamagishi; Simon King; Ranniery Maia;
展开▼
作者单位

The Centre for Speech Technology Research, University of Edinburgh, UK;

The Centre for Speech Technology Research, University of Edinburgh, UK;

The Centre for Speech Technology Research, University of Edinburgh, UK;

Cambridge Research Laboratory, Toshiba Research Europe Limited, UK;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Intelligibility of speech in noise; HMM-based speech synthesis; Mel cepstral coefficients; Glimpse proportion measure;

机译：语音中的语音清晰度;基于HMM的语音合成;梅尔倒谱系数;瞥见比例度量;

相似文献

外文文献
中文文献
专利

1. Foreground Speech Segmentation and Enhancement Using Glottal Closure Instants and Mel Cepstral Coefficients [J] . K. T. Deepak, S. R. Mahadeva Prasanna Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第7期

机译：使用声门关闭瞬间和梅尔倒谱系数进行前景语音分割和增强
2. MEL FREQUENCY CEPSTRAL COEFFICIENTS (MFCC) FEATURE EXTRACTION ENHANCEMENT IN THE APPLICATION OF SPEECH RECOGNITION: A COMPARISON STUDY [J] . SAYF A. MAJEED, HAFIZAH HUSAIN, SALINA ABDUL SAMAD, Journal of Theoretical and Applied Information Technology . 2015,第1期

机译：MEL频率倒谱系数（MFCC）特征提取在语音识别中的应用：对比研究
3. Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures [J] . Darch J, Milner B, Vaseghi S The Journal of the Acoustical Society of America . 2008,第6期

机译：分布式语音识别架构中基于mel-频率倒谱系数的声学语音特征分析和预测
4. Mel cepstral coefficient modification based on the Glimpse Proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise [C] . Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King Annual conference of the International Speech Communication Association . 2012

机译：基于Glimpse比例测度的Mel倒谱系数修改，以提高HMM生成的合成语音在噪声中的清晰度
5. Estimation of cepstral coefficients for robust speech recognition. [D] . Indrebo, Kevin M. 2008

机译：倒频谱系数的估计，用于鲁棒的语音识别。
6. The application of fractional Mel cepstral coefficient in deceptive speech detection [O] . Xinyu Pan, Heming Zhao, Yan Zhou -1

机译：分数梅尔倒谱系数在欺骗性语音检测中的应用
7. Improving Intelligibility in Noise of HMM-Generated Speech via Noise-Dependent and -Independent Methods [O] . Valentini-Botinhao, Cassia, Godoy, Elizabeth, Stylianou, Yannis, 2013

机译：通过依赖于噪声和独立于噪声的方法来提高HMM生成的语音的噪声可理解性

Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

摘要

著录项

相似文献

相关主题

期刊订阅