International Conference on Acoustics, Speech, and Signal Processing

A comparison of several acoustic representations for speech recognition with degraded and undegraded speech

Abstract

Several acoustic representations were compared in speaker-dependent and speaker-independent, connected-word and isolated-word recognition tests, using undegraded speech and speech degraded by adding white noise and by applying a 6-dB/octave spectral tilt. The representations comprised the output of an auditory model; cepstrum coefficients derived from an FFT-based mel-scale filter bank, with various weighting schemes applied to the coefficients; cepstrum coefficients augmented with measures of their rates of change over time; and sets of linear discriminant functions derived from the filter-bank output, called IMELDA. The auditory model outperformed the cepstrum representations except in noise-free connected-word tests, where it had a high insertion rate. The best cepstrum weighting scheme was derived from within-class variances, and its behavior may explain the empirical adjustments found necessary with other schemes. IMELDA outperformed all other representations in all conditions and is computationally simple.
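
The two degradations and the variance-based weighting named in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: the tilt is modeled as a first-difference filter (a rising tilt of roughly 6 dB/octave at low frequencies; the abstract gives neither the direction nor the filter), and the weighting is taken to be the inverse of the pooled within-class variance of each cepstrum coefficient. All function names are hypothetical, and this is not the authors' implementation.

```python
import math
import random


def add_white_noise(signal, snr_db, seed=0):
    """Add Gaussian white noise scaled to give roughly the requested SNR in dB."""
    rng = random.Random(seed)
    power = sum(x * x for x in signal) / len(signal)
    noise_std = math.sqrt(power / (10 ** (snr_db / 10)))
    return [x + rng.gauss(0.0, noise_std) for x in signal]


def apply_tilt(signal):
    """Assumed tilt filter: y[n] = x[n] - x[n-1], with y[0] = x[0].

    Its gain is 2*sin(w/2), which rises by about 6 dB per octave
    for frequencies well below the Nyquist rate.
    """
    return [signal[0]] + [signal[n] - signal[n - 1] for n in range(1, len(signal))]


def within_class_variances(classes):
    """Pooled within-class variance of each coefficient.

    classes maps a label (e.g. a word) to a list of equal-length
    cepstral vectors belonging to that label.
    """
    dim = len(next(iter(classes.values()))[0])
    squared_dev = [0.0] * dim
    count = 0
    for vectors in classes.values():
        mean = [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]
        for v in vectors:
            for k in range(dim):
                squared_dev[k] += (v[k] - mean[k]) ** 2
        count += len(vectors)
    return [s / count for s in squared_dev]


def weighted_distance(a, b, variances):
    """Squared distance with each coefficient weighted by 1/variance (assumed scheme).

    Assumes every variance is non-zero.
    """
    return sum((x - y) ** 2 / var for x, y, var in zip(a, b, variances))
```

Under this assumed scheme, coefficients that vary widely within a word class contribute less to the distance, which is one plausible reading of how a variance-derived weighting could replace hand-tuned weights.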
