Wavelet-Based Power Normalized Spectrum for Hindi Phoneme Classification

Mishra Shipra; Chandra Mahesh

Abstract

This paper presents a wavelet-based power-normalized spectrum for computing robust cepstral features, named WP-PNCC features. The proposed technique computes a wavelet-packet-based short-time spectrum of the speech signal. A nonlinear function is defined that relates the power spectrum of clean speech to the power spectrum of speech corrupted by noise. The constants of this function are computed from a longer-duration speech spectrum, and the short-time spectrum of each frame is weighted with the power function. The weighted speech spectrum is processed with a logarithm and a discrete cosine transform to compute cepstral coefficients, which are then processed with the quantile-based cepstral dynamics normalization (QCN) technique. The proposed features are evaluated with a hidden Markov model (HMM) classifier on the TIFR database for a Hindi phoneme classification task and on the TIMIT database for an English phoneme classification task, alongside mel-frequency cepstral coefficients (MFCC), power normalized cepstral coefficients (PNCC) and 24-band wavelet-based features, in clean and noisy environments. Different noises from the NOISEX-92 database are used to prepare the noisy database, with SNR ranging from 20 dB to 0 dB. The results show enhanced performance of the proposed features in all the considered cases. The simulations are performed in MATLAB 2015b. The performance of the proposed features is also evaluated on a speech recognition system based on the hidden Markov model toolkit (HTK). The comparative results confirm the robustness of the proposed features, with sufficient improvement over the other features examined in this paper.
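
The processing chain described in the abstract (wavelet packet spectrum, per-band weighting, logarithm, discrete cosine transform, QCN) can be sketched roughly as follows. This is a minimal pure-Python illustration, not the authors' implementation. It assumes a Haar filter pair and a full three-level packet tree (8 bands, chosen only to keep the example small). The per-band `weights` are supplied by the caller and merely stand in for the paper's nonlinear power function, whose form and constants the abstract does not give. The QCN step follows one common formulation (centre each cepstral dimension on the midpoint of a low and a high quantile, then scale by their distance), and the quantile level `q` is an assumption. All function names are hypothetical.

```python
import math

def haar_wavelet_packet(frame, levels):
    """Full wavelet packet tree using the Haar filter pair (an assumption).

    Returns 2**levels subbands in natural (tree) order, not frequency order.
    The frame length should be divisible by 2**levels.
    """
    s = 1.0 / math.sqrt(2.0)
    bands = [list(frame)]
    for _ in range(levels):
        next_bands = []
        for band in bands:
            pairs = range(0, len(band) - 1, 2)
            low = [(band[i] + band[i + 1]) * s for i in pairs]
            high = [(band[i] - band[i + 1]) * s for i in pairs]
            next_bands.extend([low, high])
        bands = next_bands
    return bands

def band_powers(frame, levels=3):
    """Mean power of each wavelet packet subband of one frame."""
    return [sum(x * x for x in b) / len(b)
            for b in haar_wavelet_packet(frame, levels)]

def dct2(values, n_coeffs):
    """Unnormalized DCT-II, keeping the first n_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(n_coeffs)]

def cepstrum(frame, levels=3, n_coeffs=4, weights=None, floor=1e-12):
    """Weighted band powers -> log -> DCT for a single frame.

    `weights` is a placeholder for the paper's power-function weighting;
    with weights=None every band is left unweighted.
    """
    powers = band_powers(frame, levels)
    if weights is None:
        weights = [1.0] * len(powers)
    log_spec = [math.log(max(w * p, floor)) for w, p in zip(weights, powers)]
    return dct2(log_spec, n_coeffs)

def quantile(sorted_vals, q):
    """Linearly interpolated quantile of an already sorted list."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1.0 - frac) + sorted_vals[hi] * frac

def qcn(cepstra, q=0.05):
    """Quantile-based normalization of each cepstral dimension over time."""
    out = [list(c) for c in cepstra]
    for d in range(len(cepstra[0])):
        track = sorted(c[d] for c in cepstra)
        q_lo, q_hi = quantile(track, q), quantile(track, 1.0 - q)
        centre = 0.5 * (q_lo + q_hi)
        spread = (q_hi - q_lo) or 1.0  # avoid dividing by zero on flat tracks
        for row in out:
            row[d] = (row[d] - centre) / spread
    return out

# Usage: three synthetic 16-sample frames -> cepstra -> QCN
frames = [[math.sin(0.3 * n + k) for n in range(16)] for k in range(3)]
features = qcn([cepstrum(f) for f in frames])
```

Because the Haar packet transform is orthonormal, the total energy across the subbands equals the energy of the input frame, which is a convenient sanity check before substituting a longer wavelet filter.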

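
The noisy-data preparation mentioned in the abstract (noise added at SNRs from 20 dB down to 0 dB) is commonly done by scaling the noise so that the ratio of speech power to noise power matches the target. The sketch below shows that scaling under stated assumptions: signals are plain lists of samples at the same sampling rate, power is measured over the whole utterance, and the function name `mix_at_snr` is hypothetical. The abstract does not say how the authors measured power or which SNR steps they used.

```python
import math

def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech` after scaling it to the requested SNR in dB.

    Power is averaged over the whole signal (an assumption); the noise is
    truncated to the speech length.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    p_speech = sum(x * x for x in speech) / len(speech)
    p_noise = sum(x * x for x in noise) / len(noise)
    if p_noise == 0.0:
        raise ValueError("noise has zero power")
    # Choose gain so that p_speech / (gain**2 * p_noise) == 10**(snr_db / 10)
    gain = math.sqrt(p_speech / (p_noise * 10.0 ** (snr_db / 10.0)))
    return [s + gain * n for s, n in zip(speech, noise)]

# Usage: the two endpoints of the SNR range quoted in the abstract
speech = [math.sin(0.2 * n) for n in range(200)]
noise = [((n * 7) % 5 - 2) / 2.0 for n in range(300)]
noisy_20db = mix_at_snr(speech, noise, 20.0)
noisy_0db = mix_at_snr(speech, noise, 0.0)
```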

Bibliographic details

Source
Circuits, Systems, and Signal Processing | 2019, Issue 11 | pp. 5149-5168 | 20 pages
Authors
Mishra Shipra; Chandra Mahesh
Author affiliation
Birla Inst Technol, Dept Elect & Commun Engn, Ranchi 835215, Bihar, India
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
Hindi phoneme; English phoneme; Wavelet packet decomposition; Nonlinear power function; QCN; HMM; HTK;

