Emotion recognition in speech using MFCC with SVM, DSVM and auto-encoder

机译：使用带有SVM，DSVM和自动编码器的MFCC进行语音情感识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Emotions recognition from speech is one of the most important sub domains in the field of signal processing. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: 39 Mel-frequency Cepstral Coefficient (MFCC) coefficients and 65 MFCC features extracted based on the work of [20]. Secondly, we use the Support Vector Machine (SVM) as the main classifier engine since it is the most common technique in the field of speech recognition. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning, as well as the various types of auto-encoder (the basic auto-encoder and the stacked auto-encoder). A large set of experiments are conducted on the SAVEE audio database. The experimental results show that DSVM method outperforms the standard SVM with a classification rate of 69.84% and 68.25% using 39 MFCC, respectively. Additionally, the auto-encoder method outperforms the standard SVM, yielding a classification rate of 73.01%.

机译：来自语音的情感识别是信号处理领域中最重要的子领域之一。在这项工作中，我们的系统分为两个阶段，即特征提取和分类引擎。首先，研究了两组特征：基于[20]的工作提取的39个梅尔频率倒谱系数（MFCC）系数和65个MFCC特征。其次，我们使用支持向量机（SVM）作为主要的分类器引擎，因为它是语音识别领域中最常见的技术。除此之外，我们还研究了机器学习的最新进展的重要性，包括深度内核学习以及各种类型的自动编码器（基本自动编码器和堆叠式自动编码器）。在SAVEE音频数据库上进行了大量实验。实验结果表明，使用39 MFCC，DSVM方法的分类率分别为69.84 \％和68.25 \％，优于标准SVM。此外，自动编码器方法优于标准SVM，分类率为73.01 \％。

著录项

来源
《International Conference on Advanced Technologies for Signal and Image Processing》|2018年|1-5|共5页
会议地点
作者
Hadhami Aouani; Yassine Ben Ayed;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Mel frequency cepstral coefficient; Support vector machines; Emotion recognition; Feature extraction; Speech recognition; Kernel; Databases;

机译：梅尔频率倒谱系数;支持向量机;情感识别;特征提取;语音识别;内核;数据库;

相似文献

外文文献
中文文献
专利

1. Real Time Speech Recognition based on PWP Thresholding and MFCC using SVM [J] . W. Helali, Ζ. Hajaiej, A. Cherif Engineering Technology and Applied Science Research . 2020,第5期

机译：基于PWP阈值和MFCC的实时语音识别使用SVM
2. MFCC Based Enlargement of the Training Set for Emotion Recognition in Speech [J] . Inma Mohino-Herranz, Roberto Gil-Pita, Sagrario Alonso-Diaz, Signal & Image Processing : An International Journal (SIPIJ) . 2014,第1期

机译：基于MFCC的语音情感识别训练集的扩展
3. Speech Emotion Recognition Using Residual Phase and MFCC Features [J] . N.J. Nalini, S. Palanivel, M. Balasubramanian International Journal of Engineering and Technology . 2013,第6期

机译：语音情感识别使用残差阶段和MFCC功能
4. Emotion recognition in speech using MFCC with SVM, DSVM and auto-encoder [C] . Hadhami Aouani, Yassine Ben Ayed International Conference on Advanced Technologies for Signal and Image Processing . 2018

机译：使用MFCC使用SVM，DSVM和自动编码器的情感认可
5. A speech recognition IC with an efficient MFCC extraction algorithm and multi-mixture models. [D] . Han, Wei. 2006

机译：具有高效MFCC提取算法和多混合模型的语音识别IC。
6. Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN [O] . Lianzhang Zhu, Leiming Chen, Dehai Zhao, 2017

机译：SVM与DBN结合使用中文语音进行智能情感服务的情感识别
7. SVM Scheme for Speech Emotion Recognition using MFCC Feature [O] . A. Milton, S. Sharmy Roy, S. Tamil Selvi 2014

机译：使用mFCC特征的语音情感识别sVm方案

Emotion recognition in speech using MFCC with SVM, DSVM and auto-encoder

摘要

著录项

相似文献

相关主题

期刊订阅