Emotion recognition of audio/speech data using deep learning approaches

Gupta Vedika; Juyal Stuti; Singh Gurvinder Pal; Killa Chirag; Gupta Nishant

首页> 外文期刊>Journal of information and optimization sciences >Emotion recognition of audio/speech data using deep learning approaches

【24h】

Emotion recognition of audio/speech data using deep learning approaches

机译：使用深度学习方法对音频/语音数据的情感认识

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Speech has been the most prominent and intelligent mode of sound. It is an effective medium to communicate the emotions and attitude in a particular language. Researchers have been extensively using speech to understand the emotions of a person. Contemplating emotions by intonation and pitch of voice and relative loudness can be used for enhancing human computer interaction. In this work, we have surveyed feature extraction techniques based on prosodic features or spectral feature extraction such as MFCC, LPCC, LPC, etc. and found out that MFCC extracts the best features for recognition of emotional content. This research work utilizes some of the best existing classification techniques for recognizing human emotions and conducts a detailed comparative analysis based on statistical and mathematical results. Finally, this paper proposes an optimal model DSCNN for raw spectrogram that resulted in an un-weighted accuracy of 61% for raw spectrogram (with noise) and 79% for clean spectrogram (without noise) for the enhancement of the human emotion evaluation system.

机译：演讲是最突出和最聪明的声音模式。它是一种有效的媒介，用于传达特定语言的情绪和态度。研究人员已经广泛使用言语来了解一个人的情绪。通过语调和语音和音调音高和相对响度的调节情绪可用于提高人力计算机互动。在这项工作中，我们通过MFCC，LPCC，LPC等韵律特征或光谱特征提取等进行了调查的特征提取技术，并发现MFCC提取了识别情绪内容的最佳特征。该研究工作利用一些用于识别人类情绪的最佳现有分类技术，并根据统计和数学结果进行详细的比较分析。最后，本文提出了用于原始谱图的最佳模型DSCNN，其原始谱图（具有噪声）的未加权精度为61％，对于人类情感评估系统的增强，对清洁谱图（没有噪声）的79％。

著录项

来源
《Journal of information and optimization sciences》 |2020年第6期|1309-1317|共9页
作者
Gupta Vedika; Juyal Stuti; Singh Gurvinder Pal; Killa Chirag; Gupta Nishant;
展开▼
作者单位

Bharati Vidyapeeths Coll Engn Dept Comp Sci & Engn New Delhi 110063 India;

Bharati Vidyapeeths Coll Engn Dept Comp Sci & Engn New Delhi 110063 India;

Bharati Vidyapeeths Coll Engn Dept Comp Sci & Engn New Delhi 110063 India;

Bharati Vidyapeeths Coll Engn Dept Comp Sci & Engn New Delhi 110063 India;

Bharati Vidyapeeths Coll Engn Dept Comp Sci & Engn New Delhi 110063 India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Convolutional neural network; Deep learning; Emotion recognition; Recurrent neural network; Speech;

机译：卷积神经网络;深入学习;情感识别;经常性神经网络;言语;

Emotion recognition of audio/speech data using deep learning approaches

摘要

著录项

相关主题

期刊订阅