ShEMO: a large-scale validated database for Persian speech emotion detection

Nezami Omid Mohamad; Lou Paria Jamshid; Karami Mansoureh

Abstract

This paper introduces a large-scale, validated database for Persian called the Sharif Emotional Speech Database (ShEMO). The database includes 3000 semi-natural utterances, equivalent to 3 h 25 min of speech data extracted from online radio plays. ShEMO covers speech samples from 87 native Persian speakers for five basic emotions (anger, fear, happiness, sadness and surprise) as well as the neutral state. Twelve annotators label the underlying emotional state of each utterance, and majority voting is used to decide the final labels. According to the kappa measure, the inter-annotator agreement is 64%, which is interpreted as substantial agreement. We also present benchmark results based on common classification methods for the speech emotion detection task. According to the experiments, a support vector machine achieves the best results for both the gender-independent model (58.2%) and the gender-dependent models (female = 59.4%, male = 57.6%). ShEMO will be available for academic purposes free of charge to provide a baseline for further research on Persian emotional speech.
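The labeling procedure described in the abstract (several annotators per utterance, majority voting, and a kappa agreement score) can be illustrated with a minimal Python sketch. The abstract does not say which kappa variant was used or how ties were resolved, so this sketch assumes Fleiss' kappa (a common choice for more than two raters) and returns no label on a tie. The function names and the toy votes are hypothetical and are not taken from ShEMO.

```python
from collections import Counter


def majority_label(votes):
    """Return the most frequent label among one utterance's votes, or None on a tie."""
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None  # tie handling is an assumption, not the paper's rule
    return ranked[0][0]


def fleiss_kappa(ratings, categories):
    """Fleiss' kappa for per-utterance vote lists with the same number of raters each."""
    n_items = len(ratings)
    n_raters = len(ratings[0])
    # counts[i][j] = number of raters who gave utterance i the category j
    counts = [[votes.count(c) for c in categories] for votes in ratings]
    # Observed agreement per utterance: share of rater pairs that agree.
    p_items = [
        (sum(k * k for k in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_items) / n_items
    # Chance agreement from the overall proportion of each category.
    totals = [sum(row[j] for row in counts) for j in range(len(categories))]
    p_e = sum((t / (n_items * n_raters)) ** 2 for t in totals)
    # Undefined (division by zero) if every vote falls in a single category.
    return (p_bar - p_e) / (1 - p_e)


# Toy votes from three hypothetical annotators on three utterances.
toy_votes = [
    ["anger", "anger", "anger"],
    ["sadness", "sadness", "neutral"],
    ["neutral", "neutral", "neutral"],
]
toy_categories = ["anger", "sadness", "neutral"]

final_labels = [majority_label(v) for v in toy_votes]
print(final_labels, round(fleiss_kappa(toy_votes, toy_categories), 3))
```

On the commonly used Landis and Koch scale, kappa values between 0.61 and 0.80 are read as substantial agreement, which matches how the abstract interprets its 64% figure.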


Bibliographic details

Source: Language Resources and Evaluation, 2019, Issue 1, pp. 1-16 (16 pages)
Authors: Nezami Omid Mohamad; Lou Paria Jamshid; Karami Mansoureh
Affiliations:
Islamic Azad Univ, Bijar Branch, Bijar, Iran
Sharif Univ Technol, Tehran, Iran
Sharif Univ Technol, Tehran, Iran
Original format: PDF
Language: English
Keywords: Emotional speech; Speech database; Emotion detection; Benchmark; Persian

