Using a PCA-based dataset similarity measure to improve cross-corpus emotion recognition

Ingo Siegert; Ronald Böck; Andreas Wendemuth

首页> 外文期刊>Computer speech and language >Using a PCA-based dataset similarity measure to improve cross-corpus emotion recognition

【24h】

Using a PCA-based dataset similarity measure to improve cross-corpus emotion recognition

机译：使用基于PCA的数据集相似性度量来改善跨主体情感识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In emotion recognition from speech, huge amounts of training material are needed for the development of classification engines. As most current corpora do not supply enough material, a combination of different datasets is advisable. Unfortunately, data recording is done differently and various emotion elicitation and emotion annotation methods are used. Therefore, a combination of corpora is usually not possible without further effort. The manuscript’s aim is to answer the question which corpora are similar enough to jointly be used as training material. A corpus similarity measure based on PCA-ranked features is presented and similar datasets are identified. To evaluate our method we used nine well-known benchmark corpora and automatically identified a sub-set of six most similar datasets. To test that the identified most similar six datasets influence the classification performance, we conducted several cross-corpora emotion recognition experiments comparing our identified six most similar datasets with other combinations. Our most similar sub-set outperforms all other combinations of corpora, the combination of all nine datasets as well as feature normalization techniques. Also influencing side-effects on the recognition rate were excluded. Finally, the predictive power of our measure is shown: increasing similarity score, expressing decreasing similarity, result in decreasing recognition rates. Thus, our similarity measure answers the question which corpora should be included into joint training.

机译：在语音识别中，分类引擎的开发需要大量的培训材料。由于当前大多数语料库不能提供足够的材料，因此建议使用不同数据集的组合。不幸的是，数据记录的方式有所不同，并且使用了各种情感启发和情感注释方法。因此，没有更多的努力，通常不可能合并语料库。该手稿的目的是回答一个问题，即足够相似的语料库可以共同用作培训材料。提出了基于PCA排序特征的语料库相似性度量，并识别了相似的数据集。为了评估我们的方法，我们使用了9个著名的基准语料库，并自动识别了6个最相似的数据集的子集。为了测试识别出的最相似的六个数据集对分类性能的影响，我们进行了几次跨语料库情感识别实验，将识别出的六个最相似的数据集与其他组合进行了比较。我们最相似的子集优于所有其他语料库组合，所有九个数据集的组合以及特征归一化技术。还排除了影响识别率的副作用。最后，显示了我们度量的预测能力：增加相似度得分，表示减少相似度，导致降低识别率。因此，我们的相似性度量回答了应该在联合训练中包括哪个语料库的问题。

著录项

来源
《Computer speech and language》 |2018年第9期|1-23|共23页
作者
Ingo Siegert; Ronald Böck; Andreas Wendemuth;
展开▼
作者单位

Cognitive Systems Group, Faculty of Electrical Engineering and Information Technology, Otto von Guericke University;

Cognitive Systems Group, Faculty of Electrical Engineering and Information Technology, Otto von Guericke University,Center for Behavioral Brain Sciences;

Cognitive Systems Group, Faculty of Electrical Engineering and Information Technology, Otto von Guericke University,Center for Behavioral Brain Sciences;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
PCA; Dataset similarity; Cross-corpus emotion recognition; Automatic similarity scoring;

机译：PCA;数据集相似度;跨主体情感识别;自动相似度评分;

相似文献

外文文献
中文文献
专利

1. Improving relevant subjective testing for validation: Comparing machine learning algorithms for finding similarities in VQA datasets using objective measures [J] . Aldahdooh Ahmed, Masala Enrico, Van Wallendael Glenn, Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing . 2019,第期

机译：提高验证相关主观测试：使用客观措施将机器学习算法与VQA数据集中的相似性进行比较
2. Transfer Sparse Discriminant Subspace Learning for Cross-Corpus Speech Emotion Recognition [J] . Weijian Zhang, Peng Song Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2020,第期

机译：转移稀疏判别子空间学习跨语料库语音情感识别
3. Cross-Corpus Speech Emotion Recognition Based on Deep Domain-Adaptive Convolutional Neural Network [J] . Jiateng LIU, Wenming ZHENG, Yuan ZONG, IEICE transactions on information and systems . 2020,第2期

机译：基于深域自适应卷积神经网络的交叉语料库语音情感识别
4. f-Similarity Preservation Loss for Soft Labels: A Demonstration on Cross-Corpus Speech Emotion Recognition [C] . Biqiao Zhang, Yuqing Kong, Georg Essl, AAAI Conference on Artificial Intelligence . 2019

机译：软标签的F相似度保存损失：交叉语料库语音情感识别的演示
5. Similarity measures and indexing methods for time series and multiclass recognition. [D] . Stefan, Alexandra. 2012

机译：时间序列和多类识别的相似性度量和索引方法。
6. Fusing Visual Attention CNN and Bag of Visual Words for Cross-Corpus Speech Emotion Recognition [O] . Minji Seo, Myungho Kim 2020

机译：融合视觉关注CNN和跨语料语音情感识别的视觉词语
7. f-Similarity Preservation Loss for Soft Labels: A Demonstration on Cross-Corpus Speech Emotion Recognition [O] . Biqiao Zhang, Yuqing Kong, Georg Essl, 2019

机译：软标签的F相似度保存损失：交叉语料库语音情感识别的演示
8. Quantifying Similarity and Distance Measures for Vector-Based Datasets: Histograms, Signals, and Probability Distribution Functions. [R] . Tschopp, M. A., Hernandez-Rivera, E. 2017

机译：量化基于矢量的数据集的相似性和距离度量：直方图，信号和概率分布函数。

Using a PCA-based dataset similarity measure to improve cross-corpus emotion recognition

摘要

著录项

相似文献

相关主题

期刊订阅