Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling

Abstract

This paper presents an automatic speaker state recognition approach which models the factor vectors in the latent factor analysis framework improving upon the Gaussian Mixture Model (GMM) baseline performance. We investigate both intoxicated and affective speaker states. We consider the affective speech signal as the original normal average speech signal being corrupted by the affective channel effects. Rather than reducing the channel variability to enhance the robustness as in the speaker verification task, we directly model the speaker state on the channel factors under the factor analysis framework. In this work, the speaker state factor vectors are extracted and modeled by the latent factor analysis approach in the GMM modeling framework and support vector machine classification method. Experimental results show that the proposed speaker state factor vector modeling system achieved 5.34% and 1.49% unweighted accuracy improvement over the GMM baseline on the intoxicated speech detection task (Alcohol Language Corpus) and the emotion recognition task (IEMOCAP database), respectively.
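
The abstract reports its gains in unweighted accuracy but does not define the metric. A minimal sketch follows, assuming the usual meaning in paralinguistic evaluation: the mean of per-class recalls, so that each class counts equally regardless of how many samples it has. The function names and the toy labels are hypothetical and are not taken from the paper or its corpora.

```python
from collections import defaultdict


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls (assumed definition; every class weighs the same)."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


def weighted_accuracy(y_true, y_pred):
    """Plain accuracy: fraction of all samples classified correctly."""
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return hits / len(y_true)


# Toy, imbalanced labels (hypothetical, for illustration only).
y_true = ["sober"] * 8 + ["intoxicated"] * 2
y_pred = ["sober"] * 8 + ["sober", "intoxicated"]

ua = unweighted_accuracy(y_true, y_pred)
wa = weighted_accuracy(y_true, y_pred)
print(f"unweighted accuracy = {ua:.2f}, weighted accuracy = {wa:.2f}")
```

On imbalanced data the two measures diverge: a classifier that favors the majority class keeps a high weighted accuracy while its unweighted accuracy drops, which is a common reason to report the unweighted figure for state-detection tasks.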

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 1937-1940 (4 pages)
Conference location: Kyoto (JP)
Authors
Li, Ming;
Author affiliation

Signal Analysis and Interpretation Laboratory, Department of Electrical Engineering, University of Southern California, Los Angeles, USA

Original format: PDF

