Multitask speaker profiling for estimating age, height, weight and smoking habits from spontaneous telephone speech signals

机译：多任务扬声器配置文件，用于根据自发的电话语音信号估算年龄，身高，体重和吸烟习惯

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a novel approach for automatic estimation of four important traits of speakers, namely age, height, weight and smoking habit, from speech signals. In this method, each utterance is modeled using the i-vector framework which is based on the factor analysis on Gaussian Mixture Model (GMM) mean supervectors, and the Non-negative Factor Analysis (NFA) framework which is based on a constrained factor analysis on GMM weights. Then, Artificial Neural Networks (ANNs) and Least Squares Support Vector Regression (LSSVR) are employed to estimate age, height and weight of speakers from given utterances, and ANNs and logistic regression (LR) are utilized to perform smoking habit detection. Since GMM weights provide complementary information to GMM means, a score-level fusion of the i-vector-based and the NFA-based recognizers is considered for age and smoking habit estimation tasks to improve the performance. In addition, a multitask speaker profiling approach is proposed to evaluate the correlated tasks simultaneously and in interaction with each other, and consequently, to boost the accuracy in speaker age, height, weight and smoking habit estimations. To this end, a hybrid architecture involving the score-level fusion of the i-vector-based and the NFA-based recognizers is proposed to exploit the available information in both Gaussian means and Gaussian weights. ANNs are then employed to share the learned information with all tasks while they are learned in parallel. The proposed method is evaluated on telephone speech signals of National Institute for Standards and Technology (NIST) 2008 and 2010 Speaker Recognition Evaluation (SRE) corpora. Experimental results over 1194 utterances show the effectiveness of the proposed method in automatic speaker profiling.

机译：本文提出了一种新颖的方法，可以根据语音信号自动估计说话者的四个重要特征，即年龄，身高，体重和吸烟习惯。在这种方法中，使用基于高斯混合模型（GMM）平均超向量的因子分析的i-vector框架和基于约束因子分析的非负因子分析（NFA）框架对每种话语进行建模在GMM重量上。然后，使用人工神经网络（ANN）和最小二乘支持向量回归（LSSVR）来根据给定发音估算说话者的年龄，身高和体重，并利用ANN和Logistic回归（LR）进行吸烟习惯检测。由于GMM权重为GMM手段提供了补充信息，因此考虑将基于i向量的识别器和基于NFA的识别器进行得分级融合，以提高年龄和吸烟习惯估计任务。另外，提出了一种多任务说话者概要分析方法，以同时并相互交互地评估相关任务，从而提高了说话者年龄，身高，体重和吸烟习惯估计的准确性。为此，提出了一种混合架构，该架构涉及基于i向量的识别器和基于NFA的识别器的分数级别融合，以利用高斯均值和高斯权重中的可用信息。然后，在并行学习ANN的同时，将它们与所有任务共享学习的信息。该方法在美国国家标准技术研究院（NIST）2008和2010说话者识别评估（SRE）语料库的电话语音信号上进行了评估。超过1194次发声的实验结果证明了该方法在自动说话人特征分析中的有效性。

著录项

来源
《International conference on computer and knowledge engineering》|2014年|7-12|共6页
会议地点
作者
Poorjam Amir Hossein; Bahari Mohamad Hasan; Van hamme Hugo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Estimation; Kernel; Speech; Support vector machine classification; Testing; Training; Vectors; Artificial Neural Networks; Multitask Speaker Characterization; Non-negative Factor Analysis; i-vector;

机译：估计;核;语音;支持向量机分类;测试;训练;向量;人工神经网络;多任务说话者表征;非负因素分析; i-向量;

相似文献

外文文献
中文文献
专利

1. Can you hear my age? Influences of speech rate and speech spontaneity on estimation of speaker age [J] . Sara Skoog Waller, M?￥rten Eriksson, Patrik S??rqvist Frontiers in Psychology . 2015,第4期

机译：你能听到我的年龄吗？语速和言语自发性对说话人年龄估计的影响
2. Speaker clustering using telephone speech database of a large number of speakers [J] . Tsuneo Kato, Shingo Kuroiwa, Tohru Shimizu, 電子情報通信学会技術研究報告. 音声. Speech . 2000,第136期

机译：使用大量演讲者的电话语音数据库进行演讲者聚类
3. Speaker clustering using telephone speech database of a large number of speakers [J] . Tsuneo Kato, Shingo Kuroiwa, Tohru Shimizu, 電子情報通信学会技術研究報告. 音声. Speech . 2000,第136期

机译：扬声器聚类使用大量扬声器的电话语音数据库
4. Accuracy of automatic speaker recognition for telephone speech signal quality [C] . 8th International Symposium on Intelligent Systems and Informatics . 2010

机译：自动语音识别器对电话语音信号质量的准确性
5. HEMATOCRIT AND HEMOGLOBIN, ATP AND DPG CONCENTRATIONS IN ANDEAN MAN: VARIABILITY BY SEX, AGE, VILLAGE, ALTITUDE, WEIGHT, SMOKING HABITS AND GENETIC CONSTITUTION [D] . CLENCH-AAS, JOCELYNE MARGUERITE RIGAUD 1980

机译：安第斯人的血细胞比容和血红蛋白，ATP和DPG浓度：按性别，年龄，村庄，海拔，体重，吸烟习惯和遗传组成的差异
6. Can you hear my age? Influences of speech rate and speech spontaneity on estimation of speaker age [O] . Sara Skoog Waller, Mårten Eriksson, Patrik Sörqvist -1

机译：你能听到我的年龄吗？语速和言语自发性对说话人年龄估计的影响
7. Multitask speaker profiling for estimating age, height, weight and smoking habits from spontaneous telephone speech signals [O] . Poorjam Amir Hossein, Bahari Mohamad Hasan, Van hamme Hugo 2014

机译：多任务扬声器配置文件，用于根据自发的电话语音信号估算年龄，身高，体重和吸烟习惯

Multitask speaker profiling for estimating age, height, weight and smoking habits from spontaneous telephone speech signals

摘要

著录项

相似文献

相关主题

期刊订阅