Audio Features Selection for Automatic Height Estimation from Speech

Abstract

Aiming at the automatic estimation of a person's height from speech, we investigate the applicability of various subsets of speech features, which were formed by ranking the relevance and the individual quality of numerous audio features. Specifically, based on the relevance ranking of the large set of openSMILE audio descriptors, we selected subsets of different sizes and evaluated them on the height estimation task. In brief, during the speech parameterization process, every input utterance is converted to a single feature vector, which consists of 6552 parameters. Next, a subset of this feature vector is fed to a support vector machine (SVM)-based regression model, which aims at the direct estimation of the height of an unknown speaker. The experimental evaluation performed on the TIMIT database demonstrated that: (i) the feature vector composed of the top-50 ranked parameters provides a good trade-off between computational demands and accuracy, and that (ii) the best accuracy, in terms of mean absolute error and root mean square error, is observed for the top-200 subset.
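The ranking, top-k subset selection, and error measures mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not name the ranking criterion, so absolute Pearson correlation with the target is used here purely as an assumed stand-in. The SVM regression stage is left out, and the feature values, heights, and predictions are invented toy numbers, not TIMIT data. All function names are hypothetical.

```python
import math

def abs_pearson(xs, ys):
    """Absolute Pearson correlation; 0.0 when either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / math.sqrt(sxx * syy)

def rank_features(X, y):
    """Feature indices ordered from most to least relevant.

    Assumption: relevance = |Pearson correlation| with the target.
    The paper's actual ranking criterion is not given in the abstract.
    """
    n_features = len(X[0])
    scores = [abs_pearson([row[j] for row in X], y) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)

def select_top_k(X, ranking, k):
    """Keep only the k best-ranked columns of every feature vector."""
    keep = ranking[:k]
    return [[row[j] for j in keep] for row in X]

def mean_absolute_error(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def root_mean_square_error(y_true, y_pred):
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )

# Toy data: 4 utterances, 3 features each (invented values).
X = [
    [1.0, 5.0, 0.3],
    [2.0, 5.0, 0.1],
    [3.0, 5.0, 0.4],
    [4.0, 5.0, 0.5],
]
heights_cm = [160.0, 170.0, 180.0, 190.0]

ranking = rank_features(X, heights_cm)
X_top2 = select_top_k(X, ranking, 2)

# Stand-in predictions; in the paper these would come from the SVM regressor.
predicted_cm = [172.0, 177.0]
mae = mean_absolute_error([170.0, 180.0], predicted_cm)
rmse = root_mean_square_error([170.0, 180.0], predicted_cm)
```

In a fuller pipeline the selected columns would be passed to a regression model, and the same two error measures would be compared across subset sizes such as the top-50 and top-200 sets reported above.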


Bibliographic details

Source
Artificial intelligence: Theories, models and applications | 2010 | pp. 81-90 | 10 pages
Conference location: Athens (GR)
Authors
Todor Ganchev; Iosif Mporas; Nikos Fakotakis
Author affiliation

Artificial Intelligence Group, Wire Communications Laboratory, Dept. of Electrical and Computer Engineering, University of Patras, 26500 Rion-Patras, Greece

Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory
Keywords
height estimation from speech; speech parameterization; feature ranking; feature selection; SVM regression models


Similar documents

1. Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features [J]. Ben Alex Starlet, Mary Leena, Babu Ben P. Circuits, systems and signal processing. 2020, No. 11
2. Audio-visual feature fusion via deep neural networks for automatic speech recognition [J]. Mohammad Hasan Rahmani, Farshad Almasganj, Seyyed Ali Seyyedsalehi. Digital Signal Processing. 2018
3. Audio Features Selection for Automatic Height Estimation from Speech [C]. Todor Ganchev, Iosif Mporas, Nikos Fakotakis. Hellenic Conference on Artificial Intelligence. 2010
4. Ensemble feature selection for multi-stream automatic speech recognition [D]. Gelbart, David. 2008
5. Audio-Based System for Automatic Measurement of Jump Height in Sports Science [O]. Basilio Pueo, Jose J. Lopez, Jose M. Jimenez-Olmedo. 2019
6. AUDIO-VISUAL FEATURE INTEGRATION BASED ON PIECEWISE LINEAR TRANSFORMATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION [O]. Yosuke Kashiwagi, Masayuki Suzuki, Nobuaki Minematsu, 2013
