A Brazilian Speech Database

Abstract

This work introduces the Brazilian Speech Database (BrSD), a novel, freely available dataset created to support the development of speech-based recognition tasks. As far as we know, this is the first Portuguese-language database with these characteristics to be created and made available to the research community. We also describe experiments conducted on BrSD that explore the classification tasks it supports, namely age group and gender classification. We use four well-known acoustic features extracted directly from the audio signal and one texture-based feature extracted from the spectrogram, a visual representation of the audio signal. We considered three classification scenarios: each feature individually, early fusion of the features, and late fusion of the features. Experiments were conducted using Support Vector Machine (SVM) and Multi-layer Perceptron (MLP) classifiers. The SVM classifier achieved the best recognition rates in both the early and late fusion scenarios. The best recognition rates were 91.25%, 88.75%, and 80.25% for the gender, age group, and age-gender classification tasks, respectively.
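
The difference between the early and late fusion scenarios can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state which combination rule was used for late fusion, so the sum rule below is an assumption, and the function names, feature values, and class scores are all hypothetical.

```python
from typing import Dict, List, Sequence


def early_fusion(feature_vectors: Sequence[Sequence[float]]) -> List[float]:
    """Early fusion: concatenate the per-descriptor vectors into one vector,
    which would then be passed to a single classifier (e.g. an SVM)."""
    fused: List[float] = []
    for vec in feature_vectors:
        fused.extend(vec)
    return fused


def late_fusion(score_sets: Sequence[Dict[str, float]]) -> str:
    """Late fusion (sum rule, an assumption): add up the per-class scores
    produced by one classifier per descriptor and pick the top class."""
    totals: Dict[str, float] = {}
    for scores in score_sets:
        for label, score in scores.items():
            totals[label] = totals.get(label, 0.0) + score
    return max(totals, key=lambda label: totals[label])


# Hypothetical descriptors for one utterance.
acoustic = [0.12, 0.80]        # stands in for an acoustic feature vector
texture = [0.33, 0.05, 0.61]   # stands in for a spectrogram texture vector
fused = early_fusion([acoustic, texture])

# Hypothetical per-class scores from three separately trained classifiers.
scores_a = {"female": 0.60, "male": 0.40}
scores_b = {"female": 0.30, "male": 0.70}
scores_c = {"female": 0.55, "male": 0.45}
decision = late_fusion([scores_a, scores_b, scores_c])
```

In early fusion a single model sees all descriptors at once, while in late fusion each descriptor gets its own model and only the outputs are combined. Other combination rules (product, max, majority vote) would fit the same `late_fusion` interface.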

Bibliographic details

Source
IEEE International Conference on Tools with Artificial Intelligence | 2018 | 526p | 8 pages in total
Authors
Marco A. D. Paulino; Yandre M. G. Costa; Alceu S. Britto; Alisson R. Svaigen; Linnyer B. R. Aylon; Luiz E. S. Oliveira
Original format: PDF
Chinese Library Classification: TP18-53
Keywords
Speech database; Speech recognition; Pattern recognition

Similar literature

1. A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System [J]. Mrs. Chhaya S. Patil, Prof. Dr. Vaishali B. Patil. International Journal of Engineering Research and Applications. 2017, No. 3
2. A Tool to Solve Sentence Segmentation Problem on Preparing Speech Database for Indonesian Text-to-speech System [J]. Mohammad Teduh Uliniansyah, Gunarso, Elvira Nurfadhilah. Procedia Computer Science. 2016, No. 1
3. Recognizing emotional speech in Persian: A validated database of Persian emotional speech (Persian ESD) [J]. Niloofar Keshtiari, Michael Kuhlmann, Moharram Eslami. Behavior Research Methods. 2015, No. 1
4. A Brazilian Speech Database [C]. Marco A. D. Paulino, Yandre M. G. Costa, Alceu S. Britto. IEEE International Conference on Tools with Artificial Intelligence. 2018
5. An analysis of the Pataxo pharmacopoeia of Bahia, Brazil using an object oriented database model [D]. Thomas, Michael Bradley. 2001
6. The impact of asthma in Brazil: a longitudinal analysis of data from a Brazilian national database system [O]. Thiago de Araujo Cardoso, Cristian Roncada, Emerson Rodrigues da Silva. 2017
7. Adult-Child Speech Interaction: Speech Database and Psychophysiological Experimental Data [O]. Elena Lyakso. 2019
8. RSRE (Royal Signals and Radar Establishment) Speech Database Recordings (1983). Part 2. Recording Made for Automatic Speech Recognition Assessment and Research [R]. Russell, M. J., Moore, R. K., Tomlinson, M. J. 1984
