Bottleneck Feature-Based Hybrid Deep Autoencoder Approach for Indian Language Identification

Himanish Shekhar Das; Pinki Roy

首页> 外文期刊>Arabian Journal for Science and Engineering. Section A, Sciences >Bottleneck Feature-Based Hybrid Deep Autoencoder Approach for Indian Language Identification

【24h】

Bottleneck Feature-Based Hybrid Deep Autoencoder Approach for Indian Language Identification

机译：基于瓶颈特征的混合深度自动编码器用于印度语言识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Latest and emerging approaches are essential to resolve the communication barrier among different languages in speechprocessing. The automatic language identification system is developed to identify the spoken language from speech utterances.Feature selection is a very challenging task in language identification. In this paper, bottleneck feature-based hybrid deepautoencoder approach is proposed to identify the given speech signal with corresponding language features. In the proposedapproach, initially Mel-frequency cepstral coefficients, linear prediction coefficients, and shifted delta coefficients featuresare directly extracted from multilingual speech utterances. Further, we extracted bottleneck feature from the bottleneck layerof the bottleneck deep neural network. Initially, recognition rate has been evaluated for each feature set to find out the bestfeature. Finally, the best feature along with other features is used as the input for deep autoencoder with softmax regressionto identify the language based on class labels. The deep autoencoder is fine-tuned to reach the global optimum through Jayaoptimization algorithm. To carry out the experiments, the recorded database is used for four Indian languages with specialemphasis on northeastern languages. The experimental results demonstrate that the proposed hybrid approach using bottleneckfeature with shifted delta coefficients is performing well with 97.10% accuracy. Moreover, experimental results also show thatproposed hybrid approach gives superior outcome when compared with the baseline deep neural network-based approaches.

机译：最新的和新兴的方法对于解决语音处理中不同语言之间的交流障碍至关重要。开发自动语言识别系统以从语音中识别口语。特征选择是语言识别中一项非常艰巨的任务。本文提出了一种基于瓶颈特征的混合深度自动编码器方法，以识别具有相应语言特征的语音信号。在提出的方法中，首先从多语言语音中直接提取梅尔频率倒谱系数，线性预测系数和位移增量系数特征。此外，我们从瓶颈深度神经网络的瓶颈层提取了瓶颈特征。最初，已经对每个功能集的识别率进行了评估，以找出最佳功能。最后，最佳功能与其他功能一起用作具有softmax回归的深度自动编码器的输入，以基于类标签识别语言。深度自动编码器通过Jayaoptimization算法进行了微调，以达到全局最优。为了进行实验，记录的数据库用于四种印度语言，其中东北语言特别强调。实验结果表明，所提出的使用瓶颈特征和偏移增量系数的混合方法以97.10％的精度表现良好。此外，实验结果还表明，与基于基线深层神经网络的方法相比，提出的混合方法可提供更好的结果。

著录项

来源
《Arabian Journal for Science and Engineering. Section A, Sciences》 |2020年第4期|3425-3436|共12页
作者
Himanish Shekhar Das; Pinki Roy;
展开▼
作者单位

Department of Computer Science and Engineering National Institute of Technology Silchar Silchar Assam 788010 India;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech processing; Language identification; Bottleneck feature; Deep autoencoder; Softmax regression; Jaya optimization;

机译：语音处理;语言识别;瓶颈功能;深度自动编码器;Softmax回归;Jaya优化;

相似文献

外文文献
中文文献
专利

1. Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification [J] . Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai, EURASIP journal on audio, speech, and music processing . 2015,第1期

机译：基于深度神经网络的瓶颈特征和基于去噪自动编码器的去混响用于远距离说话者识别
2. Short Utterance Based Speech Language Identification in Intelligent Vehicles With Time-Scale Modifications and Deep Bottleneck Features [J] . Ma Zhanyu, Yu Hong, Chen Wei, IEEE Transactions on Vehicular Technology . 2019,第1期

机译：具有时标修改和深瓶颈特征的智能汽车中基于短说话的语音语言识别
3. Brain tumor classification using a hybrid deep autoencoder with Bayesian fuzzy clustering-based segmentation approach [J] . Biocybernetics and biomedical engineering . 2020,第1期

机译：脑肿瘤分类采用哈斯尼亚模糊聚类的分割方法使用杂交深度自动化器
4. Decision Level Fusion based Approach for Indian Languages Identification using Deep Neural Network [C] . Kanika Gupta, Kartikeya Singh Gour, Sompal Arya, IEEE Region 10 Conference . 2018

机译：基于决策级融合的深度神经网络印度语言识别方法
5. Feature-based Generative Design of Mechanisms with a Variational Autoencoder [D] . Brandt, Benjamin F. 2020

机译：基于功能的变形AutoEncoder机制的生成设计
6. Deep Bottleneck Features for Spoken Language Identification [O] . Bing Jiang, Yan Song, Si Wei, -1

机译：口语识别的深层瓶颈功能
7. Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification [O] . Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai, 2015

机译：基于深度神经网络的瓶颈特征和基于去噪自动编码器的去混响用于远距离说话者识别

Bottleneck Feature-Based Hybrid Deep Autoencoder Approach for Indian Language Identification

摘要

著录项

相似文献

相关主题

期刊订阅