On the use of deep feedforward neural networks for automatic language identification

Ignacio Lopez-Moreno; Javier Gonzalez-Dominguez; David Martinez; Oldrich Plchot; Joaquin Gonzalez-Rodriguez; Pedro J. Moreno

Abstract

In this work, we present a comprehensive study on the use of deep neural networks (DNNs) for automatic language identification (LID). Motivated by the recent success of DNNs in acoustic modeling for speech recognition, we adapt DNNs to the problem of identifying the language of a given utterance from its short-term acoustic features. We propose two different DNN-based approaches. In the first, the DNN acts as an end-to-end LID classifier, receiving the speech features as input and providing the estimated probabilities of the target languages as output. In the second, the DNN is used to extract bottleneck features that are then used as inputs to a state-of-the-art i-vector system. Experiments are conducted in two different scenarios: the complete NIST Language Recognition Evaluation 2009 (LRE'09) dataset, and a subset of the Voice of America (VOA) data from LRE'09 in which all languages have the same amount of training data. Results on both datasets demonstrate that the DNN-based systems significantly outperform a state-of-the-art i-vector system on short-duration utterances. Furthermore, combining the DNN-based and the classical i-vector systems leads to additional performance improvements (relative improvements of up to 45% in EER and C_avg on the 3 s and 10 s conditions, respectively).
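The first approach and the reported metric can be illustrated with a minimal sketch. It assumes that a DNN has already produced per-frame language posteriors and that an utterance-level decision is obtained by averaging the per-frame log-posteriors; the abstract does not state how frame outputs are combined, so this pooling rule is an assumption. The function names, language labels, and all numbers are hypothetical.

```python
import math


def utterance_log_scores(frame_posteriors):
    """Average per-frame log-posteriors into one score per language.

    Assumption: frame outputs are pooled by averaging log-probabilities;
    the abstract does not specify the pooling rule.
    """
    n_frames = len(frame_posteriors)
    n_langs = len(frame_posteriors[0])
    return [
        sum(math.log(frame[k]) for frame in frame_posteriors) / n_frames
        for k in range(n_langs)
    ]


def identify(frame_posteriors, languages):
    """Return the language with the highest utterance-level score."""
    scores = utterance_log_scores(frame_posteriors)
    best = max(range(len(languages)), key=lambda k: scores[k])
    return languages[best]


def relative_improvement(baseline, system):
    """Relative reduction of an error metric such as EER or C_avg."""
    return (baseline - system) / baseline


# Hypothetical posteriors for three frames over three languages.
frames = [
    [0.6, 0.3, 0.1],
    [0.5, 0.4, 0.1],
    [0.2, 0.7, 0.1],
]
languages = ["en", "es", "fr"]
print(identify(frames, languages))
```

Under this reading, a 45% relative improvement means the error metric drops to 55% of its baseline value, for example `relative_improvement(10.0, 5.5)` for a hypothetical fall from 10.0 to 5.5.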

Bibliographic record

Source
Computer Speech and Language | 2016, Issue 11 | pp. 46-59 | 14 pages
Authors
Ignacio Lopez-Moreno; Javier Gonzalez-Dominguez; David Martinez; Oldrich Plchot; Joaquin Gonzalez-Rodriguez; Pedro J. Moreno;
Author affiliations

Google Inc., 76 Ninth Ave., New York, NY 10011, USA;

ATVS-Biometric Recognition Group, Universidad Autonoma de Madrid, Madrid, Spain;

I3A, Zaragoza, Spain;

Brno University of Technology, Brno, Czech Republic;

ATVS-Biometric Recognition Group, Universidad Autonoma de Madrid, Madrid, Spain;

Google Inc., New York, USA;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA;
Original format: PDF
Text language: English
Keywords
LID; DNN; Bottleneck; i-vectors;
