首页> 外文会议>Annual Conference of the International Speech Communication Association >Context adaptive neural network for rapid adaptation of deep CNN based acoustic models
【24h】

Context adaptive neural network for rapid adaptation of deep CNN based acoustic models

机译:背景自适应神经网络快速适应深层CNN基声学模型

获取原文

摘要

Using auxiliary input features has been seen as one of the most effective ways to adapt deep neural network (DNN)-based acoustic models to speaker or environment. However, this approach has several limitations. It only performs compensation of the bias term of the hidden layer and therefore does not fully exploit the network capabilities. Moreover, it may not be well suited for certain types of architectures such as convolutional neural networks (CNNs) because the auxiliary features have different time-frequency structures from speech features. This paper resolves these problems by extending the recently proposed context adaptive DNN (CA-DNN) framework to CNN architectures. A CA-DNN is a DNN with one or several layers factorized in sub-layers associated with an acoustic context class representing speaker or environment. The output of the factorized layer is obtained as the weighted sum of the contributions of each sub-layer, weighted by acoustic context weights that are derived from auxiliary features such as i-vectors. Importantly, a CA-DNN can compensate both bias and weight matrices. In this paper, we investigate the use of CA-DNN for deep CNN-based architectures. We demonstrate consistent performance gains for utterance level rapid adaptation on the AURORA4 task over a strong network-in-network based deep CNN architecture.
机译:使用辅助输入特征被视为将深神经网络(DNN)的最有效方法之一进行了一种,以便对扬声器或环境进行基础的声学模型。但是,这种方法有几个限制。它只执行隐藏层的偏置项的补偿,因此不会完全利用网络功能。此外,由于辅助特征具有来自语音特征的不同时频结构,因此可能对某些类型的架构(CNNS)进行了非常适合于诸如卷积神经网络(CNNS)。本文通过将最近提出的上下文自适应DNN(CA-DNN)框架扩展到CNN架构来解决这些问题。 CA-DNN是具有一个或多个层的DNN,其分解与表示扬声器或环境的声学上下文类相关联的子层。获得分解层的输出作为每个子层的贡献的加权之和,由从诸如i - vecors的辅助特征导出的声学上下文权重。重要的是,CA-DNN可以补偿偏置和权重矩阵。在本文中,我们研究了CA-DNN在基于CNN的深层建筑的使用。我们展示了在基于强大网络网络的深度CNN架构上对Aurora4任务的话语水平快速适应的一致性增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号