...
首页> 外文期刊>Computer speech and language >Restricted Boltzmann machines for vector representation of speech in speaker recognition
【24h】

Restricted Boltzmann machines for vector representation of speech in speaker recognition

机译:说话人识别中用于语音矢量表示的受限玻尔兹曼机

获取原文
获取原文并翻译 | 示例
           

摘要

Over the last few years, i-vectors have been the state-of-the-art technique in speaker recognition. Recent advances in Deep Learning (DL) technology have improved the quality of i-vectors but the DL techniques in use are computationally expensive and need phonetically labeled background data. The aim of this work is to develop an efficient alternative vector representation of speech by keeping the computational cost as low as possible and avoiding phonetic labels, which are not always accessible. The proposed vectors will be based on both Gaussian Mixture Models (GMM) and Restricted Boltzmann Machines (RBM) and will be referred to as GMM-RBM vectors. The role of RBM is to learn the total speaker and session variability among background GMM supervectors. This RBM, which will be referred to as Universal RBM (URBM), will then be used to transform unseen supervectors to the proposed low dimensional vectors. The use of different activation functions for training the URBM and different transformation functions for extracting the proposed vectors are investigated. At the end, a variant of Rectified Linear Units (ReLU) which is referred to as variable ReLU (VReLU) is proposed. Experiments on the core test condition 5 of NIST SRE 2010 show that comparable results with conventional i-vectors are achieved with a clearly lower computational load in the vector extraction process.
机译:在过去的几年中,i向量已成为说话人识别的最新技术。深度学习(DL)技术的最新进展已经提高了i向量的质量,但是使用的DL技术在计算上非常昂贵,并且需要用语音标记的背景数据。这项工作的目的是通过保持尽可能低的计算成本并避免并非总是可访问的语音标签来开发有效的语音矢量替代表示。提出的矢量将基于高斯混合模型(GMM)和受限玻尔兹曼机(RBM),并将被称为GMM-RBM矢量。 RBM的作用是了解背景GMM超向量之间的说话人和会话的总变异性。然后,该RBM(将被称为通用RBM(URBM))将用于将看不见的超向量转换为建议的低维向量。研究了使用不同的激活函数来训练URBM,以及使用不同的转换函数来提取提出的向量。最后,提出了一种整流线性单元(ReLU)的变体,称为变量ReLU(VReLU)。在NIST SRE 2010的核心测试条件5上进行的实验表明,在向量提取过程中,计算量明显较低,可以实现与常规i-vector相当的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号