首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Model dimensionality selection in bilinear transformation for feature space MLLR rapid speaker adaptation
【24h】

Model dimensionality selection in bilinear transformation for feature space MLLR rapid speaker adaptation

机译:特征空间MLLR快速说话人自适应的双线性变换模型维数选择

获取原文
获取原文并翻译 | 示例

摘要

Bilinear models based feature space Maximum Likelihood Linear Regression (FMLLR) speaker adaptation have showed good performance especially when the amount of adaptation data is limited. However, the model dimensionality selection is very critical to the performance of bilinear models and need more work to find the optimal selection method. In this paper, we present an empirical study on this issue and suggest using a piecewise log-linear function to describe the relationship between the relatively optimal dimensionality parameter and the variant amount of data. This relationship can be used to efficiently select the bilinear model dimensionality in FMLLR speaker adaptation with the variant amount of data for each test speaker to improve recognition performance on the English voice control dataset.
机译:基于双线性模型的特征空间最大似然线性回归(FMLLR)说话人自适应已显示出良好的性能,尤其是在自适应数据量有限的情况下。但是,模型维数选择对于双线性模型的性能非常关键,需要更多的工作来找到最佳选择方法。在本文中,我们对这一问题进行了实证研究,并建议使用分段对数线性函数来描述相对最佳维数参数与数据量变化之间的关系。该关系可用于在FMLLR说话者自适应中有效选择双线性模型维,每个测试说话者的数据量各不相同,以提高英语语音控制数据集上的识别性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号