Tibetan acoustic model research based on TDNN

Abstract

Deep neural networks (DNNs) have significantly improved Tibetan speech recognition, but performance still needs improvement compared with Mandarin, English, and other languages. This paper examines a DNN-based Tibetan acoustic model and extracts i-vector features by modeling the speaker in the feature space. After combining MFCCs with the i-vector features, we train a Tibetan acoustic model based on a time-delay neural network (TDNN), which outperforms the deep neural network model. We also study transfer learning from Mandarin to Tibetan and demonstrate its effectiveness.
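
The feature combination described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (append_ivector, tdnn_layer), the feature sizes (40-dimensional MFCCs, a 100-dimensional i-vector), the context offsets, and the random weights are all assumptions chosen for illustration. The sketch tiles one utterance-level i-vector onto every MFCC frame, then applies a single TDNN-style layer that splices neighboring frames before an affine transform and ReLU.

```python
import numpy as np

def append_ivector(mfcc, ivector):
    """Tile one utterance-level i-vector onto every MFCC frame."""
    tiled = np.tile(ivector, (mfcc.shape[0], 1))
    return np.concatenate([mfcc, tiled], axis=1)

def tdnn_layer(x, weight, bias, offsets):
    """One TDNN-style layer: splice frames at the given offsets, then affine + ReLU."""
    num_frames = x.shape[0]
    # Clamp indices at the utterance edges, so edge frames are repeated.
    spliced = [
        x[np.clip(np.arange(num_frames) + o, 0, num_frames - 1)]
        for o in offsets
    ]
    spliced = np.concatenate(spliced, axis=1)
    return np.maximum(spliced @ weight + bias, 0.0)

rng = np.random.default_rng(0)
mfcc = rng.standard_normal((200, 40))     # 200 frames, 40-dim MFCCs (assumed size)
ivector = rng.standard_normal(100)        # 100-dim i-vector (assumed size)
features = append_ivector(mfcc, ivector)  # shape (200, 140)

offsets = (-2, -1, 0, 1, 2)               # assumed temporal context
weight = rng.standard_normal((features.shape[1] * len(offsets), 64)) * 0.01
bias = np.zeros(64)                       # untrained, illustrative parameters
hidden = tdnn_layer(features, weight, bias, offsets)  # shape (200, 64)
```

Because the i-vector is constant over the utterance, every frame carries the same speaker information, which is what lets the network adapt to the speaker; the spliced offsets are what give the layer its time-delay context.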

Bibliographic details

Source
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018, pp. 601-604 (4 pages)
Authors
Jinghao Yan; Hongzhi Yu; Guanyu Li
Original format: PDF
Keywords
Hidden Markov models; Feature extraction; Adaptation models; Speech recognition; Acoustics; Neural networks; Context modeling

Similar literature

1. Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning [J]. Baojia Gong, Rangzhuoma Cai, Zhijie Cai. MATEC Web of Conferences. 2021, no. a
2. Domain adaptation of lattice-free MMI based TDNN models for speech recognition [J]. Yanhua Long, Yijie Li, Hone Ye. International Journal of Speech Technology. 2017, no. 1
3. Stochastic RUL Calculation Enhanced With TDNN-Based IGBT Failure Modeling [J]. Alireza Alghassi, Suresh Perinpanayagam, Mohammad Samie. IEEE Transactions on Reliability. 2016, no. 2
4. Tibetan acoustic model research based on TDNN [C]. Jinghao Yan, Hongzhi Yu, Guanyu Li. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference. 2018
5. The role of stress in Tibetan tonogenesis: A study in historical comparative acoustics [D]. Caplow, Nancy Jill. 2009
6. TDNN-Based Engine In-Cylinder Pressure Estimation from Shaft Velocity Spectral Representation [O]. Andrés F. Valencia-Duque, David A. Cárdenas-Peña, Andrés M. Álvarez-Meza. 2021
7. Recognition of Acoustic Emission Signal based on the Algorithms of TDNN and GMM [O]. Aidong Deng, Hao Cao, Hang Tong. 2014
