Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data

Abstract

End-to-end (E2E) systems are fast replacing conventional systems in automatic speech recognition. Because the target labels are learned directly from speech data, E2E systems need a larger corpus for effective training. In the code-switching task, E2E systems face two challenges: (i) the expansion of the target set due to the multiple languages involved, and (ii) the lack of a sufficiently large domain-specific corpus. To address these challenges, we propose an approach for reducing the number of target labels so that E2E systems can be trained reliably on limited data. The efficacy of the proposed approach is demonstrated on two prominent architectures, namely CTC-based and attention-based E2E networks. The experiments are performed on a recently created Hindi-English code-switching corpus. For contrast, results for an E2E system based on the full target set and for a hybrid DNN-HMM system are also reported.

Bibliographic details

Source
National Conference on Communications, 2020, pp. 1-5 (5 pages)
Authors
Kunal Dhawan; Ganji Sreeram; Kumar Priyadarshi; Rohit Sinha
Original format: PDF
Keywords
end-to-end speech recognition; code-switching; attention mechanism
