Journal: IEEE Transactions on Neural Networks and Learning Systems

A Unified Framework for Multilingual Speech Recognition in Air Traffic Control Systems


Abstract

This work focuses on robust speech recognition in air traffic control (ATC) by designing a novel processing paradigm that integrates multilingual speech recognition into a single framework using three cascaded modules: an acoustic model (AM), a pronunciation model (PM), and a language model (LM). The AM converts ATC speech into phoneme-based text sequences, which the PM then translates into word-based sequences, the ultimate goal of this research. The LM corrects both phoneme- and word-based errors in the decoding results. The AM, comprising a convolutional neural network (CNN) and a recurrent neural network (RNN), captures the spatial and temporal dependencies of the speech features and is trained with the connectionist temporal classification (CTC) loss. To cope with radio transmission noise and diversity among speakers, a multiscale CNN architecture is proposed to fit the diverse data distributions and improve performance. Phoneme-to-word translation is addressed by a proposed machine-translation PM with an encoder-decoder architecture. RNN-based LMs are trained to capture the code-switching specificity of ATC speech by modeling dependencies with common words. We validate the proposed approach on large amounts of real Chinese and English ATC recordings and achieve a 3.95% label error rate on Chinese characters and English words, outperforming other popular approaches. The decoding efficiency is also comparable to that of the end-to-end model, and its generalizability is validated on several open corpora, making the framework suitable for real-time use in further ATC applications, such as ATC prediction and safety checking.
