Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments

Zhang Zixing; Geiger Juergen; Pohjalainen Jouni; Mousa Amr El-Desoky; Jin Wenyu; Schuller Bjoern


Abstract

Eliminating the negative effect of non-stationary environmental noise is a long-standing research topic in automatic speech recognition and remains an important challenge. Data-driven supervised approaches, especially those based on deep neural networks, have recently emerged as potential alternatives to traditional unsupervised approaches and, with sufficient training, can alleviate the shortcomings of the unsupervised methods in various real-life acoustic environments. In this light, we review recently developed, representative deep learning approaches for tackling non-stationary additive and convolutional degradation of speech, with the aim of providing guidelines for those involved in the development of environmentally robust speech recognition systems. We separately discuss single- and multi-channel techniques developed for the front-end and back-end of speech recognition systems, as well as joint front-end and back-end training frameworks. We also discuss the pros and cons of these approaches and report their experimental results on benchmark databases. We expect that this overview can facilitate the development of more robust speech recognition systems for acoustically noisy environments.

Bibliographic record

Source
ACM transactions on intelligent systems | 2018, Issue 5 | 49.1-49.28 | 28 pages
Authors
Zhang Zixing; Geiger Juergen; Pohjalainen Jouni; Mousa Amr El-Desoky; Jin Wenyu; Schuller Bjoern
Author affiliations

Imperial Coll London Dept Comp Queens Gate 180 London SW7 2AZ England;

Huawei Technol Dusseldorf GmbH German Res Ctr Riesstr 25 D-80992 Munich Germany;

Univ Passau Chair Complex & Intelligent Syst Innstr 41 D-94032 Passau Germany;

Original format: PDF
Text language: English
Keywords
Robust speech recognition; deep learning; neural networks; non-stationary noise; multi-channel speech recognition;
