Deep ad-hoc beamforming

Xiao-Lei Zhang

首页> 外文期刊>Computer speech and language >Deep ad-hoc beamforming

【24h】

Deep ad-hoc beamforming

机译：深度ad-hoc波束成形

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Far-field speech processing is an important and challenging problem. In this paper, we propose deep ad-hoc beamforming, a deep-learning-based multichannel speech enhancement framework based on ad-hoc microphone arrays, to address the problem. It contains three novel components. First, it combines ad-hoc microphone arrays with deep-learning-based multichannel speech enhancement, which reduces the probability of the occurrence of far-field acoustic environments significantly. Second, it groups the microphones around the speech source to a local microphone array by a supervised channel selection framework based on deep neural networks. Third, it develops a simple time synchronization framework to synchronize the channels that have different time delay. Besides the above novelties and advantages, the proposed model is also trained in single-channel fashion, so that it can easily employ new development of speech processing techniques. Its test stage is also flexible in incorporating any number of microphones without retraining or modifying the framework. We have developed many implementations of the proposed framework and conducted an extensive experiment in scenarios where the locations of the speech sources are far-field, random, and blind to the microphones. Results on speech enhancement tasks show that our method outperforms its counterpart that works with linear microphone arrays by a considerable margin in both diffuse noise reverberant environments and point source noise reverberant environments. We have also tested the framework with different handcrafted features. Results show that although good features lead to high performance, they do not affect the conclusion on the effectiveness of the proposed framework.

机译：远场语音处理是一个重要和具有挑战性的问题。在本文中，我们提出了基于Ad-Hoc麦克风阵列的深度学习的多通道语音增强框架的深度ad-hoc波束成形，解决了问题。它包含三个新型组件。首先，它将Ad-hoc麦克风阵列与基于深度学习的多通道语音增强结合，这显着降低了远场声学环境的发生概率。其次，它通过基于深神经网络的监督信道选择框架将语音源周围的麦克风围绕局部麦克风阵列。第三，它开发了一个简单的时间同步框架，可以同步具有不同时间延迟的通道。除了上述Noveltize和优点外，拟议的型号也在单通道时尚培训，因此它可以轻松地采用语音处理技术的新开发。其测试阶段也可以在不重新培训或修改框架的情况下掺入任何数量的麦克风。我们已经开发了许多拟议框架的实现，并在语音源的位置是远场的场景中进行了广泛的实验，对麦克风盲目。语音增强任务的结果表明，我们的方法优于其对应物，其对应于漫射噪声混响环境和点源噪声混响环境中的相当数利润率。我们还测试了不同的手工特征的框架。结果表明，虽然良好的功能导致高性能高，但它们不会影响提出框架的有效性的结论。

著录项

来源
《Computer speech and language》 |2021年第7期|101201.1-101201.18|共18页
作者
Xiao-Lei Zhang;
展开▼
作者单位

Research & Development Institute of Northwestern Polytechnical University in Shenzhen Shenzhen China Center for Intelligent Acoustics and Immersive Communications School of Marine Science and Technology Northwestern Polytechnical University China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Adaptive beamforming; Ad-hoc microphone array; Channel selection; Deep learning; Distributed microphone array;

机译：自适应波束形成;ad-hoc麦克风阵列;频道选择;深度学习;分布式麦克风阵列;

相似文献

外文文献
中文文献
专利

1. Simulation modeling and analysis of the hop count distribution in cognitive radio ad-hoc networks with beamforming [J] . Le The Dung, Choi Seong-Gon Simulation modelling practice and theory: International journal of the Federation of European Simulation Societies . 2018,第期

机译：仿真建模与波束形成中的认知无线电广告网络跳数分析
2. Distributed phase-shift beamforming power balancing in ad-hoc and sensor networks [J] . Alexandru Mihnea Moucha, Viktor Cerny, Jan Kubr, Telecommunication systems: Modeling, Analysis, Design and Management . 2014,第4期

机译：临时和传感器网络中的分布式相移波束形成功率平衡
3. Outage probability analysis for multiple input??multiple output ad-hoc network with quantised beamforming [J] . Ananthi G. Communications, IET . 2012,第7期

机译：具有量化波束形成的多输入多输出自组织网络中断概率分析
4. The Air-Ground Integrated MIMO Cooperative Relay Beamforming Wireless Ad-Hoc Network Technology Research That Based on Maximum Ratio Combining [C] . Zhifang Wang, Junguo Dong, Jianguo Yu, . 2020

机译：基于最大比率组合的空地集成MIMO协作中继波束成形无线自组网技术研究
5. Novel hardware implementation and adaptive beamforming algorithms for microwave beamforming structure. [D] . Farzaneh Koodiani, Sadegh. 2008

机译：微波波束成形结构的新型硬件实现和自适应波束成形算法。
6. Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments [O] . Kyoungjin Noh, Joon-Hyuk Chang 2020

机译：基于深度神经网络的混响和波束成形的联合优化用于多通道环境中的声音事件检测
7. Physical size of microphone arrays in ad-hoc beamforming [O] . Tinakari Aki 2017

机译：ad-hoc波束成形中麦克风阵列的物理尺寸
8. Awareness of Emerging Wireless Technologies: Ad-hoc and Personal Area Networks Standards and Emerging Technologies (Sensibilisation a l'emergence des technologies sans fil: technologies emergeantes et normes de reseaux personnels et ad-hoc) [R] . Stassinopoulos, G. , Boucher, L. , Churavy, M. , 2007

机译：对新兴无线技术的认识：ad-hoc和个人区域网络标准和新兴技术（sensibilisation a l'des des technologies sans fil：technologies emergeantes et normes de reseaux personnels et ad-hoc）

Deep ad-hoc beamforming

摘要

著录项

相似文献

相关主题

期刊订阅