Venue: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Data-Efficient Framework for Real-World Multiple Sound Source 2D Localization



Abstract

Deep neural networks have recently led to promising results for the task of multiple sound source localization. Yet, they require a lot of training data to cover a variety of acoustic conditions and microphone array layouts. One can leverage acoustic simulators to inexpensively generate labeled training data. However, models trained on synthetic data tend to perform poorly on real-world recordings due to the domain mismatch. Moreover, learning for different microphone array layouts makes the task more complicated due to the infinite number of possible layouts. We propose to use adversarial learning methods to close the gap between the synthetic and real domains. Our novel ensemble-discrimination method significantly improves the localization performance without requiring any label from the real data. Furthermore, we propose a novel explicit transformation layer to be embedded in the localization architecture. It enables the model to be trained with data from specific microphone array layouts while generalizing well to unseen layouts during inference.
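The abstract mentions using adversarial learning to close the gap between the synthetic and real domains. One common realization of this idea (not necessarily the authors' exact method) is a gradient reversal layer, as in domain-adversarial training: the layer is the identity in the forward pass, but flips the sign of the gradient in the backward pass, so the feature extractor learns to *confuse* a domain classifier. A minimal pure-Python sketch of the sign flip, with all names and numeric values hypothetical:

```python
# Gradient reversal layer (GRL): identity forward, negated gradient backward.
# This illustrates the general adversarial domain-adaptation mechanism; the
# paper's actual architecture and loss may differ.

def grl_forward(x):
    """Forward pass: identity."""
    return x

def grl_backward(grad_out, lam=1.0):
    """Backward pass: flip (and optionally scale) the incoming gradient."""
    return -lam * grad_out

# Toy scalar example: feature f = w * x, domain logit d = v * f,
# domain loss L = 0.5 * (d - y)^2, all values chosen for illustration.
x, y = 2.0, 1.0          # input and domain label
w, v = 0.5, 0.3          # feature-extractor and domain-classifier weights
f = grl_forward(w * x)   # feature passes through the GRL
d = v * f
dL_dd = d - y                     # gradient of squared error w.r.t. logit
dL_df = grl_backward(v * dL_dd)   # gradient is reversed before the extractor
dL_dw = dL_df * x                 # extractor receives the *reversed* gradient
print(dL_dw)                      # ≈ 0.42 (positive, i.e. ascends domain loss)
```

Because the sign is flipped, gradient descent on `w` *increases* the domain loss, pushing the feature extractor toward domain-invariant features while the domain classifier (updated without the flip) still descends on its own loss.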
