A unified neural-network-based speaker localization technique

Arslan G.; Sakarya F.A.

Abstract

Locating and tracking a speaker in real time using microphone arrays is important in many applications such as hands-free video conferencing, speech processing in large rooms, and acoustic echo cancellation. A speaker can be moving from the far field to the near field of the array, or vice versa. Many neural-network-based localization techniques exist, but they are applicable to either far-field or near-field sources, and are computationally intensive for real-time speaker localization applications because of the wide-band nature of the speech. We propose a unified neural-network-based source localization technique, which is simultaneously applicable to wide-band and narrow-band signal sources that are in the far field or near field of a microphone array. The technique exploits a multilayer perceptron feedforward neural network structure and forms the feature vectors by computing the normalized instantaneous cross-power spectrum samples between adjacent pairs of sensors. Simulation results indicate that our technique is able to locate a source with an absolute error of less than 3.5° at a signal-to-noise ratio of 20 dB and a sampling rate of 8000 Hz at each sensor.
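
The feature construction named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the normalization, the frequency bins used, or how complex values are fed to the network. The sketch assumes normalization by magnitude (so only the inter-sensor phase remains), keeps the first `n_bins` non-DC bins, and stacks real and imaginary parts. The function name `cross_power_features` and the parameter `n_bins` are hypothetical.

```python
import numpy as np

def cross_power_features(frame, n_bins=8):
    """Build a feature vector from one snapshot of a microphone array.

    frame: array of shape (n_mics, n_samples), one row per sensor.
    Returns a real vector of length 2 * (n_mics - 1) * n_bins.
    """
    # One-sided spectrum of each sensor signal.
    spectra = np.fft.rfft(frame, axis=1)
    # Instantaneous cross-power spectrum between adjacent sensors m and m+1.
    cross = spectra[:-1] * np.conj(spectra[1:])
    # ASSUMPTION: normalize each sample to unit modulus, keeping only phase.
    cross = cross / np.maximum(np.abs(cross), 1e-12)
    # ASSUMPTION: keep the first n_bins bins above DC.
    cross = cross[:, 1:n_bins + 1]
    # Stack real and imaginary parts so a real-valued network can consume them.
    return np.concatenate([cross.real.ravel(), cross.imag.ravel()])

# Toy snapshot: 4 sensors, each a 2-sample circular delay of its neighbour.
rng = np.random.default_rng(0)
source = rng.standard_normal(256)
frame = np.stack([np.roll(source, 2 * m) for m in range(4)])
features = cross_power_features(frame)
```

In this toy case each normalized sample reduces to a pure phase term set by the inter-sensor delay, which is the kind of information a multilayer perceptron could map to a source position. Training that network is outside the scope of the sketch.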

Bibliographic details

Source: IEEE Transactions on Neural Networks, 2000, No. 4, pp. 997-1002 (6 pages)
Authors: Arslan G.; Sakarya F.A.
Original format: PDF
Language: English
Chinese Library Classification: Automation technology; computer technology

