A simplified early auditory model with application in audio classification

Wei Chu; Benoit Champagne

首页> 外文期刊>Canadian journal of electrical and computer engineering >A simplified early auditory model with application in audio classification

【24h】

A simplified early auditory model with application in audio classification

机译：简化的早期听觉模型及其在音频分类中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

La dernière décennie a connu une expansion de la recherche sur les algorithmes de classification audio et de segmentation. Cependant, l'effet du brait de fond sur les performances de la classification n'a pas été largement étudié. Récemment, un modèle auditif qui calcule un spectre auditif a atteint une performance excellente en classification audio ainsi qu'une robustesse dans un environnement bruité. Malheureusement, ce modèle auditif est caractérisé par des besoins élevés en calcul et par un traitement non-linéaire. Dans ce papier, quelques modifications sont introduites afin de développer une version simplifiée de ce modèle qui est linéaire à l'exception du calcul de la valeur de la racine carrée de l'énergie. Des tâches de classification de la parole/musique de même que de la paroleon-parole sont effectuées pour évaluer la performance de la classification, en utilisant un classifieur à automate à support vectoriel. Comparé à une transformation rapide de Fourier conventionnelle, les deux spectres auditifs - cehri d'origine et celui simplifié proposé - montrent des performances plus robustes dans les tests avec bruit Les résultais des tests montrent également qu'en dépit d'une complexité de calcul réduite, la performance du spectre auditif simplifié qui a été proposé est proche du spectre auditif d'origine.%The past decade has seen extensive research on audio classification and segmentation algorithms. However, the effect of background noise on classification performance has not been widely investigated. Recently, an early auditory model that calculates a so-called auditory spectrum has achieved excellent performance in audio classification along with robustness in a noisy environment. Unfortunately, this early auditory model is characterized by high computational requirements and the use of nonlinear processing. In this paper, certain modifications are introduced to develop a simplified version of this model which is linear except for the calculation of the square-root value of the energy. Speech/music and speechon-speech classification tasks are carried out to evaluate the classification performance, with a support vector machine (SVM) as the classifier. Compared to a conventional fast Fourier transform-based spectrum, both the original auditory spectrum and the proposed simplified auditory spectrum snow more robust performance in noisy test cases. Test results also indicate that despite a reduced computational complexity, the performance of the proposed simplified auditory spectrum is close to that of the original auditory spectrum.

机译：在过去的十年中，音频分类和分段算法的研究有所扩展。但是，背景编织对分类性能的影响尚未得到广泛研究。近来，计算听力频谱的听力模型在音频分类以及在嘈杂环境中的鲁棒性方面已经获得了优异的性能。不幸的是，这种听力模型的特点是计算量大并且需要非线性处理。在本文中，引入了一些修改以开发该模型的简化版本，该模型除了计算能量平方根的值外，都是线性的。使用基于矢量的自动分类器执行语音/音乐以及语音/非语音分类任务以评估分类性能。与快速的传统傅立叶变换相比，两个听觉频谱（原始cehri和简化的拟声频谱）在噪声测试中表现出更强大的性能。测试结果还表明，尽管计算复杂度降低了，已提出的简化听觉频谱的性能接近原始听觉频谱。％在过去的十年中，音频分类和分段算法得到了广泛的研究。但是，背景噪声对分类性能的影响尚未得到广泛研究。近来，计算所谓的听觉频谱的早期听觉模型已经在音频分类中实现了出色的性能以及在嘈杂环境中的鲁棒性。不幸的是，这种早期听觉模型的特点是计算量大，并且需要使用非线性处理。在本文中，引入了某些修改以开发此模型的简化版本，该模型除了计算能量的平方根值外，它是线性的。以支持向量机（SVM）作为分类器，执行语音/音乐和语音/非语音分类任务以评估分类性能。与传统的基于快速傅立叶变换的频谱相比，原始的听觉频谱和提议的简化的听觉频谱在嘈杂的测试案例中均具有更强大的性能。测试结果还表明，尽管计算复杂度有所降低，但所提出的简化听觉频谱的性能却接近原始听觉频谱。

著录项

来源
《Canadian journal of electrical and computer engineering》 |2006年第4期|p.185-189|共5页
作者
Wei Chu; Benoit Champagne;
展开▼
作者单位

Department of Electrical and Computer Engineering, McGill University, Montreal, Quebec H3A 2A7;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类电工技术;
关键词
classification; auditory spectrum; early auditory model; noise robustness;

机译：分类;听觉频谱;早期听觉模型;噪声鲁棒性;

相似文献

外文文献
中文文献
专利

1. A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification [J] . Chu W., Champagne B. IEEE transactions on audio, speech and language processing . 2008,第1期

机译：基于噪声稳健FFT的听觉谱及其在音频分类中的应用
2. Beyond the Audiogram: Application of Models of Auditory Fitness for Duty to Assess Communication in the Real World [J] . Dubno Judy R. Ear and hearing. . 2018,第3期

机译：除了听力图之外：在责任中应用责任的模型，以评估现实世界中的沟通
3. A simplified audiovisual fusion model with application to large-vocabulary recognition of French Canadian speech [J] . L. Gagnon, S. Foucher, F. Laliberte, Canadian journal of electrical and computer engineering . 2008,第2期

机译：一种简化的视听融合模型，应用于加拿大法语语音的大词汇量识别
4. A Simplified Early Auditory Model with Application in Speech/Music Classification [C] . Wei Chu, Benoit Champagne, The Institute of Electrical and Electronics EngineersInc. Canadian Conference on Electrical and Computer Engineering . 2006

机译：一种简化的早期听觉模型，具有语音/音乐分类
5. Incorporating Auditory Models in Speech/Audio Applications [D] . Krishnamoorthi, Harish 2011

机译：在语音/音频应用程序中整合听觉模型
6. Beyond the audiogram: Application of models of auditory fitness for duty to assess communication in the real world [O] . Judy R. Dubno -1

机译：听力图之外：听觉适应性模型在评估现实世界中的交流中的应用
7. Beyond the audiogram: application of models of auditory fitness for duty to assess communication in the real world [O] . Judy R. Dubno 2018

机译：超越了AudioGram：对听觉健身模型的应用征用，以评估现实世界中的沟通

A simplified early auditory model with application in audio classification

摘要

著录项

相似文献

相关主题

期刊订阅