A spectrogram-patch-input DNN model for detection and classification of acoustic events in speech overlapping scenarios

Miquel ESPI; Masakiyo FUJIMOTO; Yotaro KUBO; Tomohiro NAKATANI

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >A spectrogram-patch-input DNN model for detection and classification of acoustic events in speech overlapping scenarios

【24h】

A spectrogram-patch-input DNN model for detection and classification of acoustic events in speech overlapping scenarios

机译：频谱图-补丁输入DNN模型，用于语音重叠场景中的声音事件检测和分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents an acoustic event detection and classification method that learns features from spectrogram patches (i.e. concatenation of a certain number of consecutive spectrum frames) in an unsupervised manner, and integrates effortlessly within the deep neural network framework. Most AED use-cases happen in scenarios where speech overlaps with acoustic events, and while derived features (e.g. MFCCs, Mel-filter-banks) have traditionally characterized well the spectrum of speech, they are too dense and centered on specific frequencies to be used with non-speech tasks. Results show that the proposed model based on spectrogram-patch out-performs those based on derived features, as well as previous AED works.

机译：本文提出了一种声音事件检测和分类方法，该方法以无监督的方式从频谱图补丁中学习特征（即连接一定数量的连续频谱帧），并毫不费力地集成在深度神经网络框架中。大多数AED用例都发生在语音与声音事件重叠的场景中，尽管派生功能（例如MFCC，Mel滤波器组）传统上已经很好地描述了语音频谱，但它们过于密集并且集中在特定频率上，无法使用与非语音任务。结果表明，所提出的基于频谱图补丁的模型优于基于派生特征的模型，以及先前的AED工作。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2014年第52期|共6页
作者
Miquel ESPI; Masakiyo FUJIMOTO; Yotaro KUBO; Tomohiro NAKATANI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电报、传真;
关键词
Acoustic event detection; Deep neural network; Spectrogram patch; Source sparsity;

机译：声音事件检测;深度神经网络;声谱图补丁;源稀疏;

相似文献

外文文献
中文文献
专利

1. A spectrogram-patch-input DNN model for detection and classification of acoustic events in speech overlapping scenarios [J] . Miquel ESPI, Masakiyo FUJIMOTO, Yotaro KUBO, 電子情報通信学会技術研究報告. 音声. Speech . 2014,第52期

机译：频谱图-补丁输入DNN模型，用于语音重叠场景中的声音事件检测和分类
2. Acoustic Event Detection in Speech Overlapping Scenarios Based on High-Resolution Spectral Input and Deep Learning [J] . Miquel ESPI, Masakiyo FUJIMOTO, Tomohiro NAKATANI IEICE transactions on information and systems . 2015,第10期

机译：基于高分辨率谱输入和深度学习的语音重叠场景中的声音事件检测
3. On the Joint Use of NMF and Classification for Overlapping Acoustic Event Detection [J] . Giannoulis Panagiotis, Potamianos Gerasimos, Maragos Petros Proceedings . 2018,第2期

机译：NMF和分类在重叠声事件检测中的联合使用
4. Spectrogram patch based acoustic event detection and classification in speech overlapping conditions [C] . Espi Miquel, Fujimoto Masakiyo, Kubo Yotaro, 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays . 2014

机译：语音重叠条件下基于频谱图补丁的声音事件检测和分类
5. Automatic Acoustic Events Detection, Classification, and Semantic Annotation for Persistent Surveillance Applications. [D] . Alkilani, Amjad H. I. 2014

机译：持续监视应用程序的自动声音事件检测，分类和语义注释。
6. Frequency overlap between electric and acoustic stimulation and speech-perception benefit in patients with combined electric and acoustic stimulation [O] . Ting Zhang, Anthony J. Spahr, Michael F. Dorman -1

机译：患者合并电和声刺激电和声刺激和语音感知利益之间的频率重叠
7. Acoustic Event Detection in Speech Overlapping Scenarios Based on High-Resolution Spectral Input and Deep Learning [O] . Miquel ESPI, Masakiyo FUJIMOTO, Tomohiro NAKATANI 2015

机译：基于高分辨率光谱输入和深度学习的语音重叠场景中的声学事件检测
8. Modeling and Classification of Acoustic Transients by Speech Recognition Techniques. [R] . Woodard, J. P. 1989

机译：基于语音识别技术的声学瞬态建模与分类。

A spectrogram-patch-input DNN model for detection and classification of acoustic events in speech overlapping scenarios

摘要

著录项

相似文献

相关主题

期刊订阅