PureMIC: A New Audio Dataset for the Classification of Musical Instruments based on Convolutional Neural Networks

Castel-Branco Goncalo; Falcao Gabriel; Perdigao Fernando

首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >PureMIC: A New Audio Dataset for the Classification of Musical Instruments based on Convolutional Neural Networks

【24h】

PureMIC: A New Audio Dataset for the Classification of Musical Instruments based on Convolutional Neural Networks

机译：威胁：基于卷积神经网络的乐器分类的新音频数据集

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic classification of musical instruments from audio relies heavily on datasets of acoustic recordings of the instruments to train models of those instruments. To do this, precise labels of the instrument's events are mandatory. Also, it is very difficult to obtain such labels, especially in polyphonic performances. OpenMic-2018 is a polyphonic dataset created specifically with the aim to train instrument models. However, this dataset is based on weak and incomplete labels. The automatic classification of sound events, based on the VGGish bottleneck layer as proposed before by the AudioSet, implies the classification of only one second at a time, making it hard to find the label of that exact moment. To answer this question, this paper proposes PureMIC, a new strongly labeled dataset (SLD) that isolates 1000 single instrument clips manually labeled. Moreover, the proposed model classifies clips over time and also enhances the labeling robustness of a high number of unlabeled samples in OpenMIC-2018 due to its ability of classification over time. In the paper we disambiguate and report the automatic labeling of previously unlabeled samples. The proposed new labels achieve a mean average precision (mAP) of 0.701 for OpenMIC test data, outperforming its baseline (0.66). The code is released online so that the research community can replicate and follow the proposed implementation.

机译：来自音频的乐器自动分类严重依赖于仪器的声学记录数据集，以培训这些仪器的模型。为此，仪器事件的精确标签是强制性的。而且，非常难以获得这种标签，尤其是在复音性能中。 OpenMic-2018是一个专门创建的Polyphonic数据集，其目的是培训仪器模型。但是，此数据集基于弱和不完整的标签。基于Audioset之前提出的VAGATH瓶颈层的声音事件的自动分类意味着一次只有一秒钟的分类，使得很难找到该确切时刻的标签。为了回答这个问题，本文提出了一种诸如手动标记的1000个单个仪器夹的新强大标记的数据集（SLD）。此外，所提出的模型随着时间的推移来对剪辑进行分类，并且由于其随时间的分类能力，增强了OpenMic-2018中的大量未标记样本的标记稳健性。在论文中，我们消除并报告先前未标记的样本的自动标记。所提出的新标签实现了OpenMic测试数据的平均平均精度（MAP）为0.701，优于其基线（0.66）。该代码在线发布，以便研究社区可以复制和遵循拟议的实施。

著录项

来源
《Journal of signal processing systems for signal, image, and video technology》 |2021年第9期|977-987|共11页
作者
Castel-Branco Goncalo; Falcao Gabriel; Perdigao Fernando;
展开▼
作者单位

Univ Coimbra Inst Telecomunicacoes Coimbra Portugal;

Univ Coimbra Inst Telecomunicacoes Coimbra Portugal;

Univ Coimbra Inst Telecomunicacoes Coimbra Portugal;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
OpenMIC-2018; AudioSet; PureMIC; Instrument classification; Instrument labeling; Deep learning;

机译：Openmic-2018;audioset;纯度;仪器分类;仪器标签;深度学习;

相似文献

外文文献
中文文献
专利

1. Convolutional neural networks for classification of music-listening EEG: comparing 1D convolutional kernels with 2D kernels and cerebral laterality of musical influence [J] . Neural computing & applications . 2020,第13期

机译：卷积神经网络，用于音乐收听EEG的分类：将1D卷积粒与2D核和音乐影响的脑横向相比
2. Recognition and Classification Model of Music Genres and Chinese Traditional Musical Instruments Based on Deep Neural Networks [J] . Ke Xu Scientific programming . 2021,第a期

机译：基于深神经网络的音乐流派和中国传统乐器的认可与分类模型
3. Automatic musical instrument classification using fractional fourier transform based- MFCC features and counter propagation neural network [J] . Bhalke D. G., Rao C. B. Rama, Bormane D. S. Journal of Intelligent Information Systems . 2016,第3期

机译：基于分数阶傅里叶变换的自动乐器分类-MFCC特征和反向传播神经网络
4. Classification of Musical Instruments with Convolutional Neural Networks [C] . Miljan Mitrovic, Marko Misic Telecommunications Forum . 2018

机译：卷积神经网络对乐器的分类
5. Deep Convolutional Neural Networks for the Classification of the EMBER Malware Dataset [D] . Nallamothu, Anudeep 2018

机译：深度卷积神经网络用于EMBER恶意软件数据集的分类
6. Deep neural networks show an equivalent and often superior performance to dermatologists in onychomycosis diagnosis: Automatic construction of onychomycosis datasets by region-based convolutional deep neural network [O] . Seung Seog Han, Gyeong Hun Park, Woohyung Lim, -1

机译：深度神经网络在灰指甲诊断中显示出与皮肤科医生相当且通常优于皮肤病的性能：通过基于区域的卷积深度神经网络自动构建灰指甲数据集
7. Deep neural networks show an equivalent and often superior performance to dermatologists in onychomycosis diagnosis: Automatic construction of onychomycosis datasets by region-based convolutional deep neural network [O] . Seung Seog Han, Gyeong Hun Park, Woohyung Lim, 2018

机译：深度神经网络对甲癣诊断的皮肤病学家表现出相同，并且通常优异的性能：由基于区域的卷积深神经网络自动构建甲癣数据集

PureMIC: A New Audio Dataset for the Classification of Musical Instruments based on Convolutional Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅