首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling
【24h】

Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling

机译:基于光谱相关建模的多关音乐唱歌旋律提取

获取原文

摘要

Convolutional neural network (CNN) based methods have achieved state-of-the-art performance for singing melody extraction from polyphonic music. However, most of these methods focus on the learning of local features, while relationships among spectral components locating far apart are often neglected. In this paper, we explore the idea of modeling spectral correlation explicitly for melody extraction. Specifically, we present a spectral correlation module (SCM) that can learn to model the relationships among all frequency bands in a time-frequency representation, thus allowing the encoding of global spectral information into a conventional CNN. Furthermore, we propose to integrate center frequencies with the input feature map of SCM to improve the performance. We implement a light-weight model comprised of SCM blocks to verify the efficacy of our system. Our system achieves a state-of-the-art overall accuracy of 83.5% on the MedleyDB dataset.
机译:基于卷积神经网络(CNN)的方法已经实现了用于唱歌的最新性能,用于唱歌从部隙音乐中振动旋律。 然而,大多数这些方法都侧重于局部特征的学习,而定位远距离的光谱分量之间的关系通常被忽略。 在本文中,我们探讨了显式用于旋律提取的光谱相关性的思想。 具体地,我们提出了一种光谱相关模块(SCM),其可以学习在时频表示中的所有频带之间的关系模拟关系,从而允许将全局光谱信息的编码成传统的CNN。 此外,我们建议将中心频率与SCM的输入特征图集成,以提高性能。 我们实现了由SCM块组成的轻量级模型,以验证我们的系统的功效。 我们的系统在MedleyDB数据集上实现了最先进的总准确性为83.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号