Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling

Abstract

Convolutional neural network (CNN) based methods have achieved state-of-the-art performance for singing melody extraction from polyphonic music. However, most of these methods focus on learning local features, while relationships among spectral components located far apart are often neglected. In this paper, we explore the idea of modeling spectral correlation explicitly for melody extraction. Specifically, we present a spectral correlation module (SCM) that learns to model the relationships among all frequency bands in a time-frequency representation, thus allowing global spectral information to be encoded into a conventional CNN. Furthermore, we propose to integrate center frequencies with the input feature map of SCM to improve the performance. We implement a light-weight model composed of SCM blocks to verify the efficacy of our system. Our system achieves a state-of-the-art overall accuracy of 83.5% on the MedleyDB dataset.
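
The abstract does not describe the internals of SCM, so the following is only a rough sketch of the two ideas it names, not the authors' implementation. It assumes a feature map indexed as [channel][frequency][time], log-spaced bins, and a plain F x F weight matrix standing in for whatever learned operator the paper uses. The function names, `f_min`, and `bins_per_octave` are all hypothetical.

```python
import math
import random


def center_frequencies(n_bins, f_min=32.7, bins_per_octave=12):
    """Center frequency (Hz) of each log-spaced bin (f_min and spacing are assumed)."""
    return [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]


def add_frequency_channel(feature_map, freqs):
    """Append one channel holding the normalized log center frequency of each bin.

    feature_map is indexed as [channel][frequency][time]; the input is not modified.
    """
    n_time = len(feature_map[0][0])
    logs = [math.log(f) for f in freqs]
    lo, hi = min(logs), max(logs)
    span = (hi - lo) or 1.0  # avoid division by zero for a single bin
    channel = [[(v - lo) / span] * n_time for v in logs]
    return feature_map + [channel]


def mix_across_frequency(feature_map, weights):
    """Apply an F x F matrix along the frequency axis of every channel.

    Each output bin is a weighted sum of all input bins, so distant bands
    interact in one step, unlike a small convolution kernel that only sees
    neighbouring bins. In a trained model the weights would be learned.
    """
    n_freq = len(weights)
    mixed = []
    for channel in feature_map:
        n_time = len(channel[0])
        mixed.append([
            [sum(weights[i][j] * channel[j][t] for j in range(n_freq))
             for t in range(n_time)]
            for i in range(n_freq)
        ])
    return mixed


# Toy usage: one input channel, 4 frequency bins, 3 time frames.
random.seed(0)
n_freq, n_time = 4, 3
spec = [[[random.random() for _ in range(n_time)] for _ in range(n_freq)]]
x = add_frequency_channel(spec, center_frequencies(n_freq))
uniform = [[1.0 / n_freq] * n_freq for _ in range(n_freq)]
y = mix_across_frequency(x, uniform)  # every bin becomes the mean over all bins
```

With the uniform matrix every output bin carries the average of all bins in its frame, which is the simplest possible case of global mixing along frequency; a learned matrix would instead weight, for example, harmonically related bins more heavily.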

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2021 | pp. 241-245 | 5 pages
Authors
Xingjian Du; Bilei Zhu; Qiuqiang Kong; Zejun Ma;
Original format: PDF
Keywords
Time-frequency analysis; Correlation; Neural networks; Estimation; Signal processing; Feature extraction; Encoding;


Similar literature

1. Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method [J]. Reddy M. Gurunath, Rao K. Sreenivasa. Circuits, Systems, and Signal Processing. 2018, No. 7
2. Main melody extraction from polyphonic music based on modified Euclidean algorithm [J]. Zhang Weiwei, Chen Zhe, Yin Fuliang. Applied Acoustics. 2016, No. nova
3. Predominant Melody Extraction from Polyphonic Music Signals Based on Harmonic Structure [J]. Jea-Yul YOON, Chai-Jong SONG, Hochong PARK. IEICE Transactions on Information and Systems. 2013, No. 11
4. Fusing transcription results from polyphonic and monophonic audio for singing melody transcription in polyphonic music [C]. Bilei Zhu, Fuzhang Wu, Ke Li, IEEE International Conference on Acoustics, Speech and Signal Processing. 2017
5. Study of Sequential Accelerated Solvent Extraction of Different Depths of Oak Tank Staves, Affected by Three Different Heat Sources, Analyzed by Gas Chromatography-Mass Spectrometry and Correlations to Sensory Descriptive Analysis of Their Model Wine Extractions [D]. Llodra, David. 2013
6. Musical Melody and Speech Intonation: Singing a Different Tune [O]. Robert J. Zatorre, Shari R. Baum. 2012
7. Singing Transcription from Polyphonic Music Using Melody Contour Filtering [O]. Zhuang He, Yin Feng. 2021
8. Extraction of Chlorophyll-a Concentration Based on Spectral Unmixing Model Using Field Hyperspectral Data in Taihu Lake [R]. Jianguang, W., Qing, X., Qinhuo, L., 2005
