Disentangled Multidimensional Metric Learning for Music Similarity

机译：用于音乐相似性的解缠多维度量学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Music similarity search is useful for a variety of creative tasks such as replacing one music recording with another recording with a similar "feel", a common task in video editing. For this task, it is typically necessary to define a similarity metric to compare one recording to another. Music similarity, however, is hard to define and depends on multiple simultaneous notions of similarity (i.e. genre, mood, instrument, tempo). While prior work ignore this issue, we embrace this idea and introduce the concept of multidimensional similarity and unify both global and specialized similarity metrics into a single, semantically disentangled multidimensional similarity metric. To do so, we adapt a variant of deep metric learning called conditional similarity networks to the audio domain and extend it using track-based information to control the specificity of our model. We evaluate our method and show that our single, multidimensional model outperforms both specialized similarity spaces and alternative baselines. We also run a user-study and show that our approach is favored by human annotators as well.

机译：音乐相似性搜索对于各种创造性任务很有用，例如，将一个音乐录制替换为具有类似“感觉”的另一种录制，这是视频编辑中的常见任务。对于此任务，通常需要定义一个相似性度量以将一个记录与另一个记录进行比较。但是，音乐相似性很难定义，并且要依赖多个同时的相似性概念（例如流派，情绪，乐器，节奏）。尽管先前的工作忽略了这个问题，但是我们接受了这个想法，并引入了多维相似性的概念，并将全局和专用相似性度量标准统一为一个语义上解开的多维相似性度量标准。为此，我们将一种称为条件相似性网络的深度度量学习变体改编为音频域，并使用基于轨道的信息对其进行扩展以控制模型的特异性。我们评估了我们的方法，并表明我们的单一多维模型优于专门的相似性空间和替代基线。我们还进行了用户研究，并表明我们的方法也受到人类注释者的青睐。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2020年|6-10|共5页
会议地点
作者
Jongpil Lee; Nicholas J. Bryan; Justin Salamon; Zeyu Jin; Juhan Nam;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
multidimensional music similarity; metric learning; disentangled representation; query-by-example;

机译：多维音乐相似度;度量学习;分解表示法;实例查询;

相似文献

外文文献
中文文献
专利

1. The multidimensional perturbation value: A single metric to measure similarity and activity of treatments in high-throughput multidimensional screens [J] . Hutz J.E., Nelson T., Wu H., Journal of biomolecular screening: The official journal of the Society for Biomolecular Screening . 2013,第4期

机译：多维扰动值：一种用于度量高通量多维屏幕中处理的相似性和活动性的度量
2. Music Recommendation Based on Multidimensional Description and Similarity Measures [J] . Bozena Kostek, Andrzej Kaczmarek Fundamenta Informaticae . 2013,第1a4期

机译：基于多维描述和相似度度量的音乐推荐
3. Context Similarity Metric for Multidimensional Service Recommendation [J] . Liwei Liu, Nikolay Mehandjiev, Dong-Ling Xu International Journal of Electronic Commerce . 2013,第1期

机译：多维服务推荐的上下文相似性度量
4. Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On Vae-Based Metric Learning [C] . Keitaro Tanaka, Ryo Nishikimi, Yoshiaki Bando, IEEE International Conference on Acoustics, Speech and Signal Processing . 2021

机译：基于VAE的公制学习的音乐仪器声音的俯仰 - TIMBRE解剖
5. A multidimensional scaling study of seven theoretical indices of intervallic similarity and musicians' perceptions among twenty-one pitch-class sets with implications for music teaching and learning. [D] . Lane, Roger C. 1997

机译：一个多维比例缩放研究，涉及二十一个音高类集合中的七个区间相似性理论指标和音乐家的感知，这对音乐的教与学具有一定的意义。
6. Similarity Metric Learning for 2D to 3D Registration of Brain Vasculature [O] . Alice Tang, Fabien Scalzo -1

机译：用于脑血管2D到3D配准的相似度量学习
7. Learning content-based metrics for music similarity [O] . Dieleman Sander, Schrauwen Benjamin 2012

机译：学习基于内容的音乐相似度量

Disentangled Multidimensional Metric Learning for Music Similarity

摘要

著录项

相似文献

相关主题

期刊订阅