首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Disentangled Multidimensional Metric Learning for Music Similarity
【24h】

Disentangled Multidimensional Metric Learning for Music Similarity

机译:用于音乐相似性的解缠多维度量学习

获取原文

摘要

Music similarity search is useful for a variety of creative tasks such as replacing one music recording with another recording with a similar "feel", a common task in video editing. For this task, it is typically necessary to define a similarity metric to compare one recording to another. Music similarity, however, is hard to define and depends on multiple simultaneous notions of similarity (i.e. genre, mood, instrument, tempo). While prior work ignore this issue, we embrace this idea and introduce the concept of multidimensional similarity and unify both global and specialized similarity metrics into a single, semantically disentangled multidimensional similarity metric. To do so, we adapt a variant of deep metric learning called conditional similarity networks to the audio domain and extend it using track-based information to control the specificity of our model. We evaluate our method and show that our single, multidimensional model outperforms both specialized similarity spaces and alternative baselines. We also run a user-study and show that our approach is favored by human annotators as well.
机译:音乐相似性搜索对于各种创造性任务很有用,例如,将一个音乐录制替换为具有类似“感觉”的另一种录制,这是视频编辑中的常见任务。对于此任务,通常需要定义一个相似性度量以将一个记录与另一个记录进行比较。但是,音乐相似性很难定义,并且要依赖多个同时的相似性概念(例如流派,情绪,乐器,节奏)。尽管先前的工作忽略了这个问题,但是我们接受了这个想法,并引入了多维相似性的概念,并将全局和专用相似性度量标准统一为一个语义上解开的多维相似性度量标准。为此,我们将一种称为条件相似性网络的深度度量学习变体改编为音频域,并使用基于轨道的信息对其进行扩展以控制模型的特异性。我们评估了我们的方法,并表明我们的单一多维模型优于专门的相似性空间和替代基线。我们还进行了用户研究,并表明我们的方法也受到人类注释者的青睐。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号