Improving Content-based Audio Retrieval by Vocal Imitation Feedback

机译：通过人声模仿反馈改善基于内容的音频检索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Content-based audio retrieval including query-by-example (QBE) and query-by-vocal imitation (QBV) is useful when search-relevant text labels for the audio are unavailable, or text labels do not sufficiently narrow the search. However, a single query example may not provide sufficient information to ensure the target sound(s) in the database are the most highly ranked. In this paper, we adapt an existing model for generating audio embeddings to create a state-of-the-art similarity measure for audio QBE and QBV. We then propose a new method to update search results when top-ranked items are not relevant: The user provides an additional vocal imitation to illustrate what they do or do not want in the search results. This imitation may either be of some portion of the initial query example, or of a top-ranked (but incorrect) search result. Results show that adding vocal imitation feedback improves initial retrieval results by a statistically significant amount.

机译：当音频的搜索相关文本标签不可用或文本标签不能使搜索范围缩小时，基于内容的音频检索（包括按示例查询（QBE）和按声音查询模仿（QBV））非常有用。但是，单个查询示例可能无法提供足够的信息来确保数据库中的目标声音排名最高。在本文中，我们改编了一个用于生成音频嵌入的现有模型，以为音频QBE和QBV创建最新的相似性度量。然后，我们提出了一种在排名靠前的项目不相关时更新搜索结果的新方法：用户提供了另一种语音模仿，以说明他们在搜索结果中想要或不想要的内容。这种模仿可能是初始查询示例的一部分，也可能是排名最高（但不正确）的搜索结果。结果表明，添加人声模仿反馈可将初始检索结果提高统计学上显着的水平。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2019年|4100-4104|共5页
会议地点
作者
Bongjun Kim; Bryan Pardo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
audio signal processing; content-based retrieval; query processing; relevance feedback; text analysis;

机译：音频信号处理;基于内容的检索;查询处理;相关性反馈;文本分析;

相似文献

外文文献
中文文献
专利

1. Content-based audio retrieval with relevance feedback [J] . Chunru Wan, Mingchun Liu Pattern recognition letters . 2006,第2期

机译：基于内容的音频检索以及相关性反馈
2. Content-Based Analysis Improves Audiovisual Archive Retrieval [J] . Huurnink B., Snoek C. G. M., de Rijke M., Multimedia, IEEE Transactions on . 2012,第4期

机译：基于内容的分析可改善视听档案的检索
3. Daubechies Wavelets Based Robust Audio Fingerprinting for Content-Based Audio Retrieval [J] . Wei Sun, Zhe-Ming Lu, Fa-Xin Yu, International journal of digital crime and forensics . 2012,第2期

机译：基于Daubechies小波的稳健音频指纹识别，用于基于内容的音频检索
4. Improving Content-based Audio Retrieval by Vocal Imitation Feedback [C] . Bongjun Kim, Bryan Pardo IEEE International Conference on Acoustics, Speech and Signal Processing . 2019

机译：通过声音模仿反馈改进基于内容的音频检索
5. Efficient techniques for relevance feedback processing in content-based image retrieval. [D] . Liu, Danzhou. 2009

机译：基于内容的图像检索中相关性反馈处理的高效技术。
6. Individual Differences in Audio-Vocal Speech Imitation Aptitude in Late Bilinguals: Functional Neuro-Imaging and Brain Morphology [O] . Susanne Maria Reiterer, Xiaochen Hu, Michael Erb, 2011

机译：晚期双语者在声乐语音模仿能力上的个体差异：功能性神经影像学和脑形态学
7. Content-based analysis improves audiovisual archive retrieval [O] . Huurnink B., Snoek C.G.M., de Rijke M., 2012

机译：基于内容的分析可改善视听档案的检索
8. FALCON: Feedback Adaptive Loop for Content-Based Retrieval [R] . Wu, L. , Faloutsos, C. , Sycara, K. , 2000

机译：FaLCON：基于内容检索的反馈自适应循环

Improving Content-based Audio Retrieval by Vocal Imitation Feedback

摘要

著录项

相似文献

相关主题

期刊订阅