Audio Keywords Discovery for Text-Like Audio Content Analysis and Retrieval

Lu L.; Hanjalic A.


Abstract

Inspired by classical text document analysis employing the concept of (key) words, this paper presents an unsupervised approach to discover (key) audio elements in general audio documents. The (key) audio elements can be considered the equivalents of the text (key) words, and enable content-based audio analysis and retrieval by analogy with proven text analysis theories and methods. Since general audio signals usually show complicated and strongly varying distribution and density in the feature space, we propose an iterative spectral clustering method with context-dependent scaling factors to decompose an audio data stream into audio elements. Using this clustering method, temporal signal segments with similar low-level features are grouped into natural clusters that we adopt as audio elements. To detect those audio elements that are most representative of the semantic content, that is, the key audio elements, two cases are considered. First, if only one audio document is available for analysis, a number of heuristic importance indicators are defined and employed to detect the key audio elements. Second, when multiple audio documents are available, more sophisticated measures of audio element importance are proposed, including expected term frequency (ETF), expected inverse document frequency (EIDF), expected term duration (ETD) and expected inverse document duration (EIDD). Our experiments showed encouraging results regarding the quality of the obtained (key) audio elements and their potential applicability for content-based audio document analysis and retrieval.
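The clustering step described in the abstract can be illustrated with a small sketch. The abstract does not give the formula for the context-dependent scaling factors, so the code below assumes a local-scaling affinity in the style of self-tuning spectral clustering, where each segment's scale is its distance to its k-th nearest neighbour. The function names, the parameter `k`, the fixed number of clusters and the single (non-iterative) pass are all assumptions made for illustration, not the paper's method.

```python
import numpy as np

def local_scale_affinity(X, k=2):
    """Affinity with one scale per point (assumed form, not taken from the paper)."""
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.sum(diff ** 2, axis=-1)            # squared pairwise distances
    d = np.sqrt(d2)
    # sigma_i = distance to the k-th nearest neighbour (column 0 is the point itself)
    sigma = np.maximum(np.sort(d, axis=1)[:, k], 1e-12)
    A = np.exp(-d2 / (sigma[:, None] * sigma[None, :]))
    np.fill_diagonal(A, 0.0)
    return A

def spectral_embed(A, n_clusters):
    """Rows of the top eigenvectors of D^-1/2 A D^-1/2, normalised to unit length."""
    deg = A.sum(axis=1)
    inv_sqrt = 1.0 / np.sqrt(np.maximum(deg, 1e-12))
    L = A * inv_sqrt[:, None] * inv_sqrt[None, :]
    _, vecs = np.linalg.eigh(L)                # eigenvalues in ascending order
    Y = vecs[:, -n_clusters:]
    return Y / np.maximum(np.linalg.norm(Y, axis=1, keepdims=True), 1e-12)

def kmeans(Y, n_clusters, n_iter=50):
    """Tiny k-means with deterministic farthest-point initialisation."""
    centers = [Y[0]]
    for _ in range(1, n_clusters):
        dist = np.min([np.sum((Y - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(Y[np.argmax(dist)])
    centers = np.array(centers)
    for _ in range(n_iter):
        d2 = ((Y[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = np.argmin(d2, axis=1)
        new_centers = np.array([
            Y[labels == c].mean(axis=0) if np.any(labels == c) else centers[c]
            for c in range(n_clusters)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

def cluster_segments(X, n_clusters, k=2):
    """Group feature vectors of audio segments into candidate audio elements."""
    A = local_scale_affinity(X, k=k)
    return kmeans(spectral_embed(A, n_clusters), n_clusters)

# Toy feature vectors: one tight group and one group spread out 20 times wider.
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1], [0.05, 0.05],
              [10.0, 10.0], [12.0, 10.0], [10.0, 12.0], [12.0, 12.0], [11.0, 11.0]])
labels = cluster_segments(X, n_clusters=2, k=2)
print(labels)
```

The per-point scale is what lets a single affinity matrix cope with groups of very different density, which is the motivation the abstract gives for context-dependent scaling; a single global scale would either fragment the loose group or blur the tight one.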

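The abstract names four importance measures (ETF, EIDF, ETD, EIDD) but does not define them. The sketch below is one plausible reading, built by analogy with TF-IDF in text retrieval: each segment carries an assumed probability of being an occurrence of each audio element, and counts and durations are replaced by their expectations. The input layout, the function name and every formula here are assumptions for illustration, not the paper's definitions.

```python
import math

def expected_measures(docs):
    """docs: list of documents; each document is a list of (duration_seconds, probs),
    where probs[e] is the assumed probability that the segment is element e."""
    n_docs = len(docs)
    n_elems = len(docs[0][0][1])
    etf = [[0.0] * n_elems for _ in docs]   # expected term frequency per document
    etd = [[0.0] * n_elems for _ in docs]   # expected term duration per document
    exp_doc_count = [0.0] * n_elems         # expected number of documents containing e
    exp_elem_dur = [0.0] * n_elems          # expected total duration of e in the corpus
    total_dur = 0.0

    for d, segments in enumerate(docs):
        doc_dur = sum(dur for dur, _ in segments)
        total_dur += doc_dur
        for e in range(n_elems):
            exp_occ = sum(p[e] for _, p in segments)
            exp_dur = sum(dur * p[e] for dur, p in segments)
            etf[d][e] = exp_occ / len(segments)
            etd[d][e] = exp_dur / doc_dur
            # probability that element e occurs at least once in this document
            p_absent = math.prod(1.0 - p[e] for _, p in segments)
            exp_doc_count[e] += 1.0 - p_absent
            exp_elem_dur[e] += exp_dur

    eps = 1e-12  # guards against log of a division by zero
    eidf = [math.log(n_docs / max(c, eps)) for c in exp_doc_count]
    eidd = [math.log(total_dur / max(t, eps)) for t in exp_elem_dur]
    return etf, eidf, etd, eidd

# Two toy documents with two audio elements and hard (0/1) memberships.
docs = [
    [(2.0, [1.0, 0.0]), (2.0, [1.0, 0.0])],
    [(1.0, [1.0, 0.0]), (3.0, [0.0, 1.0])],
]
etf, eidf, etd, eidd = expected_measures(docs)
print(etf, eidf, etd, eidd)
```

In this toy corpus the first element appears in both documents and therefore gets an EIDF of zero, while the second element appears in only one document and scores higher, mirroring how inverse document frequency down-weights ubiquitous words in text.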

Bibliographic details

Source
IEEE Transactions on Multimedia | 2008, No. 1 | pp. 74-85 | 12 pages
Authors
Lu L.; Hanjalic A.
Original format: PDF
Language of text: English
CLC classification: Computing technology, computer technology
Keywords
Audio content mining; audio element; audio keywords; content-based audio analysis; key audio element; knowledge discovery;


