Automatic Segmentation, Classification and Clustering of Broadcast News Audio

机译：广播新闻音频的自动分段，分类和聚类

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic recognition of broadcast feeds from radio and television sources has been gaining importance recently, especially with the success of systems such as the CMU Informedia system [1]. In this work we describe the problems faced in adapting a system built to recognize one utterance at a time to a task that requires recognition of an entire half hour show. We break the problem into three components: segmentation, classification, and clustering. We show that a priori knowledge of acoustic conditions and speakers in the broadcast data is not required for segmentation. The system is able to detect changes in acoustics, recognize previously observed conditions, and use this to pool adaptation data. We also describe a novel application of the Symmetric Kullback-Leibler distance metric that is used as a single solution to both the segmentation and clustering problems. The three components are evaluated through comparisons between the Partitioned and Unpartitioned components of the 1996 ARPA Hub 4 evaluation test set.

机译：最近，自动识别来自广播和电视源的广播提要变得越来越重要，尤其是随着诸如CMU Informedia系统[1]之类的系统的成功。在这项工作中，我们描述了将一个系统识别为一次识别一种话语的系统适应需要识别整个半小时节目的任务所面临的问题。我们将问题分为三个部分：细分，分类和聚类。我们表明，分割不需要广播数据中的声学条件和扬声器的先验知识。该系统能够检测到声学变化，识别先前观察到的状况，并以此来收集适应数据。我们还描述了对称Kullback-Leibler距离度量的一种新颖应用，该度量被用作分割和聚类问题的单个解决方案。通过比较1996 ARPA Hub 4评估测试集中的分区和未分区组件，对这三个组件进行了评估。

著录项

来源
《Proceedings of the speech recognition workshop》|1997年|97-99|共3页
会议地点 Chantilly VA(US)
作者
Matthew A. Siegler; Uday Jain; Bhiksha Raj; Richard M. Stern;
展开▼
作者单位

ECE Department - Speech Group Carnegie Mellon University Pittsburgh, PA 15213;

ECE Department - Speech Group Carnegie Mellon University Pittsburgh, PA 15213;

ECE Department - Speech Group Carnegie Mellon University Pittsburgh, PA 15213;

ECE Department - Speech Group Carnegie Mellon University Pittsburgh, PA 15213;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自然科学理论与方法论;自动模拟理论（自动仿真理论）;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic Segmentation and Classification of Audio Broadcast Data [J] . P. Dhanalakshmi, S. Palanivel, V. Ramalingam Asian Journal of Information Technology . 2010,第2期

机译：音频广播数据的自动分段和分类
2. Audio segmentation-by-classification approach based on factor analysis in broadcast news domain [J] . Diego Castán, Alfonso Ortega, Antonio Miguel, EURASIP journal on audio, speech, and music processing . 2014,第1期

机译：广播新闻领域中基于因子分析的音频分类方法
3. Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora [J] . Rongqing Huang, Hansen J.H.L. IEEE transactions on audio, speech and language processing . 2006,第3期

机译：广播新闻和NGSW语料库的无监督音频分类和分段的进展
4. Automatic Segmentation, Classification and Clustering of Broadcast News Audio [C] . DARPA speech recognition workshop . 1997

机译：自动分割，广播新闻音频分类和聚类
5. Automatic segmentation, indexing and retrieval of audiovisual data based on combined audio and visual content analysis. [D] . Zhang, Tong. 1999

机译：基于组合的视听内容分析，对视听数据进行自动分段，索引和检索。
6. Lung Lesion Detection in CT Scan Images Using the Fuzzy Local Information Cluster Means (FLICM) Automatic Segmentation Algorithm and Back Propagation Network Classification [O] . M Lavanya, P Muthu Kannan 2017

机译：使用模糊局部信息聚类均值（FLICM）自动分割算法和反向传播网络分类的CT扫描图像中的肺部病变检测
7. Audio Segmentation, Classification and Clustering in a Broadcast News Task [O] . Hugo Meinedo, Joao Neto 2003

机译：广播新闻任务中的音频分段，分类和聚类

Automatic Segmentation, Classification and Clustering of Broadcast News Audio

摘要

著录项

相似文献

相关主题

期刊订阅