An Integrated Top-Down/Bottom-Up Approach To Speaker Diarization

机译：集成的自上而下/自下而上的方法来实现扬声器音质化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most speaker diarization systems fit into one of two categories: bottom-up or top-down. Bottom-up systems are the most popular but can sometimes suffer from instability from merging and stopping criteria difficulties. Top-down systems deliver competitive results but are particularly prone to poor model initialization which often leads to large variations in performance. This paper presents a new integrated bottom-up/top-down approach to speaker diarization which aims to harness the strengths of each system and thus to improve performance and stability. In contrast to previous work, here the two systems are fused at the heart of the segmentation and clustering stage. Experimental results show improvements in speaker diarization performance for both meeting and TV-show domain data indicating increased intra and inter-domain stability. On the TV-show data in particular, an average relative improvement of 32% DER is obtained.

机译：大多数扬声器二元化系统属于以下两种类别之一：自下而上或自上而下。自下而上的系统是最流行的系统，但有时会因合并和停止标准困难而遭受不稳定的困扰。自上而下的系统可提供有竞争力的结果，但特别容易导致模型初始化效果不佳，而这通常会导致性能出现较大差异。本文提出了一种新的自下而上/自上而下的集成方法，以实现说话人的区分，目的是利用每个系统的优势，从而提高性能和稳定性。与以前的工作相比，这里的两个系统是分割和聚类阶段的核心。实验结果表明，针对会议和电视节目域数据，说话人的区分性能得到了改善，表明域内和域间稳定性有所提高。特别是在电视节目数据上，获得了DER的32％的平均相对改进。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.2654-2657|共4页
会议地点
作者
Simon Bozonnet; Nicholas Evans; Corinne Fredouille; Dong Wang; Raphaeel Troncy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
speaker diarization; speaker segmentation; speaker clustering; system combination; SDM;

机译：说话人差异化说话人细分;说话者聚类;系统组合; SDM;

相似文献

外文文献
中文文献
专利

1. A Comparative Study of Bottom-Up and Top-Down Approaches to Speaker Diarization [J] . Evans N., Bozonnet S., Dong Wang, Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第2期

机译：自下而上和自上而下的说话人差异化方法的比较研究
2. Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach [J] . Shum, S.H., Dehak, IEEE transactions on audio, speech and language processing . 2013,第10期

机译：说话人差异化的无监督方法：集成和迭代方法
3. Top-down and bottom-up: Front to back Comment on "Move me, astonish me ... delight my eyes and brain: The Vienna Integrated Model, of top-down and bottom-up processes in Art Perception (VIMAP) and corresponding affective, evaluative, and neurophysiological correlates" by Matthew Pelowski et al. [J] . Nadal Marcos, Skov Martin Physics of life reviews . 2017,第期

机译：自上而下和自下而上：前面回到“移动我，令我惊讶的是我的眼睛和大脑：维也纳综合模型，在艺术感知（VIMAP）的自上而下和自下而上的过程中的自上而下和自下而上的过程 Matthew Pelowski等人的情感，评价和神经生理学相关。
4. An Integrated Top-Down/Bottom-Up Approach To Speaker Diarization [C] . Simon Bozonnet, Nicholas Evans, Corinne Fredouille, Annual conference of the International Speech Communication Association . 2010

机译：一种综合的上下/扬声器深度的自下而上的方法
5. Top-down and bottom-up tools for integrated pest management in Northeastern hop production. [D] . Calderwood, Lily B. 2015

机译：自上而下和自下而上的工具，用于东北蛇麻草生产中的病虫害综合管理。
6. Evolutionary Steps in the Emergence of Life Deduced from the Bottom-Up Approach and GADV Hypothesis (Top-Down Approach) [O] . Kenji Ikehara 2016

机译：自下而上的方法和GADV假说（自上而下的方法）推论出生命出现的进化步骤
7. Linguistic influences on bottom-up and top-down clustering for speaker diarization [O] . Simon Bozonnet, Dong Wang, Nicholas Evans 2016

机译：语音对自上而下和自上而下聚类的影响对于说话人的疏导

An Integrated Top-Down/Bottom-Up Approach To Speaker Diarization

摘要

著录项

相似文献

相关主题

期刊订阅