首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >An Integrated Top-Down/Bottom-Up Approach To Speaker Diarization
【24h】

An Integrated Top-Down/Bottom-Up Approach To Speaker Diarization

机译:集成的自上而下/自下而上的方法来实现扬声器音质化

获取原文

摘要

Most speaker diarization systems fit into one of two categories: bottom-up or top-down. Bottom-up systems are the most popular but can sometimes suffer from instability from merging and stopping criteria difficulties. Top-down systems deliver competitive results but are particularly prone to poor model initialization which often leads to large variations in performance. This paper presents a new integrated bottom-up/top-down approach to speaker diarization which aims to harness the strengths of each system and thus to improve performance and stability. In contrast to previous work, here the two systems are fused at the heart of the segmentation and clustering stage. Experimental results show improvements in speaker diarization performance for both meeting and TV-show domain data indicating increased intra and inter-domain stability. On the TV-show data in particular, an average relative improvement of 32% DER is obtained.
机译:大多数扬声器二元化系统属于以下两种类别之一:自下而上或自上而下。自下而上的系统是最流行的系统,但有时会因合并和停止标准困难而遭受不稳定的困扰。自上而下的系统可提供有竞争力的结果,但特别容易导致模型初始化效果不佳,而这通常会导致性能出现较大差异。本文提出了一种新的自下而上/自上而下的集成方法,以实现说话人的区分,目的是利用每个系统的优势,从而提高性能和稳定性。与以前的工作相比,这里的两个系统是分割和聚类阶段的核心。实验结果表明,针对会议和电视节目域数据,说话人的区分性能得到了改善,表明域内和域间稳定性有所提高。特别是在电视节目数据上,获得了DER的32%的平均相对改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号