System output combination for improved speaker diarization

机译：系统输出组合可改善扬声器的清晰度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

System combination or fusion is a popular, successful and sometimes straightforward means of improving performance in many fields of statistical pattern classification, including speech and speaker recognition. Whilst there is significant work in the literature which aims to improve speaker diarization performance by combining multiple feature streams, there is little work which aims to combine the outputs of multiple systems. This paper reports our first attempts to combine the outputs of two state-of-the-art speaker diarization systems, namely ICSI's bottom-up and LIA-EURECOM's top-down systems. We show that a cluster matching procedure reliably identifies corresponding speaker clusters in the two system outputs and that, when they are used in a new realignment and resegmentation stage, the combination leads to relative improvements of 13% and 7% DER on independent development and evaluation sets.

机译：系统组合或融合是一种在许多统计模式分类领域（包括语音和说话者识别）中提高性能的流行，成功且有时直接的方法。尽管在文献中有大量工作旨在通过组合多个特征流来提高说话者的二分音性能，但是很少有工作旨在组合多个系统的输出。本文报告了我们首次尝试结合两种最先进的扬声器二分系统的输出，即ICSI的自下而上和LIA-EURECOM的自上而下的系统。我们表明，一个群集匹配程序可以可靠地识别两个系统输出中的相应说话者群集，并且在新的重新调整和重新细分阶段使用它们时，该组合可以使独立开发和评估的DER相对提高13％和7％套。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.2650-2653|共4页
会议地点
作者
Simon Bozonnet; Nicholas Evans; Xavier Anguera; Oriol Vinyals; Gerald Friedland; Corinne Fredouille;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
speaker diarization; system combination; fusion;

机译：说话人差异化系统组合;融合;

相似文献

外文文献
中文文献
专利

1. Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study [J] . Mihelic France, Vesnicer Bostjan, Zibert Janez Journal of computing and information technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者区分系统的开发：一个案例研究
2. Development Of A Speaker Diarization System For Speaker Tracking In Audio Broadcast News: A Case Study [J] . Janez Zibert, Bostjan Vesnicer, France Mihelic Journal of Computing and Information Technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者差异化系统的开发：一个案例研究
3. Improving speaker diarization for naturalistic child-adult conversational interactions using contextual information [J] . Kumar Manoj, Kim So Hyun, Lord Catherine, The Journal of the Acoustical Society of America . 2020,第2期

机译：使用上下文信息改进讲话者深度自然的儿童成人对话交互
4. System output combination for improved speaker diarization [C] . Simon Bozonnet, Nicholas Evans, Xavier Anguera, Annual conference of the International Speech Communication Association . 2010

机译：改进扬声器日益改善的系统输出组合
5. Extension and combination of economic input output models for assessing critical infrastructure system interdependencies. [D] . Chen, Ping. 2006

机译：经济投入产出模型的扩展和组合，用于评估关键基础设施系统的相互依赖性。
6. Improving speaker diarization for naturalistic child-adult conversational interactions using contextual information [O] . Manoj Kumar, So Hyun Kim, Catherine Lord, -1

机译：使用上下文信息为自然主义的成人与儿童之间的对话互动提高说话者的区分能力
7. Multistream speaker diarization through Information Bottleneck system outputs combination [O] . Deepu Vijayasenan, Fabio Valente, Petr Motlicek 2015

机译：通过信息瓶颈系统输出组合的多音扬声器二值化

System output combination for improved speaker diarization

摘要

著录项

相似文献

相关主题

期刊订阅