AUTOMATIC SPEAKER CLUSTERING

机译：自动扬声器集群

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a fully automatic speaker clustering algorithm, which consists of three components: building a distance matrix based on Gaussian models of the acoustic segments; performing hierarchical clustering on the distance matrix with the prior assumption that consecutive segments should be more likely to come from the same speaker; and selecting the best clustering solution automatically by minimizing the within-cluster dispersion with some penalty against too many clusters. We applied this automatic speaker clustering technique in 1996 Hub4 evaluation, and the results show that it contributed significantly to the word error rate (WER) reduction in unsupervised adaptation. From our experiments, the algorithm seldom misclassifies segments from the same speaker into different clusters. We used the same clustering procedure for both partitioned evaluation (PE) and unpartitioned evaluation (UE) tests [1]. Experiments also show that this automatic speaker clustering algorithm improves unsupervised adaptation as much as the hand labeled ideal case where the clusters are generated based on true speaker, channel and background condition.

机译：本文提出了一种全自动的说话人聚类算法，它由三个部分组成：基于声学片段的高斯模型建立距离矩阵；在事先假设连续片段应该更可能来自同一说话者的前提下，对距离矩阵进行分层聚类；并通过最大程度地降低集群内部分散度（对过多集群造成一些损失）来自动选择最佳集群解决方案。我们在1996年Hub4评估中应用了这种自动的说话人聚类技术，结果表明，它在无监督适应中对降低词错误率（WER）起到了重要作用。从我们的实验来看，该算法很少将来自同一说话者的片段分类为不同的簇。我们对分区评估（PE）和未分区评估（UE）测试使用了相同的聚类过程[1]。实验还表明，这种自动的说话人聚类算法可以改善无监督的适应性，就像手工标记的理想情况一样，后者是根据真实的说话人，声道和背景条件生成聚类的。

著录项

来源
《Proceedings of the speech recognition workshop》|1997年|108-111|共4页
会议地点 Chantilly VA(US)
作者
Hubert Jin; Francis Kubala; Rich Schwartz;
展开▼
作者单位

BBN Systems and Technologies Cambridge, MA 02138;

BBN Systems and Technologies Cambridge, MA 02138;

BBN Systems and Technologies Cambridge, MA 02138;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自然科学理论与方法论;自动模拟理论（自动仿真理论）;
关键词

相似文献

外文文献
中文文献
专利

1. Bio-inspired Approach for Automatic Speaker Clustering Using Auditory Modeling and Self-Organizing Maps [J] . Anton A. Yakovenko, Galina F. Malykhina Procedia Computer Science . 2018,第5期

机译：使用听觉建模和自组织地图自动扬声器聚类生物启发方法
2. Automatic Speaker Clustering Using a Voice Characteristic Reference Space and Maximum Purity Estimation [J] . Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang IEEE transactions on audio, speech and language processing . 2007,第4期

机译：使用语音特征参考空间和最大纯度估计的自动扬声器聚类
3. High level speaker specific features modeling in automatic speaker recognition system [J] . Satyanand Singh International Journal of Electrical and Computer Engineering . 2020,第2期

机译：自动扬声器识别系统中高级扬声器特定功能造型
4. Automatic speaker clustering from multi-speaker utterances [C] . McLaughlin, J., Reynolds, . 1999

机译：通过多说话者说话自动聚集说话者
5. Finding Difficult Speakers in Automatic Speaker Recognition [D] . Stoll, Lara Lynn 2011

机译：在自动说话人识别中寻找困难的说话人
6. Cross-Clustering: A Partial Clustering Algorithm with Automatic Estimation of the Number of Clusters [O] . Paola Tellaroli, Marco Bazzi, Michele Donato, -1

机译：跨集群：具有自动估计集群数量的部分集群算法
7. Automatic Speaker Clustering Using A Voice Characteristic Reference Space And Maximum Purity Estimation [O] . Wei-ho Tsai, Shih-sian Cheng, Hsin-min Wang 2007

机译：使用语音特征参考空间和最大纯度估计的自动说话人聚类

AUTOMATIC SPEAKER CLUSTERING

摘要

著录项

相似文献

相关主题

期刊订阅