Conference: International Conference on Speech and Computer

Speaker Diarization: A Top-Down Approach Using Syllabic Phonology

Abstract

A top-down approach to speaker diarization is developed using a modified Baum-Welch algorithm. The HMM states combine phonemes according to their structural positions under syllabic phonological theory. By the nature of the structural phonology, there are at most 16 states and the transition matrix is sparse, allowing efficient decoding to structural phones. This addresses the issue of phoneme specificity in speaker diarization, namely that speaker similarities and differences are confounded by phonetic similarities and differences. We address this here without the expensive use of a complete set of individual phonemes. The voice activity detection (VAD) issue is likewise addressed, giving a new approach to VAD.
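
To illustrate why a small state set with a sparse transition matrix keeps decoding cheap, here is a minimal sketch of generic log-domain Viterbi decoding that only visits allowed transitions. It is not the modified Baum-Welch procedure from the paper, which the abstract does not spell out. The function name `viterbi_sparse`, the three toy states (silence, onset, nucleus), and all probabilities are assumptions made for illustration only.

```python
import math

def viterbi_sparse(log_emis, log_init, log_trans):
    """Most likely state path for an HMM with sparse transitions.

    log_emis:  T x S nested list of per-frame, per-state log-likelihoods
    log_init:  list of S initial log-probabilities
    log_trans: dict mapping destination state j to a list of
               (source state i, log p(i -> j)); absent pairs are forbidden
    """
    n_frames, n_states = len(log_emis), len(log_init)
    score = [log_init[s] + log_emis[0][s] for s in range(n_states)]
    back = []  # back[t-1][j] = best predecessor of state j at frame t
    for t in range(1, n_frames):
        new_score = [-math.inf] * n_states
        ptr = [0] * n_states
        for j in range(n_states):
            # Only the stored (allowed) predecessors are examined, so the
            # cost per frame scales with the number of nonzero transitions.
            for i, lp in log_trans.get(j, ()):
                cand = score[i] + lp
                if cand > new_score[j]:
                    new_score[j] = cand
                    ptr[j] = i
            new_score[j] += log_emis[t][j]
        score = new_score
        back.append(ptr)
    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path

# Hypothetical toy model: 0 = silence, 1 = onset, 2 = nucleus.
log = math.log
log_init = [0.0, -math.inf, -math.inf]           # always start in silence
log_trans = {
    0: [(0, log(0.5)), (2, log(0.3))],           # into silence
    1: [(0, log(0.5)), (2, log(0.3))],           # into onset
    2: [(1, log(1.0)), (2, log(0.4))],           # into nucleus
}
favoured = [0, 1, 2, 2, 0]                       # state each frame favours
log_emis = [[log(0.8) if s == f else log(0.1) for s in range(3)]
            for f in favoured]
path = viterbi_sparse(log_emis, log_init, log_trans)
print(path)  # [0, 1, 2, 2, 0]
```

With S states and E allowed transitions, each frame costs O(E) rather than O(S²), which is where sparsity pays off. Storing predecessors per destination state is one design choice among several; a compressed sparse matrix would serve equally well.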