Unsupervised speaker segmentation of multi-speaker speech data
Abstract
Systems and methods are described for unsupervised segmentation of multi-speaker speech or audio data by speaker. A front-end analysis is applied to the input speech data to obtain feature vectors. The speech data is initially segmented, and the segments are then clustered into groups that correspond to different speakers. The clusters are iteratively modeled and resegmented to obtain stable speaker segmentations. The overlap between segmentation sets is checked to ensure successful speaker segmentation; overlapping segments are combined, remodeled, and resegmented. Optionally, the speech data is processed to produce a segmentation lattice that maximizes the overall segmentation likelihood.
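The segment, cluster, model, and resegment loop summarized in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes fixed-length initial segments, k-means clustering on segment means, and one diagonal-covariance Gaussian per speaker; the abstract specifies none of these. All function names and parameters (`segment_by_speaker`, `seg_len`, and so on) are hypothetical, the speaker count is assumed to be known in advance, and the front-end analysis, the overlap check between segmentation sets, and the segmentation lattice are not covered.

```python
import numpy as np

def initial_segments(n_frames, seg_len):
    """Cut the frame axis into fixed-length segments (assumed initialization)."""
    return [(s, min(s + seg_len, n_frames)) for s in range(0, n_frames, seg_len)]

def cluster_segments(features, segments, n_speakers, n_iter=20):
    """Group segments by k-means on their mean feature vectors (assumed clustering)."""
    means = np.array([features[a:b].mean(axis=0) for a, b in segments])
    # Farthest-point initialization keeps the sketch deterministic.
    centers = [means[0]]
    for _ in range(1, n_speakers):
        d = np.min([np.sum((means - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(means[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(n_iter):
        dists = ((means[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for k in range(n_speakers):
            if np.any(labels == k):
                centers[k] = means[labels == k].mean(axis=0)
    return labels

def resegment(features, segments, labels, n_speakers, max_iter=10):
    """Alternate between modeling each cluster and relabeling segments until stable."""
    for _ in range(max_iter):
        frame_labels = np.concatenate(
            [np.full(b - a, l) for (a, b), l in zip(segments, labels)]
        )
        scores = np.full((len(segments), n_speakers), -np.inf)
        for k in range(n_speakers):
            x = features[frame_labels == k]
            if len(x) == 0:
                continue  # an empty cluster cannot win any segment
            mean, var = x.mean(axis=0), x.var(axis=0) + 1e-6
            for i, (a, b) in enumerate(segments):
                diff = features[a:b] - mean
                # Log-likelihood of the segment under a diagonal Gaussian.
                scores[i, k] = -0.5 * np.sum(np.log(2 * np.pi * var) + diff ** 2 / var)
        new_labels = scores.argmax(axis=1)
        if np.array_equal(new_labels, labels):
            break  # segmentation is stable
        labels = new_labels
    return labels

def merge_turns(segments, labels):
    """Join adjacent segments that carry the same speaker label."""
    turns = []
    for (a, b), l in zip(segments, labels):
        if turns and turns[-1][2] == int(l) and turns[-1][1] == a:
            turns[-1] = (turns[-1][0], b, int(l))
        else:
            turns.append((a, b, int(l)))
    return turns

def segment_by_speaker(features, n_speakers, seg_len=50):
    """Return (start_frame, end_frame, speaker_label) turns for a feature matrix."""
    segments = initial_segments(len(features), seg_len)
    labels = cluster_segments(features, segments, n_speakers)
    labels = resegment(features, segments, labels, n_speakers)
    return merge_turns(segments, labels)
```

Calling `segment_by_speaker(features, n_speakers=2)` on a `(n_frames, n_dims)` array of feature vectors returns a list of speaker turns. A practical system would typically use richer speaker models and frame-level rather than segment-level boundaries; the single-Gaussian choice here only keeps the example short.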