首页>
外国专利>
Speaker feature extraction apparatus and the speaker feature extraction method, speech recognition device, voice synthesis device, as well as, program recording medium
Speaker feature extraction apparatus and the speaker feature extraction method, speech recognition device, voice synthesis device, as well as, program recording medium
展开▼
机译:说话者特征提取设备和说话者特征提取方法,语音识别设备,语音合成设备以及程序记录介质
展开▼
页面导航
摘要
著录项
相似文献
摘要
PROBLEM TO BE SOLVED: To extract speaker characteristics with good accuracy from a smaller quantity of utterance data. SOLUTION: Acoustic models are stored in a first acoustic model storage section 7a to an n-th acoustic model storage section 7n by each of n pieces of speaker clusters in the acoustic model storage sections 7. The vocal tract length normalization coefficient αdetermined by estimating likelihood by equation (a) according to a reference of maximizing the likelihood of the acoustic models of learning speakers for the acoustic models of all the learning speakers by using a nonlinear frequency warping obtained by applying a correction factor β to vocal tract length normalization coefficient α is used for clustering of the learning speakers of this case as the distance between the respective learning speakers. The distances between the respective learning speakers are set in accordance with the information on the vocal tract lengths which are the fluctuating factors of the physiological characteristics and the correction information of the ways and habits of the utterance, by which the learning speakers are clustered with the speaker characteristics extracted with good accuracy by taking the speakers' habits into consideration from a smaller quantify of the utterance data as the distances between the respective learning speakers.
展开▼