Learning pronunciations from acoustic sequences

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for learning pronunciations from acoustic sequences. One method includes receiving an acoustic sequence, the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the time steps: processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output for the time step using a phoneme output layer to generate a phoneme representation for the acoustic feature representation for the time step; and processing the recurrent output for the time step using a grapheme output layer to generate a grapheme representation for the acoustic feature representation for the time step; and extracting, from the phoneme and grapheme representations for the acoustic feature representations at each time step, a respective pronunciation for each of one or more words.
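The data flow in the abstract can be illustrated with a minimal, untrained sketch in plain Python. It is not the patented implementation. Everything named here is an assumption made for illustration: the tiny `PHONEMES` and `GRAPHEMES` inventories, the `"_"` blank symbol, the use of a space grapheme as a word boundary, the single Elman-style recurrent layer, and the helper names `SharedRecurrentModel`, `best_labels`, and `extract_pronunciations`. The abstract itself does not say how the two per-time-step representations are decoded or aligned; the sketch assumes a CTC-like "collapse repeats, drop blanks" rule applied within each word's span of time steps.

```python
import math
import random

# Hypothetical label inventories; "_" is an assumed blank (no-label) symbol.
PHONEMES = ["_", "k", "ae", "t"]
GRAPHEMES = ["_", " ", "c", "a", "t"]  # " " is assumed to mark a word boundary


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class SharedRecurrentModel:
    """One recurrent layer whose output feeds two separate output layers."""

    def __init__(self, n_feat, n_hidden, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        self.w_in = rand_matrix(n_hidden, n_feat, rng)
        self.w_rec = rand_matrix(n_hidden, n_hidden, rng)
        self.w_phon = rand_matrix(len(PHONEMES), n_hidden, rng)    # phoneme output layer
        self.w_graph = rand_matrix(len(GRAPHEMES), n_hidden, rng)  # grapheme output layer

    def forward(self, acoustic_sequence):
        h = [0.0] * self.n_hidden
        phon_out, graph_out = [], []
        for features in acoustic_sequence:
            pre = [a + b for a, b in zip(matvec(self.w_in, features),
                                         matvec(self.w_rec, h))]
            h = [math.tanh(x) for x in pre]  # recurrent output for this time step
            phon_out.append(softmax(matvec(self.w_phon, h)))
            graph_out.append(softmax(matvec(self.w_graph, h)))
        return phon_out, graph_out


def best_labels(distributions, symbols):
    """Pick the highest-scoring symbol at each time step."""
    return [symbols[max(range(len(d)), key=d.__getitem__)] for d in distributions]


def collapse(labels):
    """Merge consecutive repeats, then drop blanks (assumed CTC-like rule)."""
    out, prev = [], None
    for label in labels:
        if label != prev and label != "_":
            out.append(label)
        prev = label
    return out


def extract_pronunciations(phon_labels, graph_labels):
    """Cut the time axis at grapheme word boundaries and pair each word
    with the phonemes emitted over the same span of time steps."""
    results, start = [], 0
    for t in range(len(graph_labels) + 1):
        if t == len(graph_labels) or graph_labels[t] == " ":
            word = "".join(collapse(graph_labels[start:t]))
            if word:
                results.append((word, collapse(phon_labels[start:t])))
            start = t + 1
    return results
```

With hand-written per-time-step labels, `extract_pronunciations(["k", "k", "_", "ae", "t", "_", "ae", "ae", "t"], ["c", "_", "a", "a", "t", " ", "a", "t", "t"])` pairs the word "cat" with the phonemes k, ae, t and the word "at" with ae, t. Running `best_labels` on the outputs of an untrained `SharedRecurrentModel` produces arbitrary labels; the model class is there only to show the shared recurrent output feeding both output layers.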

Bibliographic data

Publication number: US10127904B2
Publication date: 2018-11-13
Original format: PDF
Applicant/assignee: GOOGLE LLC
Application number: US201514811939
Inventors: KANURY KANISHKA RAO; HASIM SAK; OUAIS ALSHARIF; FRANCOISE BEAUFAYS
Filing date: 2015-07-29
Classification codes: G10L15/00; G10L15/187; G10L15/06; G06N3/04; G06N3/08; G10L15/16; G10L15/02
Country: US
Added to database: 2022-08-21 12:11:44