The Journal of Neuroscience

How the Human Brain Recognizes Speech in the Context of Changing Speakers



Abstract

We understand speech from different speakers with ease, whereas artificial speech recognition systems struggle with this task. It is unclear how the human brain solves this problem. The conventional view is that speech message recognition and speaker identification are two separate functions, with message processing taking place predominantly in the left hemisphere and processing of speaker-specific information located in the right hemisphere. Here, we distinguish the contributions of specific cortical regions to speech recognition and speaker information processing by controlled manipulation of task and of resynthesized speaker parameters. Two functional magnetic resonance imaging studies provide evidence for a dynamic speech-processing network that questions the conventional view. We found that speech recognition regions in left posterior superior temporal gyrus/superior temporal sulcus (STG/STS) also encode speaker-related vocal tract parameters, which are reflected in the amplitude peaks of the speech spectrum, along with the speech message. Right posterior STG/STS responded specifically more strongly to a change in a speaker-related vocal tract parameter during a speech recognition task than during a voice recognition task. Left and right posterior STG/STS were functionally connected. Additionally, we found that speaker-related glottal fold parameters (e.g., pitch), which are not reflected in the amplitude peaks of the speech spectrum, are processed in areas immediately adjacent to primary auditory cortex, i.e., in areas earlier in the auditory hierarchy than STG/STS. Our results point to a network account of speech recognition, in which information about the speech message and about the speaker's vocal tract is combined to solve the difficult task of understanding speech from different speakers.

