首页> 美国政府科技报告 >Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment.

【24h】

Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment.

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This study has focused on five complementary research tasks in the domain of audio, speech, language, and speaker recognition and processing. In the area of speaker recognition/identification (SID), advancements have been realized to address acoustic mismatch due to speaker overlap, language mismatch, channel/microphone/additive noise, speaker style (spoken vs. singing), speaker state (physical task stress), distant speech, and environment based (room reverberation). In language ID (LID), advancements have been shown for improved out-of-set language rejection, as well as integrated spectral and prosody based LID solutions. For co-channel and diarization, new algorithms based on gammatone subband frequency modulation was achieved. In diarization, robust speech activity detection based on a combination (Combo-SAD) feature stream was developed. New keyword spotting technology using phonological features as well as audio stream assessment for peak clipping and speaker height estimation were also developed. All algorithms were evaluated on various speech corpora from AFRL, CRSS-UTDallas, and publicly available.

著录项

作者
Hansen, J. H.;
展开▼
作者单位

展开▼
年度 2015
页码 1-134
总页数 134
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Speech recognition; Algorithms; Extraction; Identification; Language; Learning machines; Microphones; Noise(sound); Speech analysis; Stress(physiology); Speaker recognition; Language recognition; Overlap speech detection; Diarization; Automatic speech recognition; Keyword spotting; Speaker analysis; Audio waveform analysis; Audio analysis and information extraction; Pe35885g; Wucombjutd;

机译：语音识别;算法;提取;识别;语言;学习机器;麦克风;噪声（声音）;语音分析;压力（生理学）;说话人识别;语言识别;重叠语音检测;二叉化;自动语音识别;关键词识别;说话人分析;音频波形分析;音频分析和信息提取; pe35885g; Wucombjutd;

相似文献

外文文献
中文文献
专利

1. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE transactions on information and systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
2. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE Transactions on Information and Systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
3. Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification [J] . Po-Yi Shih, Po-Chuan Lin, Jhing-Fa Wang, Journal of network and computer applications . 2011,第5期

机译：强大的多说话者语音识别功能以及高度可靠的在线说话者自适应和识别功能
4. NIST Speech Processing Evaluations: LVCSR, Speaker Recognition, Language Recognition [C] . Martin, Alvin F., Garofolo, . 2007

机译：NIST语音处理评估：LVCSR，说话人识别，语言识别
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：识别消息和使者：仿生频谱分析可增强语音和说话者识别能力
7. Automatic Speech recognition, with large vocabulary, robustness, independence of speaker and multilingual processing [O] . CAON D. R. S. 2010

机译：自动语音识别，词汇量大，健壮性强，说话者独立且具有多语言处理能力

Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment.

摘要

著录项

相似文献

相关主题

期刊订阅