Using Boosting and POS Word Graph Tagging to Improve Speech Recognition

Abstract

The word graphs produced by a large vocabulary speech recognition system usually contain a path labelled with the correct utterance, but this is not always the highest scoring path. Boosting increases the probability of words which occur often in the word graph, which are in some sense robust. Adding syntactic information allows rescoring of arc probabilities with the possibility that more grammatical word sequences will also be the correct ones. A theory is developed which allows general probabilistic syntactic models to be used to rescore word lattices. Experiments conducted on the Wall Street Journal (WSJ) corpus with a version of the AT&T 1995 FST LVSR system with part-of-speech (POS) trigram sequences show that using only POS leads to a loss in performance. Boosting alone provides an improvement in performance which is not statistically significant. Cascading the two methods, boosting first and then using syntactic information, improves performance by 4.5% relative on a large portion of the 1995 DARPA test set.
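
The boosting idea in the abstract can be illustrated with a toy sketch. The abstract does not give the paper's actual boosting formula, so everything below is an assumption made for illustration: the `Arc` type, the rule that multiplies each arc probability by the word's arc count raised to a hypothetical exponent `alpha`, the per-node renormalization, and the example lattice. It shows only how favoring words that occur often in a word graph can change which path scores highest.

```python
from collections import Counter, defaultdict
from typing import NamedTuple


class Arc(NamedTuple):
    src: int      # source node (assumed: nodes numbered so that src < dst)
    dst: int      # destination node
    word: str     # word label on the arc
    prob: float   # arc probability given the source node


def boost(arcs, alpha=1.0):
    """Illustrative boosting rule (an assumption, not the paper's formula):
    scale each arc by count(word) ** alpha, then renormalize per source node."""
    counts = Counter(a.word for a in arcs)
    raw = [a.prob * counts[a.word] ** alpha for a in arcs]
    totals = defaultdict(float)
    for a, w in zip(arcs, raw):
        totals[a.src] += w
    return [a._replace(prob=w / totals[a.src]) for a, w in zip(arcs, raw)]


def best_path(arcs, start, end):
    """Highest-probability path in an acyclic word graph with src < dst."""
    best = {start: (1.0, [])}
    # Sorting by source node visits nodes in topological order.
    for a in sorted(arcs, key=lambda a: a.src):
        if a.src in best:
            p, words = best[a.src]
            cand = (p * a.prob, words + [a.word])
            if a.dst not in best or cand[0] > best[a.dst][0]:
                best[a.dst] = cand
    return best[end]


# Hypothetical lattice: "right" appears on two arcs (two time alignments),
# "write" on one arc that initially has the highest probability.
arcs = [
    Arc(0, 1, "write", 0.4),
    Arc(0, 1, "right", 0.3),
    Arc(0, 2, "right", 0.3),
    Arc(1, 3, "now", 1.0),
    Arc(2, 3, "now", 1.0),
]

print(best_path(arcs, 0, 3)[1])         # best word sequence before boosting
print(best_path(boost(arcs), 0, 3)[1])  # best word sequence after boosting
```

In this toy lattice the single "write" arc wins before boosting, while the word "right", which occurs on two arcs, wins afterwards. A syntactic rescoring stage such as the POS trigram model mentioned in the abstract would be applied to the boosted graph as a further step and is not sketched here.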

Bibliographic details

Source: European Conference on Speech Communication and Technology, v.3; 3-7 September 2001; Aalborg, DK | 2001 | pp. 2143-2146 | 4 pages
Conference location: Aalborg (DK)
Authors: Christer Samuelsson; James L. Hieronymus
Author affiliation: Xerox Research Centre Europe, 6 chemin de Maupertuis, 38240 Meylan, France
Original format: PDF
Language: English
Chinese Library Classification: Communication theory

