首页> 外文学位 >Robust probabilistic predictive syntactic processing: Motivations, models, and applications.
【24h】

Robust probabilistic predictive syntactic processing: Motivations, models, and applications.

机译:鲁棒的概率预测句法处理:动机,模型和应用。

获取原文
获取原文并翻译 | 示例

摘要

This thesis presents a broad-coverage probabilistic top-down parser, and its application to the problem of language modeling for speech recognition. The parser builds fully connected derivations incrementally, in a single pass from left-to-right across the string. We argue that the parsing approach that we have adopted is well-motivated from a psycholinguistic perspective, as a model that captures probabilistic dependencies between lexical items, as part of the process of building connected syntactic structures. The basic parser and conditional probability models are presented, and empirical results are provided for its parsing accuracy on both newspaper text and spontaneous telephone conversations. Modifications to the probability model are presented that lead to improved performance. A new language model which uses the output of the parser is then defined. Perplexity and word error rate reduction are demonstrated over trigram models, even when the trigram is trained on significantly more data. Interpolation on a word-by-word basis with a trigram model yields additional improvements.
机译:本文提出了一种涵盖面广的概率自上而下的解析器,并将其应用于语音识别的语言建模问题。解析器在整个字符串中从左到右一次传递,以增量方式构建完全连接的派生。我们认为,从心理语言的角度出发,我们采用的解析方法是一种很好的动机,它是一种捕获词汇项之间的概率依存关系的模型,是构建关联句法结构的过程的一部分。给出了基本的解析器和条件概率模型,并提供了针对其在报纸文本和自发电话交谈中的解析准确性的实验结果。提出了对概率模型的修改,从而提高了性能。然后定义一个使用解析器输出的新语言模型。即使在大量的数据上训练了三字母组合,在三字母组合模型上也证明了困惑性和单词错误率的降低。使用三字母组合词模型在逐个单词的基础上进行插值会产生其他改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号