首页> 外文会议>Conference on empirical methods in natural language processing >PCFG Induction for Unsupervised Parsing and Language Modelling
【24h】

PCFG Induction for Unsupervised Parsing and Language Modelling

机译:用于无监督解析和语言建模的PCFG归纳法

获取原文

摘要

The task of unsupervised induction of probabilistic context-free grammars (PCFGs) has attracted a lot of attention in the field of computational linguistics. Although it is a difficult task, work in this area is still very much in demand since it can contribute to the advancement of language parsing and modelling. In this work, we describe a new algorithm for PCFG induction based on a principled approach and capable of inducing accurate yet compact artificial natural language grammars and typical context-free grammars. Moreover, this algorithm can work on large grammars and datasets and infers correctly even from small samples. Our analysis shows that the type of grammars induced by our algorithm are, in theory, capable of modelling natural language. One of our experiments shows that our algorithm can potentially outperform the state-of-the-art in unsupervised parsing on the WSJ10 corpus.
机译:概率性上下文无关文法(PCFG)的无监督归纳任务在计算语言学领域引起了很多关注。尽管这是一项艰巨的任务,但由于它可以促进语言解析和建模的发展,因此在此领域的工作仍然非常需求。在这项工作中,我们描述了一种基于有原则的方法的PCFG归纳的新算法,该算法能够引入准确而紧凑的人工自然语言语法和典型的无上下文语法。而且,该算法可以处理较大的语法和数据集,甚至可以从较小的样本中正确推断。我们的分析表明,从理论上讲,我们算法产生的语法类型能够对自然语言进行建模。我们的一项实验表明,在WSJ10语料库的无监督解析中,我们的算法有可能优于最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号