PCFG Induction for Unsupervised Parsing and Language Modelling


Abstract

The unsupervised induction of probabilistic context-free grammars (PCFGs) has attracted considerable attention in computational linguistics. Although the task is difficult, work in this area remains in demand because it can advance language parsing and modelling. In this work, we describe a new algorithm for PCFG induction that is based on a principled approach and is capable of inducing accurate yet compact artificial natural language grammars and typical context-free grammars. Moreover, the algorithm can work on large grammars and datasets, and it infers correctly even from small samples. Our analysis shows that the type of grammar induced by our algorithm is, in theory, capable of modelling natural language. One of our experiments shows that the algorithm can potentially outperform the state of the art in unsupervised parsing on the WSJ10 corpus.
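For readers unfamiliar with how a PCFG serves as a language model, the following is a minimal sketch of the standard inside (CYK-style) computation of a sentence's probability. It is not the induction algorithm described in the paper. The toy grammar, its probabilities, and the names `BINARY_RULES`, `LEXICAL_RULES`, and `sentence_probability` are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical toy PCFG in Chomsky normal form (invented for illustration).
# Binary rules: (parent, left child, right child) -> probability
BINARY_RULES = {
    ("S", "NP", "VP"): 1.0,
    ("VP", "V", "NP"): 1.0,
}
# Lexical rules: (parent, word) -> probability
LEXICAL_RULES = {
    ("NP", "dogs"): 0.5,
    ("NP", "cats"): 0.5,
    ("V", "chase"): 1.0,
}


def sentence_probability(words, start="S"):
    """Return the probability the toy PCFG assigns to `words` (inside algorithm)."""
    n = len(words)
    if n == 0:
        return 0.0
    # inside[(i, j)][A] = probability that nonterminal A derives words[i:j]
    inside = defaultdict(lambda: defaultdict(float))
    # Spans of length 1 are filled from the lexical rules.
    for i, w in enumerate(words):
        for (parent, word), p in LEXICAL_RULES.items():
            if word == w:
                inside[(i, i + 1)][parent] += p
    # Longer spans sum over every split point and every binary rule.
    for span in range(2, n + 1):
        for i in range(0, n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for (parent, left, right), p in BINARY_RULES.items():
                    left_p = inside[(i, k)].get(left, 0.0)
                    right_p = inside[(k, j)].get(right, 0.0)
                    if left_p and right_p:
                        inside[(i, j)][parent] += p * left_p * right_p
    return inside[(0, n)].get(start, 0.0)


if __name__ == "__main__":
    print(sentence_probability(["dogs", "chase", "cats"]))
```

Under this toy grammar the four derivable sentences share the probability mass equally, and any string outside the grammar receives probability zero. An induced grammar would be evaluated the same way, with rules and probabilities learned from data rather than written by hand.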


Bibliographic details

Source: Conference on Empirical Methods in Natural Language Processing, 2014, pp. 1353-1362 (10 pages)
Authors: James Scicluna; Colin de la Higuera
Original format: PDF

