...
【24h】

A Cascaded Unsupervised Model for PoS Tagging

机译:POS标记的级联无监督模型

获取原文
获取原文并翻译 | 示例
           

摘要

Part of speech (PoS) tagging is one of the fundamental syntactic tasks in Natural Language Processing, as it assigns a syntactic category to each word within a given sentence or context (such as noun, verb, adjective, etc.). Those syntactic categories could be used to further analyze the sentence-level syntax (e.g., dependency parsing) and thereby extract the meaning of the sentence (e.g., semantic parsing). Various methods have been proposed for learning PoS tags in an unsupervised setting without using any annotated corpora. One of the widely used methods for the tagging problem is log-linear models. Initialization of the parameters in a log-linear model is very crucial for the inference. Different initialization techniques have been used so far. In this work, we present a log-linear model for PoS tagging that uses another fully unsupervised Bayesian model to initialize the parameters of the model in a cascaded framework. Therefore, we transfer some knowledge between two different unsupervised models to leverage the PoS tagging results, where a log-linear model benefits from a Bayesian model's expertise. We present results for Turkish as a morphologically rich language and for English as a comparably morphologically poor language in a fully unsupervised framework. The results show that our framework outperforms other unsupervised models proposed for PoS tagging.
机译:语音(POS)标记的一部分是自然语言处理中的基本句法任务之一,因为它为给定句子或上下文中的每个单词(例如名词,动词,形容词等)分配句法类别。这些句法类别可用于进一步分析句子级语法(例如,依赖解析),从而提取句子的含义(例如,语义解析)。已经提出了各种方法,用于在不使用任何注释的语料库的情况下在无监督的环境中学习POS标签。用于标记问题的广泛使用的方法之一是对数线性模型。对数线性模型中的参数的初始化对于推理非常重要。到目前为止已经使用了不同的初始化技术。在这项工作中,我们为POS标记提供了一个对数线性模型,它使用另一个完全无监督的贝叶斯模型来初始化级联框架中模型的参数。因此,我们在两种不同无监督模型之间传输一些知识来利用POS标记结果,其中从贝叶斯模型的专业知识中获益。我们向土耳其语作为形态学丰富的语言和英语作为一种完全无人监督的框架的语言呈现出色的富含语言。结果表明,我们的框架优于为POS标记提出的其他无人监督的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号