Journal: International Journal of Computer Processing of Oriental Languages

Examining Cf-ranking Methods for Text Structuring and Pronominalization in Korean



Abstract

Centering Theory assumes that the entities realized in an utterance can be ranked by their relative salience. This ranking, called Cf-ranking (the ranking of forward-looking centers), determines how likely each entity realized in an utterance is to be the center of the subsequent utterance, and it is one of the central issues in the Centering literature. This paper addresses Cf-ranking in Korean at the levels of text structuring and pronominalization for coherent text generation. For text structuring, we compare several Cf-ranking methods using various Centering-based metrics that evaluate the local coherence of text; for pronominalization, we compare them using previous Centering-based pronoun generation rules and our own pronominalization model. In almost all previous work, surface word order was not used as the sole basis for Cf-ranking; instead, it served to supplement a main ranking scheme, on the grounds that linear order alone does not perform well. However, this study shows that, owing to the characteristics of the Korean language, ranking by surface word order outperforms every other ranking method on most of the Centering-based metrics that depend on Cf-ranking, and that it is also reliable in terms of pronominalization accuracy. Additionally, we found that, given Cf-ranking by surface word order, the most effective strategy for text structuring is simply to maximize the number of adjacent utterance pairs whose first realized nominal entities are identical.
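The text-structuring criterion in the final sentence of the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each utterance is already reduced to a list of its nominal entities in surface order, and the function names (`rank_by_surface_order`, `count_first_entity_matches`, `best_ordering`), the toy entities, and the exhaustive search over orderings are all hypothetical choices made for illustration only.

```python
from itertools import permutations
from typing import List, Sequence


def rank_by_surface_order(entities: Sequence[str]) -> List[str]:
    """Cf-ranking by surface word order: first-mention order, repeats dropped.

    Assumption: `entities` lists the nominal entities of one utterance
    in the order they appear on the surface.
    """
    seen = set()
    ranked = []
    for entity in entities:
        if entity not in seen:
            seen.add(entity)
            ranked.append(entity)
    return ranked


def count_first_entity_matches(utterances: Sequence[Sequence[str]]) -> int:
    """Count adjacent utterance pairs whose first realized entity is identical."""
    count = 0
    for prev, curr in zip(utterances, utterances[1:]):
        cf_prev = rank_by_surface_order(prev)
        cf_curr = rank_by_surface_order(curr)
        # Utterances with no nominal entity cannot contribute a match.
        if cf_prev and cf_curr and cf_prev[0] == cf_curr[0]:
            count += 1
    return count


def best_ordering(utterances: Sequence[Sequence[str]]):
    """Brute-force search for an ordering that maximizes the count.

    Hypothetical helper: exhaustive permutation is only feasible for a
    handful of utterances and is not claimed to be the paper's search method.
    """
    return max(permutations(utterances), key=count_first_entity_matches)


# Toy example with made-up entities (not data from the paper).
shuffled = [
    ["John", "store"],
    ["milk", "fridge"],
    ["John", "milk"],
    ["milk", "John"],
]
print(count_first_entity_matches(shuffled))
print(count_first_entity_matches(best_ordering(shuffled)))
```

Under these assumptions, the score depends only on the highest-ranked entity of each utterance, which is why ranking by surface word order makes the criterion cheap to compute: no grammatical-role analysis is needed to identify that entity.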
