Conference proceedings: Natural language understanding and intelligent applications

A CDT-Styled End-to-End Chinese Discourse Parser

Abstract

Discourse parsing is a challenging task that plays a critical role in discourse analysis. Since the release of the Rhetorical Structure Theory Discourse Treebank (RST-DT) and the Penn Discourse Treebank (PDTB), research on English discourse parsing has attracted increasing attention and achieved considerable success. At the same time, some preliminary research has been conducted on certain subtasks of discourse parsing for other languages, such as Chinese. This paper introduces the Connective-driven Dependency Treebank (CDTB) corpus and then presents an end-to-end Chinese discourse parser that parses free text into the Connective-driven Dependency Tree (CDT) style. The parser consists of multiple components, including an elementary discourse unit detector, a discourse relation recognizer, a discourse parse tree generator, and an attribution labeler. In particular, the attribution labeler determines two attributions (sense and centering) for every non-terminal node in the discourse parse trees. An effective feature set is proposed for each component. Comprehensive experiments on the CDTB corpus yield an overall F1 score of 20.0%.
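
To make the pipeline shape in the abstract concrete, the following is a minimal Python sketch of how such components might be chained. It is not the authors' implementation. Every name here (`DiscourseNode`, `detect_edus`, `build_tree`, `label_attributions`, `parse`) is hypothetical, the punctuation-based segmentation and right-branching tree are stand-ins for the learned, feature-based components the paper describes, and the relation recognizer is omitted (the tree builder simply assumes adjacent units are related).

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DiscourseNode:
    """Illustrative tree node; not the CDTB annotation format."""
    text: str
    children: List["DiscourseNode"] = field(default_factory=list)
    sense: Optional[str] = None      # filled in only for non-terminal nodes
    centering: Optional[str] = None  # filled in only for non-terminal nodes

    @property
    def is_terminal(self) -> bool:
        return not self.children


def detect_edus(text: str) -> List[str]:
    """Stand-in EDU detector: cut after clause-final punctuation (assumption)."""
    return re.findall(r"[^,。;!?]+[,。;!?]?", text)


def build_tree(edus: List[str]) -> DiscourseNode:
    """Stand-in tree generator: a right-branching binary tree over the EDUs."""
    if not edus:
        raise ValueError("at least one EDU is required")
    nodes = [DiscourseNode(text=e) for e in edus]
    tree = nodes[-1]
    for node in reversed(nodes[:-1]):
        tree = DiscourseNode(text=node.text + tree.text, children=[node, tree])
    return tree


def label_attributions(node: DiscourseNode) -> None:
    """Stand-in labeler: mark every non-terminal node with placeholder values."""
    if node.is_terminal:
        return
    node.sense = "UNLABELED"      # a trained classifier would predict this
    node.centering = "UNLABELED"  # likewise a placeholder
    for child in node.children:
        label_attributions(child)


def parse(text: str) -> DiscourseNode:
    """Chain the stages: segment, build the tree, then label non-terminals."""
    tree = build_tree(detect_edus(text))
    label_attributions(tree)
    return tree
```

Calling `parse("他来了,我走了。")` under these assumptions returns a root node with two terminal children, where only the root carries `sense` and `centering` values. This reflects the abstract's statement that the two attributions apply to non-terminal nodes.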
