Conference: 9th International Conference on Language Resources and Evaluation

An Out-of-Domain Test Suite for Dependency Parsing of German

Abstract

We present a dependency conversion of five German test sets from five different genres. The dependency representation is made as similar as possible to that of TiGer, one of the two large syntactic treebanks of German. The purpose of these test sets is to enable researchers to test dependency parsing models on several data sets from different text genres. We discuss some easy-to-compute statistics that demonstrate the variation and differences among the test sets, and we provide baseline experiments that test the effect of additional lexical knowledge on the out-of-domain performance of two state-of-the-art dependency parsers. Finally, we demonstrate with three small experiments that text normalization may be an important step in the standard processing pipeline when applied in an out-of-domain setting.
