Conference: 9th International Conference on Language Resources and Evaluation

The Interplay Between Lexical and Syntactic Resources in Incremental Parsebanking

Abstract

Automatic syntactic analysis of a corpus requires detailed lexical and morphological information that cannot always be harvested from traditional dictionaries. In building the INESS Norwegian treebank, it is often the case that necessary lexical information is missing in the morphology or lexicon. The approach used to build the treebank is incremental parsebanking; a corpus is parsed with an existing grammar, and the analyses are efficiently disambiguated by annotators. When the intended analysis is unavailable after parsing, the reason is often that necessary information is not available in the lexicon. INESS has therefore implemented a text preprocessing interface where annotators can enter unrecognized words before parsing. This may concern words that are unknown to the morphology and/or lexicon, and also words that are known, but for which important information is missing. When this information is added, either during text preprocessing or during disambiguation, the result is that after reparsing the intended analysis can be chosen and stored in the treebank. The lexical information added to the lexicon in this way may be of great interest both to lexicographers and to other language technology efforts, and the enriched lexical resource being developed will be made available at the end of the project.
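
The parse, disambiguate, enrich, and reparse cycle described in the abstract can be illustrated with a toy sketch. Everything below is hypothetical and is not the INESS implementation: `ToyLexicon`, `toy_parse`, `parsebank_sentence`, and the feature labels (`verb-trans`, etc.) are made-up names, the "grammar" simply enumerates one feature choice per word, and the annotator is modelled as a callback that supplies missing lexical information.

```python
from dataclasses import dataclass, field
from itertools import product


@dataclass
class ToyLexicon:
    """Hypothetical lexicon: maps a word form to a set of feature labels."""
    entries: dict[str, set[str]] = field(default_factory=dict)

    def unknown_words(self, tokens):
        # Words the lexicon has no entry for at all.
        return [t for t in tokens if t not in self.entries]

    def add(self, word, features):
        # Enrich an existing entry or create a new one.
        self.entries.setdefault(word, set()).update(features)


def toy_parse(tokens, lexicon):
    """Stand-in for a real grammar: one analysis per combination of
    per-word features; no analyses if any word is unknown."""
    if lexicon.unknown_words(tokens):
        return []
    choices = [[(t, f) for f in sorted(lexicon.entries[t])] for t in tokens]
    return [tuple(combo) for combo in product(*choices)]


def parsebank_sentence(tokens, lexicon, intended, supply_entry):
    """Parse; if the intended analysis is unavailable, add the missing
    lexical information and reparse once. Returns the analysis to store,
    or None if it is still unavailable."""
    analyses = toy_parse(tokens, lexicon)
    if intended not in analyses:
        for word, feature in intended:
            if feature not in lexicon.entries.get(word, set()):
                # The annotator (modelled as a callback) supplies the entry.
                lexicon.add(word, supply_entry(word))
        analyses = toy_parse(tokens, lexicon)
    return intended if intended in analyses else None


# Toy data: the verb is known, but its transitive reading is missing.
lexicon = ToyLexicon({"hun": {"pron"}, "leser": {"verb-intrans"}, "boka": {"noun"}})
tokens = ["hun", "leser", "boka"]
intended = (("hun", "pron"), ("leser", "verb-trans"), ("boka", "noun"))

stored = parsebank_sentence(tokens, lexicon, intended, lambda word: {"verb-trans"})
```

In this sketch the first parse lacks the intended analysis because `leser` has no transitive entry; once the callback adds it, reparsing makes the intended analysis selectable, and the enriched lexicon remains available for later sentences.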