【24h】

Unified Semantic Parsing with Weak Supervision

机译:具有弱监督的统一语义解析

获取原文

摘要

Semantic parsing over multiple knowledge bases enables a parser to exploit structural similarities of programs across the multiple domains. However, the fundamental challenge lies in obtaining high-quality annotations of (utterance, program) pairs across various domains needed for training such models. To overcome this, we propose a novel framework to build a unified multi-domain enabled semantic parser trained only with weak supervision (denotations). Weakly supervised training is particularly arduous as the program search space grows exponentially in a multi-domain setting. To solve this, we incorporate a multi-policy distillation mechanism in which we first train domain-specific semantic parsers (teachers) using weak supervision in the absence of the ground truth programs, followed by training a single unified parser (student) from the domain specific policies obtained from these teachers. The resultant semantic parser is not only compact but also generalizes better, and generates more accurate programs. It further does not require the user to provide a domain label while querying. On the standard OVERNIGHT dataset (containing multiple domains), we demonstrate that the proposed model improves performance by 20% in terms of denotation accuracy in comparison to baseline techniques.
机译:通过对多个知识库的语义解析,解析器可以利用跨多个域的程序的结构相似性。然而,根本的挑战在于获得训练此类模型所需的各个领域的(话语,节目)对的高质量注释。为了克服这个问题,我们提出了一个新颖的框架来构建统一的多域启用语义分析器,该语法分析器仅在弱监督(表示)的情况下进行训练。由于程序搜索空间在多域设置中呈指数增长,因此缺乏监督的培训尤其艰巨。为了解决这个问题,我们引入了一种多策略提炼机制,在该机制中,我们首先在缺乏基本真理程序的情况下使用弱监督来训练特定于领域的语义解析器(教师),然后再训练来自该域的单个统一解析器(学生)从这些老师那里获得的具体政策。生成的语义解析器不仅紧凑,而且泛化效果更好,并生成更准确的程序。此外,它不要求用户在查询时提供域标签。在标准的OVERNIGHT数据集(包含多个域)上,我们证明了与基线技术相比,该模型在表示精度方面提高了20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号