首页> 外国专利> SENTENCE EMBEDDING METHOD AND APPARATUS BASED ON SUBWORD EMBEDDING AND SKIP-THOUGHTS

SENTENCE EMBEDDING METHOD AND APPARATUS BASED ON SUBWORD EMBEDDING AND SKIP-THOUGHTS

机译:基于子词嵌入和跳过思想的句子嵌入方法及装置

摘要

Provided are sentence embedding method and apparatus based on subword embedding and skip-thoughts. To integrate skip-thought sentence embedding learning methodology with a subword embedding technique, a skip-thought sentence embedding learning method based on subword embedding and methodology for simultaneously learning subword embedding learning and skip-thought sentence embedding learning, that is, multitask learning methodology, are provided as methodology for applying intra-sentence contextual information to subword embedding in the case of subword embedding learning. This makes it possible to apply a sentence embedding approach to agglutinative languages such as Korean in a bag-of-words form. Also, skip-thought sentence embedding learning methodology is integrated with a subword embedding technique such that intra-sentence contextual information can be used in the case of subword embedding learning. A proposed model minimizes additional training parameters based on sentence embedding such that most training results may be accumulated in a subword embedding parameter.
机译:提供了一种基于子词嵌入和跳过思想的句子嵌入方法和装置。为了将跳字型句子嵌入学习方法与子词嵌入技术相集成,基于子字词嵌入的跳字型句子嵌入学习方法和用于同时学习子词嵌入学习和跳字型句子嵌入学习的方法,即多任务学习方法,提供的方法是在子词嵌入学习的情况下将句子内上下文信息应用于子词嵌入的方法。这使得可以将句子嵌入方法应用于词缀形式的朝鲜语等凝集语言。另外,跳字句子嵌入学习方法与子词嵌入技术集成在一起,因此在子词嵌入学习的情况下可以使用句子内上下文信息。所提出的模型基于句子嵌入来最小化额外的训练参数,使得大多数训练结果可以累积在子词嵌入参数中。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号