首页> 外文会议>China National Conference on Computational Linguistics >Improving Sentence Classification by Multilingual Data Augmentation and Consensus Learning
【24h】

Improving Sentence Classification by Multilingual Data Augmentation and Consensus Learning

机译:通过多语言数据增强和共识学习改进句子分类

获取原文

摘要

Neural network based models have achieved impressive results on the sentence classification task. However, most of previous work focuses on designing more sophisticated network or effective learning paradigms on monolingual data, which often suffers from insufficient discriminative knowledge for classification. In this paper, we investigate to improve sentence classification by multilingual data augmentation and consensus learning. Comparing to previous methods, our model can make use of multilingual data generated by machine translation and mine their language-share and language-specific knowledge for better representation and classification. We evaluate our model using English (i.e., source language) and Chinese (i.e., target language) data on several sentence classification tasks. Very positive classification performance can be achieved by our proposed model.
机译:基于神经网络的模型在句子分类任务上取得了令人印象深刻的结果。 然而,以前的大多数工作侧重于在单机数据上设计更复杂的网络或有效的学习范例,这往往遭受了对分类的不充分的歧视知识。 在本文中,我们通过多语言数据增强和共识学习来调查改善句子分类。 与以前的方法相比,我们的模型可以利用机器翻译和挖掘它们的语言共享和语言特定知识生成的多语种数据,以更好的表示和分类。 我们使用关于几个句子分类任务的英语(即源语言)和中文(即目标语言)数据来评估我们的模型。 我们所提出的模型可以实现非常积极的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号