首页> 外文会议>Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >PhoNLP: A joint multi-task learning model for Vietnamese part-of-speech tagging, named entity recognition and dependency parsing
【24h】

PhoNLP: A joint multi-task learning model for Vietnamese part-of-speech tagging, named entity recognition and dependency parsing

机译:Phonlp:越南语术语标记的联合多任务学习模型,名为实体识别和依赖解析

获取原文

摘要

We present the first multi-task learning model-named PhoNLP-for joint Vietnamese part-of-speech (POS) tagging, named entity recognition (NER) and dependency parsing. Experiments on Vietnamese benchmark datasets show that PhoNLP produces state-of-the-art results, outperforming a single-task learning approach that fine-tunes the pre-trained Vietnamese language model PhoBERT (Nguyen and Nguyen, 2020) for each task independently. We publicly release PhoNLP as an open-source toolkit under the Apache License 2.0. Although we specify PhoNLP for Vietnamese, our PhoNLP training and evaluation command scripts in fact can directly work for other languages that have a pre-trained BERT-based language model and gold annotated corpora available for the three tasks of POS tagging, NER and dependency parsing. We hope that PhoNLP can serve as a strong baseline and useful toolkit for future NLP research and applications to not only Vietnamese but also the other languages.
机译:我们介绍了第一个多任务学习模型名为phonlp-for联合越南语部分 - 语音(pos)标记,命名实体识别(ner)和依赖关系解析。 越南基准数据集的实验表明,PhonlP会产生最先进的结果,优先表现出单一任务学习方法,可以独立调整每项任务的预先培训的越南语模型Phobert(Nguyen和Nguyen,2020)。 我们将PhonlP公开作为Apache许可证2.0下的开源工具包释放。 虽然我们为越南语指定了Phonlp,但我们的PhonlP培训和评估命令脚本实际上可以直接为其他语言工作,这些语言可以为POS标记的三个任务提供预先训练的BERT的语言模型和Gold注释语言,但是 。 我们希望Phonlp可以作为未来NLP研究和应用程序的强大基线和有用的工具包,不仅是越南语,而且是其他语言。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号