Named Entity Recognition for Chinese Social Media with Domain Adversarial Training and Language Modeling

机译：基于领域对抗训练和语言建模的中国社交媒体命名实体识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent years have seen a surge of interest in natural language processing (NLP) for social media because the massive unstructured data from social media provide valuable information. However, natural language processing in this domain often suffers from the lack of large scale labeled data used for building models. In this paper, we focus specifically on the task of named entity recognition (NER) for Chinese social media. We propose a neural network model for domain adaptation which builds on domain-adversarial training and language modeling. The model is capable of learning from multiple sources of training data: labeled in-domain data, labeled out-of-domain data, as well as (large-scale) unlabeled in-domain data. To demonstrate the effectiveness of our approach, we experiment on an enlarged Chinese social media corpus. Results show-that the approach outperforms baselines significantly.

机译：近年来，社交媒体对自然语言处理（NLP）的兴趣激增，因为来自社交媒体的大量非结构化数据提供了有价值的信息。但是，该领域中的自然语言处理通常会遭受缺乏用于构建模型的大规模标记数据的困扰。在本文中，我们专门针对中国社交媒体的命名实体识别（NER）任务。我们提出了一种基于领域对抗训练和语言建模的领域适应神经网络模型。该模型能够从多种训练数据源中学习：带标签的域内数据，带标签的域外数据以及（大规模）无标签的域内数据。为了证明我们方法的有效性，我们在扩大的中国社交媒体语料库上进行了实验。结果表明，该方法明显优于基线。

著录项

来源
《International Conference on Artificial Neural Networks》|2019年|687-699|共13页
会议地点
作者
Yong Xu; Qi Lu; Muhua Zhu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Named entity recognition; Language model; Domain-adversarial training;

机译：命名实体识别;语言模型;领域对抗训练;

相似文献

外文文献
中文文献
专利

1. Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model [J] . Jingjing Xu, Hangfeng He, Xu Sun, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2018,第11期

机译：中国社交媒体中跨域和半监督的命名实体识别：统一模型
2. Event identification in web social media through named entity recognition and topic modeling [J] . Konstantinos N. Vavliakis, Andreas L. Symeonidis, Pericles A. Mitkas Data & Knowledge Engineering . 2013,第nova期

机译：通过命名实体识别和主题建模在网络社交媒体中进行事件识别
3. Joint Pre-Trained Chinese Named Entity Recognition Based on Bi-Directional Language Model [J] . Ma Changxia, Zhang Chen International Journal of Pattern Recognition and Artificial Intelligence . 2021,第9期

机译：基于双向语言模型的联合预先培训的中文命名实体识别
4. Named Entity Recognition for Chinese Social Media with Domain Adversarial Training and Language Modeling [C] . Yong Xu, Qi Lu, Muhua Zhu International Conference on Artificial Neural Networks . 2019

机译：用域对抗培训和语言建模的中国社交媒体命名实体识别
5. A data-intensive approach to named entity recognition using domain and language independent methods [D] . Osesina, Olukayode Isaac. 2010

机译：使用领域和语言无关的方法进行的数据密集型命名实体识别方法
6. Semi-Supervised Bidirectional Long Short-Term Memory and Conditional Random Fields Model for Named-Entity Recognition Using Embeddings from Language Models Representations [O] . Min Zhang, Guohua Geng, Jing Chen 2020

机译：使用语言模型表示的嵌入式识别命名实体识别的半监控双向短期内存和条件随机字段模型
7. A Double Adversarial Network Model for Multi-Domain and Multi-Task Chinese Named Entity Recognition [O] . Yun HU, Changwen ZHENG 2020

机译：多域和多任务中文命名实体识别的双对抗网络模型

Named Entity Recognition for Chinese Social Media with Domain Adversarial Training and Language Modeling

摘要

著录项

相似文献

相关主题

期刊订阅