Radical and Stroke-Enhanced Chinese Word Embeddings Based on Neural Networks

Shirui Wang; Wenan Zhou; Qiang Zhou

Abstract

The internal structural information of words has proven very effective for learning Chinese word embeddings. However, most previous approaches extract only a single form of internal feature to learn representations and ignore the comprehensive combination of such information. They also focus only on the explicit features of internal structures, even though these structures also carry implicit word semantics. In this paper, we propose Radical and Stroke-enhanced Word Embeddings (RSWE), a novel neural-network-based method for learning Chinese word embeddings with joint guidance from semantic and morphological internal information. RSWE enables an embedding model to learn simultaneously from (1) implicit semantic information exploited from radicals and (2) stroke n-gram information that can be obtained explicitly from Chinese words. During learning, RSWE uses stroke n-grams to capture the local structural features of words and integrates the implicit information exploited from radicals to enhance the semantics of the embeddings. Through this combination, the semantics of Chinese words are effectively transferred into the learned embeddings. We evaluate RSWE on word similarity computation, word analogy reasoning, performance across embedding dimensions, performance across training corpus sizes, and named entity recognition. The experimental results show that our model outperforms existing state-of-the-art approaches.
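To make the notion of stroke n-grams concrete, here is a minimal sketch of how contiguous stroke n-grams could be enumerated from a word's stroke sequence. It is not the authors' implementation: the function name `stroke_ngrams`, the encoding of strokes as a string of digit class ids, the example sequence, and the n-gram length range are all assumptions made for illustration only.

```python
from typing import List


def stroke_ngrams(strokes: str, n_min: int = 3, n_max: int = 5) -> List[str]:
    """Return all contiguous stroke n-grams with n_min <= n <= n_max.

    `strokes` is assumed to be a word's stroke sequence, one character per
    stroke, where each character is a (hypothetical) stroke-class id.
    """
    grams: List[str] = []
    for n in range(n_min, n_max + 1):
        # Slide a window of length n over the stroke sequence.
        for i in range(len(strokes) - n + 1):
            grams.append(strokes[i:i + n])
    return grams


# Placeholder stroke-class sequence; it does not encode any real word.
example = "13412"
print(stroke_ngrams(example, 3, 4))
# ['134', '341', '412', '1341', '3412']
```

In a model of the kind the abstract describes, each such n-gram would presumably have its own vector, and information derived from radicals would be combined with them; how RSWE does this is not specified in the abstract, so it is left out of the sketch.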

Bibliographic details

Source: Neural Processing Letters, 2020, Issue 2, pp. 1109-1121 (13 pages)
Authors: Shirui Wang; Wenan Zhou; Qiang Zhou
Author affiliations:

Department of Computer Science, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, 100876 Beijing, China (listed for each of the three authors)

Original format: PDF
Language: English
Keywords:
Word embeddings; Internal structure; Neural networks;

Similar literature

1. Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks [J]. Onan Aytug. Concurrency and computation: practice and experience. 2021, Issue 23
2. Enhancing unsupervised neural networks based text summarization with word embedding and ensemble learning [J]. Alami Nabil, Meknassi Mohammed, En-nahnahi Noureddine. Expert Systems with Applications. 2019, Issue JUNa
3. word2set: WordNet-Based Word Representation Rivaling Neural Word Embedding for Lexical Similarity and Sentiment Analysis [J]. Jimenez Sergio, Gonzalez Fabio A., Gelbukh Alexander. IEEE computational intelligence magazine. 2019, Issue 2
4. Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation [C]. Chunqi Wang, Bo Xu. International joint conference on natural language processing. 2017
5. FPGA-based Accelerators for Convolutional Neural Networks on Embedded Devices [D]. Perera Miro, Jordi. 2020
6. Identifying antimicrobial peptides using word embedding with deep recurrent neural networks [O]. Md-Nafiz Hamid, Iddo Friedberg
7. Text Classification Based on Convolutional Neural Networks and Word Embedding for Low-Resource Languages: Tigrinya [O]. Awet Fesseha, Shengwu Xiong, Eshete Derb Emiru. 2021
