word2set: WordNet-Based Word Representation Rivaling Neural Word Embedding for Lexical Similarity and Sentiment Analysis

Jimenez Sergio; Gonzalez Fabio A.; Gelbukh Alexander; Duenas George

Abstract

Measuring lexical similarity using WordNet has a long tradition. In the last decade, it has been challenged by distributional methods, and more recently by neural word embedding. In recent years, several larger lexical similarity benchmarks have been introduced, on which word embedding has achieved state-of-the-art results. The success of such methods has eclipsed the use of WordNet for predicting human judgments of lexical similarity. We propose a new set-cardinality-based method for measuring lexical similarity. The method exploits the WordNet graph to obtain a word representation, which we call word2set, based on related neighboring words. We show that the features extracted from set cardinalities computed using this word representation, when fed into a support vector regression model trained on a dataset of common synonyms and antonyms, produce results competitive with those of word-embedding approaches. On the task of predicting lexical sentiment polarity, our WordNet set-based representation significantly outperforms the classical measures and matches the performance of neural embeddings. Although word embedding is still the best approach for these tasks, our method significantly reduces the gap between the results of knowledge-based approaches and those of distributional representations, without requiring a large training corpus. It is also more effective for less frequent words.
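
The set-cardinality idea described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: `TOY_GRAPH` is a hypothetical hand-made graph standing in for WordNet, `word_to_set` and `cardinality_features` are hypothetical helper names, and the neighborhood definition (all nodes within a fixed number of hops) and the feature list are not taken from the paper, which may define both differently.

```python
from collections import deque

# Hypothetical toy graph standing in for WordNet relations (not real WordNet data).
TOY_GRAPH = {
    "car": {"automobile", "vehicle", "wheel"},
    "automobile": {"car", "vehicle", "engine"},
    "vehicle": {"car", "automobile", "truck"},
    "truck": {"vehicle", "wheel"},
    "wheel": {"car", "truck"},
    "engine": {"automobile"},
    "banana": {"fruit"},
    "fruit": {"banana"},
}


def word_to_set(word, graph, max_hops=1):
    """Return the word plus every node reachable within max_hops edges.

    Assumption: a breadth-first neighborhood is used here only as a
    stand-in for the paper's notion of "related neighboring words".
    """
    seen = {word}
    frontier = deque([(word, 0)])
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in graph.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return seen


def cardinality_features(set_a, set_b):
    """Compute simple set-cardinality features for a pair of word sets."""
    intersection = len(set_a & set_b)
    union = len(set_a | set_b)
    return {
        "size_a": len(set_a),
        "size_b": len(set_b),
        "intersection": intersection,
        "union": union,
        "jaccard": intersection / union if union else 0.0,
    }


if __name__ == "__main__":
    car = word_to_set("car", TOY_GRAPH)
    automobile = word_to_set("automobile", TOY_GRAPH)
    banana = word_to_set("banana", TOY_GRAPH)
    print(cardinality_features(car, automobile))
    print(cardinality_features(car, banana))
```

In a pipeline like the one the abstract describes, feature vectors of this kind would be passed to a regressor trained on word pairs with known similarity. The sketch stops at feature extraction so that it stays self-contained.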

Bibliographic details

Source
IEEE Computational Intelligence Magazine | 2019, no. 2 | pp. 41-53 | 13 pages
Authors
Jimenez Sergio; Gonzalez Fabio A.; Gelbukh Alexander; Duenas George
Author affiliations

Inst Caro & Cuervo Bogota DC Colombia;

Univ Nacl Colombia Mindlab Res Grp Bogota DC Colombia;

Inst Politecn Natl CIC Mexico City DF Mexico;

Inst Politecn Natl CIC Mexico City DF Mexico;

Original format: PDF
Language: English

Similar literature

1. word2set: WordNet-Based Word Representation Rivaling Neural Word Embedding for Lexical Similarity and Sentiment Analysis [J]. Jimenez Sergio, Gonzalez Fabio A., Gelbukh Alexander. IEEE Computational Intelligence Magazine. 2019, no. 2.
2. Aspect-Based Sentiment Analysis on Indonesian Restaurant Review Using a Combination of Convolutional Neural Network and Contextualized Word Embedding [J]. Putri Rizki Amalia, Edi Winarko. Indonesian Journal of Computing and Cybernetics Systems. 2021, no. 3.
3. Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks [J]. Onan Aytug. Concurrency and Computation: Practice and Experience. 2021, no. 23.
4. An Evaluation of Neural Machine Translation and Pre-trained Word Embeddings in Multilingual Neural Sentiment Analysis [C]. George Manias, Argyro Mavrogiorgou, Athanasios Kiourtis. International Conference on Progress in Informatics and Computing. 2020.
5. Improved GloVe Word Embedding Using Linear Weighting Scheme for Word Similarity Tasks [D]. Lu, Qinglan. 2021.
6. Lexical embeddings produce interference when they are morphologically unrelated to the words in which they are contained: Evidence from eye movements [O]. Kristin M. Weingartner, Barbara J. Juhasz, Keith Rayner.
7. Probabilistic Neural Network and Word Embedding for Sentiment Analysis [O]. Saqib Alam, Nianmin Yao. 2018.
