Comprehensive analysis of aspect term extraction methods using various text embeddings

Lukasz Augustyniak; Tomasz Kajdanowicz; Przemyslaw Kazienko

首页> 外文期刊>Computer speech and language >Comprehensive analysis of aspect term extraction methods using various text embeddings

【24h】

Comprehensive analysis of aspect term extraction methods using various text embeddings

机译：使用各种文本嵌入的综合术语提取方法综合分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, a variety of model designs and methods have blossomed in the context of the sentiment analysis domain. However, there is still a lack of comprehensive studies of Aspect-based Sentiment Analysis. We want to fill this gap and propose a comparison with ablation analysis of Aspect Term Extraction using various text embeddings methods. We particularly focused on simple architectures based on long short-term memory (LSTM) with optional conditional random field (CRF) enhancement using different pre-trained word embeddings. Moreover, we analyzed the influence on the performance of extending the word vectorization step with character-based word embeddings. The experimental results on SemEval datasets revealed that bi-directional long short-term memory (BiLSTM) could be used as a very good predictor, even comparing to very sophisticated and complex models using huge word embeddings or language models. We presented a comprehensive analysis of various customizations of LSTM-based architecture and word/character embeddings that could be used as a guideline to choose the best model version for particular user needs.

机译：最近，各种模型设计和方法在情感分析域的背景下蓬勃发展。然而，仍然缺乏对基于宽度的情绪分析的综合研究。我们希望使用各种文本嵌入方法填补这种差距并提出与ASPETS术语提取的消融分析的比较。我们特别专注于使用不同预先训练的单词嵌入的可选条件随机字段（CRF）增强的长短短期内存（LSTM）的简单架构。此外，我们分析了对扩展与基于字符的单词嵌入来扩展字矢量化步骤的性能的影响。 Semeval Datasets的实验结果显示，双向长期短期记忆（BILSTM）可以用作非常好的预测因子，甚至与使用巨大的单词嵌入或语言模型的非常复杂和复杂的模型相比。我们对LSTM的架构和单词/字符嵌入的各种自定义进行了全面分析，可以用作选择特定用户需求的最佳型号版本的指导。

著录项

来源
《Computer speech and language》 |2021年第9期|101217.1-101217.19|共19页
作者
Lukasz Augustyniak; Tomasz Kajdanowicz; Przemyslaw Kazienko;
展开▼
作者单位

Department of Computational Intelligence Wroclaw University of Science and Technology Wroclaw Poland;

Department of Computational Intelligence Wroclaw University of Science and Technology Wroclaw Poland;

Department of Computational Intelligence Wroclaw University of Science and Technology Wroclaw Poland;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Aspect-based sentiment analysis; Aspect term extraction; Word embeddings; Character embeddings; LSTM; BiLSTM; CRF; SemEval;

机译：基于宽度的情绪分析;术语提取;单词嵌入式;字符嵌入式;LSTM;Bilstm;CRF;Semeval.;

相似文献

外文文献
中文文献
专利

1. Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier [J] . Manek Asha S., Shenoy P. Deepa, Mohan M. Chandra, World Wide Web . 2017,第2期

机译：使用Gini Index特征选择方法和SVM分类器提取用于大型电影评论中情感分析的方面项
2. A comprehensive method for multilingual video text detection, localization, and extraction [J] . Lyu M.R., Jiqiang Song, Min Cai IEEE Transactions on Circuits and Systems for Video Technology . 2005,第2期

机译：一种全面的多语言视频文本检测，本地化和提取方法
3. Multi-View Data Analysis and Concept Extraction Methods for Text [J] . Jean-Charles Lamirel Knowledge Organization . 2013,第5期

机译：文本的多视图数据分析和概念提取方法
4. An Unsupervised Multiple Word-Embedding Method with Attention Model for Cross Domain Aspect Term Extraction [C] . Ganpat Singh Chauhan, Yogesh Kumar Meena, Dinesh Gopalani, International Conference on Emerging Technologies in Computer Engineering: Machine Learning and Internet of Things . 2020

机译：跨域方面术语提取的带有注意力模型的无监督多词嵌入方法
5. Scaling the Technology Opportunity Analysis text data mining methodology: Data extraction, cleaning, online analytical processing analysis, and reporting of large multi-source datasets. [D] . George, Richard Peyton. 2006

机译：扩展技术机会分析文本数据挖掘方法：数据提取，清理，在线分析处理分析以及大型多源数据集的报告。
6. The AMeX method: a multipurpose tissue-processing and paraffin-embedding method. II. Extraction of spooled DNA and its application to Southern blot hybridization analysis. [O] . Y. Sato, K. Mukai, Y. Matsuno, 1990

机译：AMeX方法：一种多功能组织处理和石蜡包埋方法。二。假脱机DNA的提取及其在Southern印迹杂交分析中的应用。
7. Comprehensive analysis of aspect term extraction methods using various text embeddings [O] . Łukasz Augustyniak, Tomasz Kajdanowicz, Przemysław Kazienko 2021

机译：使用各种文本嵌入的综合术语提取方法综合分析
8. Comprehensive Security Analysis of and an Implementation Framework for Embedded Software Attestation Methods Leveraging FPGA-Based System-on-a-Chip Architectures. [R] . Reber, P. A. 2017

机译：利用基于FpGa的片上系统架构的嵌入式软件认证方法的综合安全性分析和实现框架。

Comprehensive analysis of aspect term extraction methods using various text embeddings

摘要

著录项

相似文献

相关主题

期刊订阅