Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach

机译：构建特定于Twitter的大规模情感词典：一种表示学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose to build large-scale sentiment lexicon from Twitter with a representation learning approach. We cast sentiment lexicon learning as a phrase-level sentiment classification task. The challenges are developing effective feature representation of phrases and obtaining training data with minor manual annotations for building the sentiment classifier. Specifically, we develop a dedicated neural architecture and integrate the sentiment information of tex-t (e.g. sentences or tweets) into its hybrid loss function for learning sentiment-specific phrase embedding (SSPE). The neural network is trained from massive tweets collected with positive and negative emoticons, without any manual annotation. Furthermore, we introduce the Urban Dictionary to expand a small number of sentiment seeds to obtain more training data for building the phrase-level sentiment classifier. We evaluate our sentiment lexicon (TS-Lex) by applying it in a supervised learning framework for Twitter sentiment classification. Experiment results on the benchmark dataset of SemEval 2013 show that, TS-Lex yields better performance than previously introduced sentiment lexicons.

机译：在本文中，我们建议使用表示学习方法从Twitter构建大规模的情感词典。我们将情感词典学习作为短语级别的情感分类任务。面临的挑战是开发有效的短语特征表示并获得带有少量人工注释的训练数据以建立情感分类器。具体来说，我们开发了一种专用的神经体系结构，并将tex-t的情感信息（例如句子或推文）整合到其混合损失函数中，以学习特定于情感的短语嵌入（SSPE）。神经网络从收集的带有正负表情符号的大量推文中进行训练，而无需任何人工注释。此外，我们引入了“城市词典”以扩展少量的情感种子，以获得更多的训练数据以构建短语级情感分类器。我们通过将其应用到Twitter情感分类的监督学习框架中来评估情感词典（TS-Lex）。在SemEval 2013基准数据集上的实验结果表明，TS-Lex比以前引入的情感词典具有更好的性能。

著录项

来源
《International conference on computational linguistics》|2014年|172-182|共11页
会议地点
作者
Duyu Tang; Furu Wei; Bing Qin; Ming Zhou; Ting Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Lexicon-based approach outperforms Supervised Machine Learning approach for Urdu Sentiment Analysis in multiple domains [J] . Mukhtar Neelam, Khan Mohammad Abid, Chiragh Nadia Telematics and Informatics . 2018,第8期

机译：在多个领域中，基于词典的方法胜过有监督的机器学习方法进行乌尔都语情感分析
2. A Hybrid Approach of Machine Learning and Lexicons to Sentiment Analysis: Enhanced Insights from Twitter Data of Natural Disasters [J] . Mendon Shalak, Dutta Pankaj, Behl Abhishek, Information systems frontiers . 2021,第5期

机译：一种机器学习和词汇的混合方法与情感分析：自然灾害的推特数据增强了洞察
3. A Deep Learning-Based Approach to Constructing a Domain Sentiment Lexicon: a Case Study in Financial Distress Prediction [J] . Shixuan Li, Wenxuan Shi, Jiancheng Wang, Information Processing & Management . 2021,第5期

机译：基于深入的学习方法构建域情绪词典：以财务困境预测为例
4. Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach [C] . Duyu Tang, Furu Wei, Bing Qin, International conference on computational linguistics . 2014

机译：建立大规模推特特异性情绪词典：代表学习方法
5. Towards Automated Domain-Oriented Lexicon Construction and Dimension Reduction for Arabic Sentiment Analysis [D] . Alshahrani, Hasan A. 2018

机译：面向阿拉伯语情感分析的面向领域的自动词典构建和降维
6. SentiHealth: creating health-related sentiment lexicon using hybrid approach [O] . Muhammad Zubair Asghar, Shakeel Ahmad, Maria Qasim, -1

机译：SentiHealth：使用混合方法创建与健康相关的情感词典
7. Exerting 2D-Space of Sentiment Lexicons with Machine Learning Techniques: A Hybrid Approach for Sentiment Analysis [O] . Muhammad Yaseen Khan, Khurum Nazir 2020

机译：用机器学习技术施加2D空间的情绪词典：一种杂种方法，具有情绪分析

Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach

摘要

著录项

相似文献

相关主题

期刊订阅