首页> 外文会议>International conference on computational linguistics >Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach
【24h】

Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach

机译:构建特定于Twitter的大规模情感词典:一种表示学习方法

获取原文

摘要

In this paper, we propose to build large-scale sentiment lexicon from Twitter with a representation learning approach. We cast sentiment lexicon learning as a phrase-level sentiment classification task. The challenges are developing effective feature representation of phrases and obtaining training data with minor manual annotations for building the sentiment classifier. Specifically, we develop a dedicated neural architecture and integrate the sentiment information of tex-t (e.g. sentences or tweets) into its hybrid loss function for learning sentiment-specific phrase embedding (SSPE). The neural network is trained from massive tweets collected with positive and negative emoticons, without any manual annotation. Furthermore, we introduce the Urban Dictionary to expand a small number of sentiment seeds to obtain more training data for building the phrase-level sentiment classifier. We evaluate our sentiment lexicon (TS-Lex) by applying it in a supervised learning framework for Twitter sentiment classification. Experiment results on the benchmark dataset of SemEval 2013 show that, TS-Lex yields better performance than previously introduced sentiment lexicons.
机译:在本文中,我们建议使用表示学习方法从Twitter构建大规模的情感词典。我们将情感词典学习作为短语级别的情感分类任务。面临的挑战是开发有效的短语特征表示并获得带有少量人工注释的训练数据以建立情感分类器。具体来说,我们开发了一种专用的神经体系结构,并将tex-t的情感信息(例如句子或推文)整合到其混合损失函数中,以学习特定于情感的短语嵌入(SSPE)。神经网络从收集的带有正负表情符号的大量推文中进行训练,而无需任何人工注释。此外,我们引入了“城市词典”以扩展少量的情感种子,以获得更多的训练数据以构建短语级情感分类器。我们通过将其应用到Twitter情感分类的监督学习框架中来评估情感词典(TS-Lex)。在SemEval 2013基准数据集上的实验结果表明,TS-Lex比以前引入的情感词典具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号