首页> 外文会议>International Conference on Computational Linguistics >Domain-Specific Sentiment Lexicons Induced from Labeled Documents
【24h】

Domain-Specific Sentiment Lexicons Induced from Labeled Documents

机译:从标记文档诱导的域特异性情绪词典

获取原文

摘要

Sentiment analysis is an area of substantial relevance both in industry and in academia, including for instance in social studies. Although supervised learning algorithms have advanced considerably in recent years, in many settings it remains more practical to apply an unsupervised technique. The latter are oftentimes based on sentiment lexicons. However, existing sentiment lexicons reflect an abstract notion of polarity and do not do justice to the substantial differences of word polarities between different domains. In this work, we draw on a collection of domain-specific data to induce a set of 24 domain-specific sentiment lexicons. We rely on initial linear models to induce initial word intensity scores, and then train new deep models based on word vector representations to overcome the scarcity of the original seed data. Our analysis shows substantial differences between domains, which make domain-specific sentiment lexicons a promising form of lexical resource in downstream tasks, and the predicted lexicons indeed perform effectively on tasks such as review classification and cross-lingual word sentiment prediction.
机译:情绪分析是工业和学术界既有实质性相关的领域,包括例如社会研究。虽然近年来,监督学习算法大大提升,但在许多环境中,应用无监督技术仍然更加实用。后者是基于情绪词典的时间。然而,现有的情绪词典反映了一种抽象的极性概念,并且不会对不同域之间的词语极性差异进行大致差异。在这项工作中,我们借鉴了一个特定于域的数据,以诱导一组24个特定于域的情绪词典。我们依靠初始线性模型来诱导初始词强度分数,然后基于Word Vector表示训练新的深层模型来克服原始种子数据的稀缺性。我们的分析显示了域之间的实质性差异,使域特定情绪词典成为下游任务中的有希望的词汇资源形式,并且预测的词汇确实在审查分类和交叉词语情绪预测等任务上有效地执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号