【24h】

Building Chinese Affective Resources in Valence-Arousal Dimensions

机译:从价位角度构建中国情感资源

获取原文

摘要

An increasing amount of research has recently focused on representing affective states as continuous numerical values on multiple dimensions, such as the valence-arousal (VA) space. Compared to the categorical approach that represents affective states as several classes (e.g., positive and negative), the dimensional approach can provide more finegrained sentiment analysis. However, affective resources with valence-arousal ratings are still very rare, especially for the Chinese language. Therefore, this study builds 1) an affective lexicon called Chinese valence-arousal words (CVAW) containing 1,653 words, and 2) an affective corpus called Chinese valence-arousal text (CVAT) containing 2,009 sentences extracted from web texts. To improve the annotation quality, a corpus cleanup procedure is used to remove outlier ratings and improper texts. Experiments using CVAW words to predict the VA ratings of the CVAT corpus show results comparable to those obtained using English affective resources.
机译:最近,越来越多的研究集中在将情感状态表示为多维维上的连续数值,例如化合价(VA)空间。与将情感状态表示为几个类别(例如,正面和负面)的分类方法相比,量纲方法可以提供更细粒度的情感分析。但是,具有价数等级的情感资源仍然非常稀少,尤其是对于中文。因此,本研究建立了1)包含1,653个单词的情感汉语词汇(CVAW),以及2)包含从Web文本中提取的2,009个句子的情感中文语料(CVAT)。为了提高注释的质量,语料库清理过程用于删除异常值和不正确的文本。使用CVAW单词预测CVAT语料库的VA等级的实验显示,结果与使用英语情感资源获得的结果相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号