Evaluating Feature Sets and Classifiers for Sentiment Analysis of Financial News

机译：评估金融新闻情绪分析的特征集和分类器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Work on sentiment analysis has thus far been limited in the news article domain. This has mainly been caused by 1) news articles lacking a clearly defined target, 2) the difficulty in separating good and bad news from positive and negative sentiment, and 3) the seeming necessity of, and complexity in, relying on domain-specific interpretations and background knowledge. In this paper we propose, define, experiment with, and evaluate, four different feature categories, composed of 26 article features, for sentiment analysis. Using five different machine learning methods, we train sentiment classifiers of Norwegian financial internet news articles, and achieve classification precisions up to ~71%. This is comparable to the state-of-the-art in other domains and close to the human baseline. Our experimentation with different feature subsets shows that the category relying on domain-specific sentiment lexical ('contextual' category), able to grasp the jargon and lingo used in Norwegian financial news, is of cardinal importance in classification - these features yield a precision increase of ~21% when added to the other feature categories. When comparing different machine learning classifiers, we find J48 classification trees to yield the highest performance, closely followed by Random Forests (RF), in line with recent studies, and in opposition to the antedated conception that Support Vector Machines (SVM) is superior in this domain.

机译：迄今为止，关于情感分析的工作仅限于新闻领域。这主要是由于以下原因造成的：1）新闻文章缺乏明确的目标，2）很难将好消息和坏消息与正面和负面情绪区分开，以及3）依赖于特定领域的解释的看似必要性和复杂性和背景知识。在本文中，我们提出，定义，试验和评估由26个文章特征组成的四个不同特征类别，以进行情感分析。我们使用五种不同的机器学习方法，对挪威金融互联网新闻文章的情感分类器进行了训练，并实现了约71％的分类精度。这可与其他领域的最新技术相媲美，并且接近人类基线。我们对不同特征子集的实验表明，依赖领域特定情感词汇的类别（“语境”类别）能够掌握挪威财经新闻中使用的术语和行话，在分类中具有至关重要的意义-这些特征产生了精确度的提高添加到其他功能类别时，约为21％。在比较不同的机器学习分类器时，我们发现J48分类树产生了最高的性能，紧随其后的是随机森林（RF），这与最近的研究一致，并且与支持向量机（SVM）在此域。

著录项

来源
《IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies》|2014年|71-78|共8页
会议地点
作者
Njolstad P.C.S.; Hoysaeter L.S.; Wei Wei; Gulla J.A.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Internet; data mining; electronic publishing; emotion recognition; feature extraction; financial data processing; learning (artificial intelligence); pattern classification; support vector machines; text analysis; tree searching; trees (mathematics); J48 classification trees; Norwegian financial Internet news articles; SVM; domain-specific interpretations; domain-specific sentiment lexica; feature categories; feature classifiers; feature set evaluation; financial news; jargon; lingo; machine learning classifiers; machine learning methods; negative sentiment; positive sentiment; random forests; sentiment analysis; sentiment classifiers; support vector machines; Aggregates; Correlation; Feature extraction; Radio frequency; Reliability; Sentiment analysis; Support vector machines; Artificial neural networks; Decision trees; Feature extraction; Machine learning; Supervised learning; Support vector machines; Text analysis; Web mining;

机译：互联网;数据挖掘;电子出版;情感识别;特征提取;财务数据处理;学习（人工智能）;模式分类;支持向量机;文本分析;树搜索;树（数学）; J48分类树;挪威金融互联网新闻文章;支持向量机;特定领域的解释;特定领域的情感词汇;特征类别;特征分类器;特征集评估;财经新闻;行话;术语;机器学习分类器;机器学习方法;负面情绪;正面情绪;随机森林;情感分析;情感分类器;支持向量机;集合;相关性;特征提取;射频;可靠性;情感分析;支持向量机;人工神经网络;决策树;特征提取;机器学习;监督学习;支持向量机;文本分析;网络挖掘;

相似文献

外文文献
中文文献
专利

1. FineNews: fine-grained semantic sentiment analysis on financial microblogs and news [J] . Dridi Amna, Atzeni Mattia, Recupero Diego Reforgiato International journal of machine learning and cybernetics . 2019,第8期

机译：FineNews：金融微博和新闻的细粒度语义情感分析
2. FineNews: fine-grained semantic sentiment analysis on financial microblogs and news [J] . Dridi Amna, Atzeni Mattia, Recupero Diego Reforgiato International journal of machine learning and cybernetics . 2019,第8期

机译：Finenews：金融微博和新闻的细粒度语义情绪分析
3. Sentiment Classification of Financial News Using Statistical Features [J] . Yazdani Sepideh Foroozan, Murad Masrah Azrifah Azmi, Sharef Nurfadhlina Mohd, International Journal of Pattern Recognition and Artificial Intelligence . 2017,第3期

机译：利用统计特征对财经新闻的情感分类
4. Evaluating Feature Sets and Classifiers for Sentiment Analysis of Financial News [C] . Njolstad P.C.S., Hoysaeter L.S., Wei Wei, IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies . 2014

机译：评估财务新闻情感分析的特征集和分类器
5. Sentiment Analysis on Financial News and Microblogs [D] . Talekar, Chinmay. 2018

机译：金融新闻和微博情感分析
6. How to evaluate sentiment classifiers for Twitter time-ordered data? [O] . Igor Mozetič, Luis Torgo, Vitor Cerqueira, -1

机译：如何评估Twitter时间排序数据的情绪分类器？
7. Sentiment Classification through Combining Classifiers with Multiple Feature Sets [O] . Shoushan Li, Chengqing Zong, Xia Wang 2007

机译：通过将分类器与多个功能集组合来进行情感分类

Evaluating Feature Sets and Classifiers for Sentiment Analysis of Financial News

摘要

著录项

相似文献

相关主题

期刊订阅