Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

Arora Monika; Kansal Vineet

首页> 外文期刊>Social network analysis and mining >Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

【24h】

Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

机译：Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

On social media platforms such as Twitter and Facebook, people express their views, arguments, and emotions of many events in daily life. Twitter is an international microblogging service featuring short messages called tweets from different languages. These texts often consist of noise in the form of incorrect grammar, abbreviations, freestyle, and typographical errors. Sentiment analysis (SA) aims to predict the actual emotions from the raw text expressed by the people through the field of natural language processing (NLP). The main aim of our work is to process the raw sentence from the Twitter dataset and find the actual polarity of the message. This paper proposes a text normalization with deep convolutional character level embedding (Conv-char-Emb) neural network model for SA of unstructured data. This model can tackle the problems: (1) processing the noisy sentence for sentiment detection (2) handling small memory space in word level embedded learning (3) accurate sentiment analysis of the unstructured data. The initial preprocessing stage for performing text normalization includes the following steps: tokenization, out of vocabulary (OOV) detection and its replacement, lemmatization and stemming. A character-based embedding in convolutional neural network (CNN) is an effective and efficient technique for SA that uses less learnable parameters in feature representation. Thus, the proposed method performs both the normalization and classification of sentiments for unstructured sentences. The experimental results are evaluated in the Twitter dataset by a different point polarity (positive, negative and neutral). As a result, our model performs well in normalization and sentiment analysis of the raw Twitter data enriched with hidden information.

著录项

来源
《Social network analysis and mining》 |2019年第1期|共14页
作者
Arora Monika; Kansal Vineet;
展开▼
作者单位

AKTU;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类计算技术、计算机技术;
关键词
Opinion mining; Convolutional neural network; Phonetic algorithm; Soundex; SemEval dataset;

Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

摘要

著录项

相关主题

期刊订阅