Sentiment Analysis in Tamil Texts: A Study on Machine Learning Techniques and Feature Representation


Abstract

Sentiment Analysis (SA) is an application of Natural Language Processing (NLP) that extracts the sentiments expressed in text. In this paper, we experimented with five approaches to SA: a lexicon-based approach, a supervised machine learning based approach, a hybrid approach, K-means with Bag of Words (BoW), and K-modes with BoW. We evaluated these approaches on five corpora with different feature representation techniques to identify the best approach to SA in Tamil texts. In addition to traditional features such as BoW and Term Frequency-Inverse Document Frequency (TF-IDF), we used basic features such as word count and punctuation count to check their influence on the prediction. We compared the approaches, the features, and the corpora. In the evaluation, the highest accuracy, 79%, was obtained on the UJ_Corpus_Opinions_Nouns corpus with fastText under the supervised machine learning based approach.
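The feature representations named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two Tamil sentences are invented toy examples, the function names (`tokenize`, `basic_features`, `bow`, `tf_idf`) are hypothetical, tokenization is assumed to be whitespace splitting with ASCII punctuation stripped, and TF-IDF is computed with the plain unsmoothed formula tf × log(N / df), which may differ from the variant used in the paper.

```python
import math
import string
from collections import Counter


def tokenize(text):
    """Split on whitespace and strip ASCII punctuation (an assumed scheme)."""
    stripped = (tok.strip(string.punctuation) for tok in text.split())
    return [tok for tok in stripped if tok]


def basic_features(text):
    """Basic features mentioned in the abstract: word and punctuation counts."""
    return {
        "word_count": len(text.split()),
        "punct_count": sum(ch in string.punctuation for ch in text),
    }


def bow(tokens):
    """Bag of Words: map each token to its raw count in the document."""
    return Counter(tokens)


def tf_idf(token_lists):
    """Unsmoothed TF-IDF: (count / doc length) * log(N / document frequency)."""
    n_docs = len(token_lists)
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append(
            {t: (c / total) * math.log(n_docs / doc_freq[t]) for t, c in counts.items()}
        )
    return vectors


# Invented toy reviews, not taken from any corpus in the paper.
docs = ["நல்ல படம்!", "மோசமான படம்."]
token_lists = [tokenize(d) for d in docs]
vectors = tf_idf(token_lists)

print(basic_features(docs[0]))
print(bow(token_lists[0]))
print(vectors[0])
```

In this toy input the word "படம்" occurs in both documents, so its TF-IDF weight is zero, while the words unique to one document receive positive weights. That is the property that lets TF-IDF downweight terms carrying little class information.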


Bibliographic details

Source
Conference on Industrial and Information Systems | 2019 | 1 v. | 6 pages
Authors
Sajeetha Thavareesan; Sinnathamby Mahesan
Original format
PDF
Chinese Library Classification
Information processing
Keywords
Training; Natural language processing; Machine learning; Motion pictures; Testing; Conferences; Information systems


Similar literature


1. Feature Selection Techniques and Classification Accuracy of Supervised Machine Learning in Text Mining [J]. Loise Makara, Kennedy Ogada, Dennis Njagi. Journal of Information Engineering and Applications. 2019, No. 3

2. Authorship Attribution of Telugu Texts Based on Syntactic Features and Machine Learning Techniques [J]. N V Ganapathi Raju, V Vijay Kumar, O Srinivasa Rao. Journal of Theoretical and Applied Information Technology. 2016, No. 1

3. Sentiment Analysis Using Machine Learning Algorithms and Text Mining to Detect Symptoms of Mental Difficulties Over Social Media [J]. Hadj Ahmed Bouarara. International Journal of Information Systems and Social Change. 2021, No. 2

4. Sentiment Analysis in Tamil Texts: A Study on Machine Learning Techniques and Feature Representation [C]. Sajeetha Thavareesan, Sinnathamby Mahesan. Conference on Industrial and Information Systems. 2019

5. Machine Learning and Text Analysis Using Clustering, Classification, Categorization for Applied Industry Research and Its Effect on Trends and Prediction Analysis of a Doctor of Professionals Studies in Computing Dissertation Categories [D]. Haigler, Ashley. 2021

6. Radiomics for glioblastoma survival analysis in pre-operative MRI: exploring feature robustness, class boundaries, and machine learning techniques [O]. Yannick Suter, Urspeter Knecht, Mariana Alão, 2020

7. Scientific Text Sentiment Analysis using Machine Learning Techniques [O]. Hassan Raza, M. Faizan, Ahsan Hamza, 2019
