...
首页> 外文期刊>Journal of Theoretical and Applied Information Technology >PRIVACY PRESERVING MINING OF WEB REVIEWS BASED ON SENTIMENT ANALYSIS AND FUZZY SETS
【24h】

PRIVACY PRESERVING MINING OF WEB REVIEWS BASED ON SENTIMENT ANALYSIS AND FUZZY SETS

机译:基于情感分析和模糊套装的网络评价隐私挖掘

获取原文
           

摘要

In the traditional Web, users are considered as information consumers. In social Web, users play a much more active role since they are now not only information consumers but also data providers. Users like online posting reviews which has become an increasingly popular way to express opinions and sentiments toward the products bought or services received. Analyzing these reviews can be helpful for collecting opinions of people about products, social events and problems and would produce useful actionable knowledge that could be of economic values to vendors and other interested parties. Thus, due to the huge number of reviews and their unstructured nature, efficient computational methods are needed for mining and summarizing these reviews, because regular analysis of reviews does not indicate user likes and dislikes. In a review, user typically writes about both the positive and negative aspects of the object, although the general sentiment toward that object may be positive or negative. That?s why sentiment analysis together with opinion mining try to extract and study of user?s opinions, sentiments and subjectivity of text. However, this analysis must come with careful consideration of user?s anonymity and the privacy of their sensitive data as privacy is today an important concern for both users and enterprises. In this research, automatic analysis of opinions (opinion mining) is performed to obtain such detailed aspects based on ontology. Opinion mining identify the features in the opinion and classify the sentiments of the opinion for each of these features. Opinion mining is a difficult task, owing to both the high semantic variability of the opinions expressed, and the diversity of the characteristics and sub-characteristics that describe the products and the multitude of opinion words used to depict them. In the proposed approach, the opinion polarity and polarity strength are measured using fuzzy set. As the fuzzy set theory is quite effective in processing natural languages, to measure the vagueness, it will also be effective in analyzing review articles, which are generally in natural languages. Additionally, the proposed system takes privacy into consideration by anonymizing data before final publishing. Methods of generalization and micro-aggregation are utilized for anonymizing quasi-identifiers to maintain the balance between data utility and user privacy.
机译:在传统的网络中,用户被视为信息消费者。在社交Web中,用户在现在不仅是信息消费者而且提供数据提供者的信息,用户扮演更积极的角色。喜欢在线发布评论,这已成为表达对购买或服务的产品的意见和情绪的日益流行的方式。分析这些评论可能有助于收集人们对产品,社交活动和问题的意见,并产生有用的可操作知识,这些知识可能是供应商和其他有关方面的经济价值。因此,由于审查和非结构化性质的数量巨大,挖掘和总结这些评论需要有效的计算方法,因为定期分析评定并不表示用户喜欢和不喜欢。在审查中,用户通常会写入对象的正面和负面方面,尽管对该对象的一般情绪可以是正的或负面的。这是为什么如何与意见采矿一起出发分析,试图提取和研究用户的意见,情绪和文本的主观性。然而,这种分析必须仔细考虑用户?S匿名和他们敏感数据的隐私,因为隐私是用户和企业的重要关注。在这项研究中,进行了意见(意见采矿)的自动分析,以获得基于本体的这种详细方面。意见挖掘确定了意见中的特征,并对每个特征进行意见的情绪。意见采矿是一项艰巨的任务,由于表达了高的语义变异性,以及描述了产品的特征和子特征的多样性以及用于描绘它们的多种观点词。在所提出的方法中,使用模糊装置测量意见极性和极性强度。由于模糊集理论在加工自然语言方面非常有效,以衡量模糊性,它也将有效地分析审查文章,这通常是自然语言。此外,所提出的系统通过在最终发布之前匿名数据匿名地考虑隐私。泛化和微聚合方法用于匿名的准标识符,以维持数据实用程序与用户隐私之间的平衡。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号