NLP-MTFLR: Document-Level Prioritization and Identification of Dominant Multi-word Named Products in Customer Reviews

Sivashankari R.; Valarmathi B.

Abstract

Access to large numbers of datasets in commercial domains has accentuated the importance of data mining in the last few years. Practitioners and researchers alike rely on these datasets to gauge the magnitude and effect of data-related problems that require solutions in business environments. In recent years, the volume of online data submissions (e-commerce data) on products, services and organizations has increased exponentially. However, the submitted data are highly unstructured and largely language-dependent. Mining and extracting useful information from such data is a colossal task, as the analysis should include opinion word identification/extraction, aspect extraction and entity extraction. Of the three, entity extraction is one of the governing approaches in text analysis and plays a major role in the e-commerce, biomedical and automobile industries; it supports the categorization of records by entity name, the generation of short summaries of the entities and the grouping of similar records. Existing entity extraction approaches can recognize and extract single-word named entities. However, product names are often given as a sequence of words (multi-word named entities) and therefore cannot be recognized by the existing methods. To resolve this issue, this paper presents a novel approach, NLP-Modified Token-based Frequencies of Left and Right (NLP-MTFLR), which is considered an effective way to detect and extract the multi-word named products, and the dominant multi-word named product, from a customer review corpus. With NLP-MTFLR, subwords and multi-subwords are identified in the review corpus and mapped to their multi-word named products in order to recognize the dominant product of that corpus. This dominant product identification shows that, within the corpus, the identified dominant product is reviewed more heavily than the other products. The NLP-MTFLR approach achieves 97% accuracy, 77% precision, 89% recall and an 82% F-score.
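The mapping step described in the abstract (subwords and multi-subwords credited to the multi-word product names that contain them) can be illustrated with a minimal Python sketch. This is not the authors' NLP-MTFLR algorithm: the abstract does not say how the left and right token frequencies are computed, so the sketch only shows the subword-to-product mapping and the dominant-product count. It assumes a capitalization-or-digit heuristic in place of a POS tagger, a tiny hand-written stop list, and hypothetical function names and sample reviews.

```python
import re
from collections import Counter

# Assumption: a tiny stop list so sentence-initial words are not taken as name tokens.
STOPWORDS = {"the", "i", "my", "it", "this", "a"}


def is_name_token(tok):
    """Heuristic stand-in for POS tagging: capitalised or digit-bearing tokens."""
    if tok.lower() in STOPWORDS:
        return False
    return tok[0].isupper() or any(c.isdigit() for c in tok)


def candidate_runs(review):
    """Return maximal runs of consecutive name-like tokens in one review."""
    runs, current = [], []
    for tok in re.findall(r"[A-Za-z0-9]+", review):
        if is_name_token(tok):
            current.append(tok)
        elif current:
            runs.append(tuple(current))
            current = []
    if current:
        runs.append(tuple(current))
    return runs


def contains(longer, shorter):
    """True if `shorter` occurs as a contiguous sub-sequence of `longer`."""
    n = len(shorter)
    return any(longer[i:i + n] == shorter for i in range(len(longer) - n + 1))


def product_scores(reviews):
    """Credit every run (full name, multi-subword or subword) to the full names containing it."""
    run_counts = Counter()
    for review in reviews:
        run_counts.update(candidate_runs(review))
    multi = [r for r in run_counts if len(r) >= 2]
    # Treat a multi-word run as a product name only if no longer run contains it.
    products = [p for p in multi if not any(p != q and contains(q, p) for q in multi)]
    scores = Counter()
    for run, count in run_counts.items():
        for product in products:
            if contains(product, run):
                scores[product] += count
    return scores


# Hypothetical sample corpus, invented for illustration only.
reviews = [
    "I bought the Canon PowerShot G3 last month and it is great.",
    "The PowerShot G3 takes sharp photos.",
    "My G3 has a long battery life.",
    "The Nikon Coolpix 4300 is lighter, but the Canon PowerShot G3 feels sturdier.",
    "Coolpix 4300 menus are confusing.",
]
scores = product_scores(reviews)
dominant, mentions = scores.most_common(1)[0]
print(" ".join(dominant), mentions)
```

In this toy corpus the short mentions "PowerShot G3" and "G3" are counted toward "Canon PowerShot G3", which is why it outscores "Nikon Coolpix 4300". A real system would need proper POS tagging and a principled way to resolve subwords shared by several products.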


Bibliographic details

Source
Arabian Journal for Science and Engineering | 2018, Issue 2 | pp. 843-855 | 13 pages
Authors
Sivashankari R.; Valarmathi B.
Author affiliations

VIT Univ, SITE, Vellore 632014, Tamil Nadu, India;

VIT Univ, SITE, Vellore 632014, Tamil Nadu, India;

Indexed in: Science Citation Index (SCI), USA
Original format: PDF
Language of text: English
Keywords
Natural language processing; Document-level mining; Entity extraction; Text mining; POS tagger;


