An evolutionary algorithm for mining rare association rules: a Big Data approach

机译：采矿稀有关联规则的进化算法：大数据方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Association rule mining is one of the most well-known techniques to discover interesting relations between items in data. To date, this task has been mainly focused on the discovery of frequent relationships. However, it is often interesting to focus on those that do not occur frequently. Rare association rule mining is an alluring field aiming at describing rare cases or unexpected behavior. This field is really useful over Big Data where abnormal endeavor are more curious than common behavior. In this sense, our aim is to propose a new evolutionary algorithm based on grammars to obtain rare association rules on Big Data. The novelty of our work is that it is eminently designed to be parallel, enabling its use over emerging technologies as Spark and Flink. Furthermore, while other algorithms focus on maximizing a couple of quality measure ignoring the rest, our fitness function has been precisely designed to obtain a trade-off while maximizing a set of well-known quality measures. The experimental study includes more than 70 datasets revealing alluring results in efficiency when more than 300 million of instances and file sizes up to 250 GBytes are considered, and proving that it is able to run efficiently in huge volumes of data.

机译：关联规则挖掘是最着名的技术之一，可以发现数据中项目之间有趣关系的技术之一。迄今为止，这项任务主要集中在发现频繁的关系中。但是，专注于那些不会经常发生的人往往有趣。罕见的协会规则挖掘是一个诱人的领域，旨在描述罕见的病例或意外行为。这个领域对大数据真实有用，其中异常努力比共同行为更加好奇。从这个意义上讲，我们的目标是提出基于语法的新进化算法，以获得大数据的罕见关联规则。我们的作品的新颖之处在于它非常旨在平行，使其在新兴技术用作Spark和Flink。此外，虽然其他算法专注于最大化几个忽略其余的质量措施，但我们的健身功能精确地设计以获得折衷，同时最大化一套众所周知的质量措施。实验研究包括超过70个数据集，当考虑超过250个Gbytes的300亿个实例和文件大小时，揭示了效率的效率导致，并证明它能够以大量的数据有效运行。

著录项

来源
《IEEE Congress on Evolutionary Computation》|2017年|1358-2069 p. :|共8页
会议地点
作者
F. Padillo; J.M. Luna; S. Ventura;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301.6-532;
关键词
mining; rare; association;

机译：矿业;罕见;联想;

相似文献

外文文献
中文文献
专利

1. Mining Association Rules from No-SQL data bases using Map-Reduce Fuzzy Association Rule Mining Algorithm [J] . Chatakunta Praveen Kumar, Pole Anjaiah, Santosh Patil, International Journal of Applied Engineering Research . 2017,第21aPta1期

机译：使用地图减少模糊关联规则挖掘算法来自No-SQL数据基础的挖掘关联规则
2. Rare-PEARs: A new multi objective evolutionary algorithm to mine rare and non-redundant quantitative association rules [J] . Almasi Mehrdad, Abadeh Mohammad Saniee Knowledge-Based Systems . 2015,第NOVa期

机译：稀有梨：一种新的多目标进化算法，用于挖掘稀有和非冗余的定量关联规则
3. An Efficient Approach of Association Rule Mining on Distributed Database Algorithm [J] . Baocang Wang Journal of Computational Intelligence in Bioinformatics . 2019,第1期

机译：分布式数据库算法关联规则挖掘的有效方法
4. An evolutionary algorithm for mining rare association rules: A Big Data approach [C] . F. Padillo, J.M. Luna, S. Ventura IEEE Congress on Evolutionary Computation . 2017

机译：一种挖掘稀有关联规则的进化算法：大数据方法
5. A formal concept analysis approach to association rule mining: The QuICL algorithms. [D] . Smith, David T. 2009

机译：关联规则挖掘的正式概念分析方法：QuICK算法。
6. Order Batching in Warehouses by Minimizing Total Tardiness: A Hybrid Approach of Weighted Association Rule Mining and Genetic Algorithms [O] . Amir Hossein Azadnia, Shahrooz Taheri, Pezhman Ghadimi, 2013

机译：通过最大程度地减少总拖延来进行仓库中的订单批处理：加权关联规则挖掘和遗传算法的混合方法
7. Enhancing association rules algorithms for mining distributed databases. Integration of fast BitTable and multi-agent association rules mining in distributed medical databases for decision support. [O] . Abdo Walid Adly Atteya 2012

机译：增强用于挖掘分布式数据库的关联规则算法。快速BitTable和多代理关联规则挖掘在分布式医疗数据库中的集成，以提供决策支持。
8. Constraint Satisfaction Neural Network Approach for Data Mining Classification and Association Rules in Breast Cancer Databases [R] . Tourassi, G. D. 2003

机译：基于约束满足神经网络的乳腺癌数据挖掘分类与关联规则

An evolutionary algorithm for mining rare association rules: a Big Data approach

摘要

著录项

相似文献

相关主题

期刊订阅