首页> 外文会议>IEEE International Conference on Data Science and Advanced Analytics >Constrained independence for detecting interesting patterns
【24h】

Constrained independence for detecting interesting patterns

机译:约束独立性以检测有趣的模式

获取原文

摘要

Among other criteria, a pattern may be interesting if it is not redundant with other discovered patterns. A general approach to determining redundancy is to consider a probabilistic model for frequencies of patterns, based on those of patterns already mined, and compare observed frequencies to the model. Such probabilistic models include the independence model, partition models or more complex models which are approached via randomization for a lack of an adequate tool in probability theory allowing a direct approach. We define constrained independence, a generalization to the notion of independence. This tool allows us to describe probabilistic models for evaluating redundancy in frequent itemset mining. We provide algorithms, integrated within the mining process, for determining non-redundant itemsets. Through experimentations, we show that the models used reveal high rates of redundancy among frequent itemsets and we extract the most interesting ones.
机译:在其他标准中,如果某种模式与其他已发现的模式不是多余的,那么它可能会很有趣。确定冗余度的一种通用方法是,基于已经挖掘的模式的频率,考虑模式频率的概率模型,并将观察到的频率与模型进行比较。这样的概率模型包括独立性模型,分区模型或更复杂的模型,这些模型由于缺乏在概率论中允许直接采取方法的适当工具而通过随机化进行处理。我们定义了受约束的独立性,即对独立性概念的概括。该工具使我们能够描述概率模型,以评估频繁项集挖掘中的冗余。我们提供了在挖掘过程中集成的算法,用于确定非冗余项目集。通过实验,我们表明所使用的模型揭示了频繁项目集之间的高冗余度,并且我们提取了最有趣的项目集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号