Privacy-Preserving Pattern Mining on Online Density Estimates

机译：在线密度估计的隐私保护模式挖掘

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional pattern mining algorithms require access to the data, either in the form of a complete set of data, as in batch data mining, or in the form of a window of recent data, as in stream mining. In the case of stream mining, this comes with a number of disadvantages, such as the possibly unbounded growth of relevant instances, drift, possibly changing data mining tasks, and issues with privacy, to name a few. Therefore, an approach has been recently proposed that extracts patterns just from statistical information of the stream - more precisely, an online density estimate that is inferred from it. As this approach is mainly based on sampling from the density estimates, it still struggles with itemsets having a medium to low frequency. To resolve this issue, we pursue an alternative strategy in this paper and directly exploit the structure of the density estimates to extract frequent itemsets. Additionally, we address the important matter of privacy-preserving data mining by ensuring that the density estimate fulfills privacy-related properties. To show the effectiveness of the proposed methods, we provide proofs and evaluate the performance on synthetic and real-world data.

机译：传统的模式挖掘算法要求以批处理数据挖掘的完整数据集的形式访问数据，或者像流挖掘一样以最近的数据窗口的形式访问数据。在流挖掘的情况下，存在许多缺点，例如相关实例的增长可能不受限制，漂移，数据挖掘任务可能会更改以及隐私问题等。因此，最近提出了一种仅从流的统计信息中提取模式的方法，更准确地说，是从流的统计信息中提取在线密度估计值。由于此方法主要基于密度估计值的采样，因此仍难以解决中低频率的项目集。为了解决这个问题，我们在本文中寻求一种替代策略，并直接利用密度估计的结构来提取频繁项集。此外，我们通过确保密度估算值满足隐私相关属性来解决隐私保护数据挖掘的重要问题。为了证明所提出方法的有效性，我们提供了证明并评估了综合和真实数据的性能。

著录项

来源
《2017 IEEE International Conference on Big Knowledge》|2017年|25-32|共8页
会议地点 Hefei(CN)
作者
Michael Geilke; Stefan Kramer;
展开▼
作者单位

Johannes Gutenberg Univ. Mainz, Mainz, Germany;

Johannes Gutenberg Univ. Mainz, Mainz, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Itemsets; Data privacy; Probability distribution; Probabilistic logic;

机译：项集;数据隐私;概率分布;概率逻辑;

相似文献

外文文献
中文文献
专利

1. The density-based clustering method for privacy-preserving data mining [J] . Wu Jimmy Ming-Tai, Lin Jerry Chun-Wei, Fournier-Viger Philippe, Annals of the American Thoracic Society . 2019,第3期

机译：保留隐私数据挖掘的基于密度的聚类方法
2. Online Mining Intrusion Patterns from IDS Alerts [J] . Kai Zhang, Shoushan Luo, Yang Xin, Applied Sciences . 2020,第8期

机译：来自IDS警报的在线挖掘入侵模式
3. Use of Data Mining to Determine Usage Patterns of an Online Evaluation Platform During the COVID-19 Pandemic [J] . Rafael E. Reigal, José Luis Pastrana-Brincones, Sergio Luis González-Ruiz, Frontiers in Psychology . 2020,第a期

机译：使用数据挖掘在Covid-19流行期间确定在线评估平台的使用模式
4. Privacy-Preserving Pattern Mining on Online Density Estimates [C] . Michael Geilke, Stefan Kramer IEEE International Conference on Big Knowledge . 2017

机译：在线密度估算上的隐私保留模式挖掘
5. Opinion Mining in Online Social Networks: Patterns, Influences and Anomalies. [D] . Wang, Teng. 2015

机译：在线社交网络中的观点挖掘：模式，影响和异常。
6. Use of Data Mining to Determine Usage Patterns of an Online Evaluation Platform During the COVID-19 Pandemic [O] . Rafael E. Reigal, José Luis Pastrana-Brincones, Sergio Luis González-Ruiz, 2020

机译：使用数据挖掘在Covid-19流行期间确定在线评估平台的使用模式
7. A DENSITY-BASED MICRO AGGREGATION TECHNIQUE FOR PRIVACY-PRESERVING DATA MINING [O] . S. K. Das, B. Borah 2015

机译：基于密度的隐私数据挖掘微集成技术
8. Privacy-Preserving Collaborative Sequential Pattern Mining [R] . Zhan, J. Z. , Chang, L. , Matwin, S. 2004

机译：隐私保护协同序列模式挖掘

Privacy-Preserving Pattern Mining on Online Density Estimates

摘要

著录项

相似文献

相关主题

期刊订阅