...
首页> 外文期刊>Knowledge and Information Systems >Geometric data perturbation for privacy preserving outsourced data mining
【24h】

Geometric data perturbation for privacy preserving outsourced data mining

机译:用于保护隐私的几何数据扰动外包数据挖掘

获取原文
获取原文并翻译 | 示例
           

摘要

Data perturbation is a popular technique in privacy-preserving data mining. A major challenge in data perturbation is to balance privacy protection and data utility, which are normally considered as a pair of conflicting factors. We argue that selectively preserving the task/model specific information in perturbation will help achieve better privacy guarantee and better data utility. One type of such information is the multidimensional geometric information, which is implicitly utilized by many data-mining models. To preserve this information in data perturbation, we propose the Geometric Data Perturbation (GDP) method. In this paper, we describe several aspects of the GDP method. First, we show that several types of well-known data-mining models will deliver a comparable level of model quality over the geometrically perturbed data set as over the original data set. Second, we discuss the intuition behind the GDP method and compare it with other multidimensional perturbation methods such as random projection perturbation. Third, we propose a multi-column privacy evaluation framework for evaluating the effectiveness of geometric data perturbation with respect to different level of attacks. Finally, we use this evaluation framework to study a few attacks to geometrically perturbed data sets. Our experimental study also shows that geometric data perturbation can not only provide satisfactory privacy guarantee but also preserve modeling accuracy well.
机译:数据扰动是保护隐私的数据挖掘中的一种流行技术。数据扰动的主要挑战是平衡隐私保护和数据实用程序,这通常被认为是一对冲突的因素。我们认为有选择地在扰动中保留任务/模型特定的信息将有助于实现更好的隐私保证和更好的数据实用性。这种信息的一种类型是多维几何信息,它被许多数据挖掘模型隐式利用。为了将这些信息保留在数据扰动中,我们提出了几何数据扰动(GDP)方法。在本文中,我们描述了GDP方法的几个方面。首先,我们证明了几种类型的知名数据挖掘模型将在几何扰动数据集上提供与原始数据集相当的模型质量。其次,我们讨论GDP方法背后的直觉,并将其与其他多维摄动方法(例如随机投影摄动)进行比较。第三,我们提出了一种多列隐私评估框架,用于评估几何数据扰动对不同级别攻击的有效性。最后,我们使用此评估框架来研究对几何扰动的数据集的一些攻击。我们的实验研究还表明,几何数据扰动不仅可以提供令人满意的隐私保证,而且可以很好地保持建模精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号