首页> 外文会议>IEEE International Conference on Big Data Analytics >Privacy-Preserving DBSCAN Clustering Algorithm Based on Negative Database
【24h】

Privacy-Preserving DBSCAN Clustering Algorithm Based on Negative Database

机译:基于负数据库的隐私保护DBSCAN聚类算法

获取原文

摘要

the negative database (NDB) is a new type of privacy protection and data security technology, which enhances the security of the data by storing the compressed form of the original data complement, thereby achieving the protection of privacy data. In practical applications, each record in the original database (DB) is usually transformed into a negative database to obtain negative database NDBs to achieve the protection of privacy data. Then, use the classification and clustering methods on the negative database are used to mine and analyze the privacy data. The DBSCAN clustering algorithm is a classical density-based clustering algorithm, and the Euclidean distance formula is one of the most commonly used distance measurement formulas in the clustering algorithm, and the DBSCAN algorithm is of no exception. However, the current Euclidean distance measurement of DBSCAN clustering algorithm is based on the distance measurement of plaintext data, so it is impossible to cluster the privacy data transformed into negative database. In this paper, we introduce a DBSCAN clustering algorithm based on the Euclidean distance formula on a negative database, which is used to complete clustering research while protecting privacy data. The experimental result showed that our algorithm achieved high clustering accuracy and effectively protected the security of privacy data by using irreversible negative database. Therefore, the algorithm we designed is very effective and feasible.
机译:负数据库(NDB)是一种新型的隐私保护和数据安全技术,它通过存储原始数据补码的压缩形式来增强数据的安全性,从而实现对隐私数据的保护。在实际应用中,通常将原始数据库(DB)中的每个记录转换为负数据库以获得负数据库NDB。 s 实现对隐私数据的保护。然后,使用负数据库上的分类和聚类方法来挖掘和分析隐私数据。 DBSCAN聚类算法是经典的基于密度的聚类算法,欧几里德距离公式是该聚类算法中最常用的距离测量公式之一,DBSCAN算法也不例外。但是,目前DBSCAN聚类算法的欧式距离度量是基于明文数据的距离度量,因此不可能将转换为负数据库的隐私数据聚类。在本文中,我们在负数据库中引入了基于欧几里德距离公式的DBSCAN聚类算法,该算法用于在保护隐私数据的同时完成聚类研究。实验结果表明,通过使用不可逆负数据库,该算法实现了较高的聚类精度,有效地保护了隐私数据的安全性。因此,我们设计的算法是非常有效和可行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号