Privacy-preserving logistic regression training

Charlotte Bonte; Frederik Vercauteren

首页> 外文期刊>BMC Medical Genomics >Privacy-preserving logistic regression training

【24h】

Privacy-preserving logistic regression training

机译：隐私保护逻辑回归训练

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Logistic regression is a popular technique used in machine learning to construct classification models. Since the construction of such models is based on computing with large datasets, it is an appealing idea to outsource this computation to a cloud service. The privacy-sensitive nature of the input data requires appropriate privacy preserving measures before outsourcing it. Homomorphic encryption enables one to compute on encrypted data directly, without decryption and can be used to mitigate the privacy concerns raised by using a cloud service. In this paper, we propose an algorithm (and its implementation) to train a logistic regression model on a homomorphically encrypted dataset. The core of our algorithm consists of a new iterative method that can be seen as a simplified form of the fixed Hessian method, but with a much lower multiplicative complexity. We test the new method on two interesting real life applications: the first application is in medicine and constructs a model to predict the probability for a patient to have cancer, given genomic data as input; the second application is in finance and the model predicts the probability of a credit card transaction to be fraudulent. The method produces accurate results for both applications, comparable to running standard algorithms on plaintext data. This article introduces a new simple iterative algorithm to train a logistic regression model that is tailored to be applied on a homomorphically encrypted dataset. This algorithm can be used as a privacy-preserving technique to build a binary classification model and can be applied in a wide range of problems that can be modelled with logistic regression. Our implementation results show that our method can handle the large datasets used in logistic regression training.

机译：Logistic回归是机器学习中用于构建分类模型的流行技术。由于此类模型的构建基于大型数据集的计算，因此将计算外包给云服务是一个很有吸引力的想法。输入数据的隐私敏感特性要求在外包之前采取适当的隐私保护措施。同态加密使人们可以直接对加密数据进行计算，而无需解密，并且可以用来减轻使用云服务引起的隐私问题。在本文中，我们提出了一种在同态加密数据集上训练逻辑回归模型的算法（及其实现）。我们算法的核心是一种新的迭代方法，可以将其视为固定Hessian方法的简化形式，但乘法复杂度要低得多。我们在两个有趣的现实生活应用中测试了该新方法：第一个应用是医学应用，并以基因组数据为输入，构建了一个模型来预测患者患癌症的可能性;第二个应用是金融，模型可以预测信用卡交易被欺诈的可能性。该方法可为两种应用程序产生准确的结果，与在纯文本数据上运行标准算法相当。本文介绍了一种新的简单迭代算法，用于训练适用于同态加密数据集的逻辑回归模型。该算法可以用作构建二进制分类模型的隐私保护技术，并且可以应用于可以通过逻辑回归建模的各种问题。我们的实施结果表明，我们的方法可以处理逻辑回归训练中使用的大型数据集。

著录项

来源
《BMC Medical Genomics》 |2018年第4期|共页
作者
Charlotte Bonte; Frederik Vercauteren;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类遗传学;
关键词
Homomorphic encryptionLogistic regressionPrivacyFixed Hessian;

机译：同态加密逻辑回归隐私固定的黑森州;

相似文献

外文文献
中文文献
专利

1. Privacy-preserving semi-parallel logistic regression training with fully homomorphic encryption [J] . Sergiu Carpov, Nicolas Gama, Mariya Georgieva, BMC Medical Genomics . 2020,第7期

机译：完全同态加密的隐私保留半行逻辑回归训练
2. High performance logistic regression for privacy-preserving genome analysis [J] . Martine De Cock, Rafael Dowsley, Anderson C. A. Nascimento, BMC Medical Genomics . 2021,第1期

机译：预保存基因组分析的高性能逻辑回归
3. Privacy-Preserving Logistic Regression with Distributed Data Sources via Homomorphic Encryption [J] . Yoshinori AONO, Takuya HAYASHI, Le Trieu PHONG, IEICE transactions on information and systems . 2016,第8期

机译：通过同态加密使用分布式数据源保护隐私的Logistic回归
4. LR-GD-RNS: Enhanced Privacy-Preserving Logistic Regression Algorithms for Secure Deployment in Untrusted Environments [C] . Jorge M. Cortés-Mendoza, Gleb Radchenko, Andrei Tchernykh, IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing . 2021

机译：LR-GD-RNS：增强隐私保留逻辑回归算法，用于在不受信任的环境中安全部署
5. Using the multivariate multilevel logistic regression model to detect DIF: A comparison with HGLM and logistic regression DIF detection methods [D] . Pan, Tianshu. 2008

机译：使用多元多级Logistic回归模型检测DIF：与HGLM和Logistic回归DIF检测方法的比较
6. Privacy-preserving semi-parallel logistic regression training with fully homomorphic encryption [O] . Sergiu Carpov, Nicolas Gama, Mariya Georgieva, 2020

机译：完全同态加密的隐私保护半并行逻辑回归训练
7. Privacy-preserving semi-parallel logistic regression training with fully homomorphic encryption [O] . Sergiu Carpov, Nicolas Gama, Mariya Georgieva, 2020

机译：完全同态加密的隐私保留半行逻辑回归训练
8. Predicting Training Success with the NEO-PI-R: The Use of Logistic Regression to Determine the Odds of Completing a Pilot Screening Program [R] . Anesgart, M. N. , Callister, J. D. 2001

机译：使用NEO-pI-R预测培训成功：使用Logistic回归确定完成试点筛选计划的可能性

Privacy-preserving logistic regression training

摘要

著录项

相似文献

相关主题

期刊订阅