首页> 美国卫生研究院文献>American Journal of Epidemiology >Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks
【2h】

Validity of Privacy-Protecting Analytical Methods That Use Only Aggregate-Level Information to Conduct Multivariable-Adjusted Analysis in Distributed Data Networks

机译:仅使用聚合级别信息在分布式数据网络中进行多变量调整分析的隐私保护分析方法的有效性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Distributed data networks enable large-scale epidemiologic studies, but protecting privacy while adequately adjusting for a large number of covariates continues to pose methodological challenges. Using 2 empirical examples within a 3-site distributed data network, we tested combinations of 3 aggregate-level data-sharing approaches (risk-set, summary-table, and effect-estimate), 4 confounding adjustment methods (matching, stratification, inverse probability weighting, and matching weighting), and 2 summary scores (propensity score and disease risk score) for binary and time-to-event outcomes. We assessed the performance of combinations of these data-sharing and adjustment methods by comparing their results with results from the corresponding pooled individual-level data analysis (reference analysis). For both types of outcomes, the method combinations examined yielded results identical or comparable to the reference results in most scenarios. Within each data-sharing approach, comparability between aggregate- and individual-level data analysis depended on adjustment method; for example, risk-set data-sharing with matched or stratified analysis of summary scores produced identical results, while weighted analysis showed some discrepancies. Across the adjustment methods examined, risk-set data-sharing generally performed better, while summary-table and effect-estimate data-sharing more often produced discrepancies in settings with rare outcomes and small sample sizes. Valid multivariable-adjusted analysis can be performed in distributed data networks without sharing of individual-level data.
机译:分布式数据网络可以进行大规模的流行病学研究,但是在对大量协变量进行充分调整的同时,保护隐私仍在方法上带来挑战。使用3个站点的分布式数据网络中的2个经验示例,我们测试了3种聚合级别的数据共享方法(风险集,汇总表和效果估计),4种混杂调整方法(匹配,分层,逆向)的组合概率加权和匹配加权),以及针对二进制结果和事件发生时间的2个摘要评分(倾向评分和疾病风险评分)。我们通过将它们的结果与相应的汇总个人级别数据分析(参考分析)的结果进行比较,评估了这些数据共享和调整方法组合的性能。对于两种类型的结果,所检查的方法组合所产生的结果在大多数情况下均与参考结果相同或相当。在每种数据共享方法中,汇总级数据分析和个体级数据分析之间的可比性取决于调整方法。例如,对汇总得分进行匹配或分层分析的风险集数据共享产生了相同的结果,而加权分析则显示出一些差异。在所研究的调整方法中,风险集数据共享通常表现得更好,而汇总表和效果估计数据共享则更经常出现在结果少,样本量小的情况下的差异。可以在分布式数据网络中执行有效的多变量调整分析,而无需共享各个级别的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号