【24h】

Observing and Preventing Leakage in MapReduce

机译:Mapreduce观察和防止泄漏

获取原文

摘要

The use of public cloud infrastructure for storing and processing large datasets raises new security concerns. Current solutions propose encrypting all data, and accessing it in plaintext only within secure hardware. Nonetheless, the distributed processing of large amounts of data still involves intensive encrypted communications between different processing and network storage units, and those communications patterns may leak sensitive information. We consider secure implementation of MapReduce jobs, and analyze their intermediate traffic between mappers and reducers. Using datasets that include personal and geographical data, we show how an adversary that observes the runs of typical jobs can infer precise information about their input. We give a new definition of data privacy for MapReduce, and describe two provably-secure, practical solutions. We implement our solutions on top of VC3, a secure implementation of Hadoop, and evaluate their performance.
机译:用于存储和处理大型数据集的公共云基础架构的使用引发了新的安全问题。当前的解决方案建议加密所有数据,并仅在安全硬件中以明文访问它。尽管如此,大量数据的分布式处理仍然涉及不同处理和网络存储单元之间的密集加密通信,并且这些通信模式可能会泄漏敏感信息。我们考虑安全实现MapReduce作业,并分析了映射器和减速器之间的中间流量。使用包括个人和地理数据的数据集,我们展示了观察典型工作的运行的对手如何推断有关其输入的精确信息。我们为MapReduce提供了新的数据隐私定义,并描述了两种可提供安全的实用解决方案。我们在VC3的顶部实施我们的解决方案,安全实现Hadoop,并评估其性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号