Journal: Social Media + Society

What You Can Scrape and What Is Right to Scrape: A Proposal for a Tool to Collect Public Facebook Data

Abstract

In reaction to the Cambridge Analytica scandal, Facebook has restricted access to its Application Programming Interface (API). This new policy has limited the ability of independent researchers to study relevant topics in political and social behavior. Yet much of the public information that researchers may be interested in is still available on Facebook, and can still be systematically collected through web scraping techniques. The goal of this article is twofold. First, we discuss some ethical and legal issues that researchers should consider as they plan their collection and possible publication of Facebook data. In particular, we discuss what kind of information can be ethically gathered about the users (public information), what published data should look like to comply with privacy regulations (like the GDPR), and what consequences violating Facebook’s terms of service may entail for the researcher. Second, we present a scraping routine for public Facebook posts, and discuss some technical adjustments that can be performed for the data to be ethically and legally acceptable. The code employs screen scraping to collect the list of reactions to a public Facebook post, and applies a one-way cryptographic hash function to the users’ identifiers to pseudonymize their personal information, while still keeping them traceable within the data. This article contributes to the debate around freedom of internet research and the ethical concerns that might arise from scraping data from the social web.
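The pseudonymization step described in the abstract can be illustrated with a minimal Python sketch. This is not the article's code: the choice of HMAC-SHA-256 with a secret key, the function name `pseudonymize`, and the sample records are all assumptions made for illustration. The idea is only that the same identifier always maps to the same digest, so a user stays traceable within the data while the original identifier is not published.

```python
import hashlib
import hmac
import secrets


def pseudonymize(user_id: str, key: bytes) -> str:
    """Map a user identifier to a fixed-length hex digest (hypothetical helper).

    A keyed hash (HMAC-SHA-256) is an assumption here, not the article's
    stated method. The key makes it harder to recover identifiers by
    hashing a list of candidate user names.
    """
    return hmac.new(key, user_id.encode("utf-8"), hashlib.sha256).hexdigest()


# The key must stay private and constant for the whole data set;
# otherwise the same user would receive different pseudonyms.
key = secrets.token_bytes(32)

# Hypothetical scraped reactions to one public post (invented sample data).
reactions = [
    {"user_id": "user.alpha", "reaction": "LIKE"},
    {"user_id": "user.beta", "reaction": "ANGRY"},
    {"user_id": "user.alpha", "reaction": "LOVE"},
]

# Replace each identifier with its digest before storing or publishing.
pseudonymized = [
    {"user": pseudonymize(r["user_id"], key), "reaction": r["reaction"]}
    for r in reactions
]

for row in pseudonymized:
    print(row["user"][:12], row["reaction"])
```

In this sketch the first and third rows share a digest because they come from the same identifier, which is what keeps users traceable within the data set. Discarding the key after processing would make it harder to re-link digests to known identifiers, at the cost of not being able to extend the data set consistently later.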