...
首页> 外文期刊>Science of Computer Programming >Measuring and analyzing code authorship in 1 + 118 open source projects
【24h】

Measuring and analyzing code authorship in 1 + 118 open source projects

机译:在1 + 118个开源项目中评估和分析代码作者

获取原文
获取原文并翻译 | 示例
           

摘要

Code authorship is a key information about large-scale software projects. Among others, it reveals the division of work, key collaborators, and developers' profiles. Seeking to better understand authorship in large and successful open source communities, we take the Linux kernel as our first case study. In total, we analyze authorship across 66 stable releases. Our analysis is centered around the Degree-of-Authorship (DOA) metric, which accounts for first authorship events (file creation), as well as further code changes. Authorship along the Linux kernel evolution reveals that (a) only a small portion of developers (26%) makes significant contributions to the code base; this ratio is almost constant during the Linux kernel evolution; (b) the number of files per author is highly skewed-a small group of top-authors (2%) is responsible for hundreds of files, while most authors (75%) are responsible for at most 10 files: (c) most authors in Linux (76%) are specialists and the relation between specialists and generalists tends to be constant; (d) authors with a high number of co-authorship connections tend to work with authors with fewer connections. Furthermore, we replicate the study in an extended dataset, composed of 118 well-known GitHub projects. We identify that most of the authorship patterns observed in the Linux kernel are also common to other open source projects. (C) 2019 Elsevier B.V. All rights reserved.
机译:代码作者身份是有关大型软件项目的关键信息。除其他外,它揭示了工作划分,主要合作者和开发人员资料。为了更好地理解大型成功开源社区中的作者身份,我们将Linux内核作为第一个案例研究。总共,我们分析了66个稳定版本中的作者身份。我们的分析集中在“授权度”(DOA)指标上,该指标说明了第一作者事件(文件创建)以及进一步的代码更改。 Linux内核演变的作者身份表明:(a)只有一小部分开发人员(26%)对代码库做出了重大贡献;在Linux内核演进过程中,该比率几乎恒定。 (b)每位作者的文件数量严重偏斜-一小撮顶级作者(2%)负责数百个文件,而大多数作者(75%)最多负责10个文件:(c)大多数Linux的作者(占76%)是专家,专家和通才之间的关系往往是恒定的。 (d)具有大量共同作者联系的作者倾向于与联系较少的作者合作。此外,我们将研究复制到一个扩展的数据集中,该数据集由118个著名的GitHub项目组成。我们发现,Linux内核中观察到的大多数作者模式在其他开源项目中也很常见。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号