【24h】

Discovering Program Topoi through Clustering

机译:通过群集发现Progral TopoI

获取原文

摘要

Understanding source code of large open-source software projects is very challenging when there is only little documentation. New developers face the task of classifying a huge number of files and functions without any help. This paper documents a novel approach to this problem, called FEAT, that automatically extracts topoi from source code by using hierarchical agglomerative clustering. Program topoi summarize the main capabilities of a software system by presenting to developers clustered lists of functions together with an index of their relevant words. The clustering method used in FEAT exploits a new hybrid distance which combines both textual and structural elements automatically extracted from source code and comments. The experimental evaluation of FEAT shows that this approach is suitable to understand open-source software projects of size approaching 2,000 functions and 150 files, which opens the door for its deployment in the open-source community.
机译:当只有很少的文档,了解大型开源软件项目的源代码非常具有挑战性。 新开发人员面临分类大量文件和功能而没有任何帮助的任务。 本文介绍了一个新的解决这个问题的方法,称为壮举,它通过使用分层附下聚类自动从源代码中提取TopoI。 程序Topoi通过将开发人员群集群集的功能列表以及其相关单词的索引来总结软件系统的主要功能。 emeg中使用的群集方法利用新的混合距离,该混合距离结合了从源代码和注释中自动提取的文本和结构元素。 Feat的实验评估表明,这种方法适合了解越来越多的函数的开放源软件项目和150个文件,可为其在开源社区中的部署开辟门。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号