...
首页> 外文期刊>EPJ Web of Conferences >Lessons Learned from the Assessment of Software Defect Prediction on WLCG Software: A Study with Unlabelled Datasets and Machine Learning Techniques
【24h】

Lessons Learned from the Assessment of Software Defect Prediction on WLCG Software: A Study with Unlabelled Datasets and Machine Learning Techniques

机译:从WLCG软件的软件缺陷预测评估中汲取的经验教训:具有未标记数据集和机器学习技术的研究

获取原文
           

摘要

Software defect prediction is an activity that aims at narrowing down the most likely defect-prone software modules and helping developers and testers to prioritize inspection and testing. This activity can be addressed by using Machine Learning techniques applied to software metrics datasets that are usually unlabelled, i.e. they lack modules classification in terms of defectiveness. To overcome this limitation, in addition to the usual data pre-processing operations to manage mission values and/or to remove inconsistencies, researches have to adopt an approach to label their unlabelled software datasets. The extraction of defectiveness data to label all the instances of the datasets is an extremely time and effort consuming operation. In literature, many studies have introduced approaches to build a defect prediction models on unlabelled datasets.In this paper, we describe the analysis of new unlabelled datasets from WLCG software, coming from HEP-related experiments and middleware, by using Machine Learning techniques. We have experimented new approaches to label the various modules due to the heterogeneity of software metrics distribution. We discuss a number of lessons learned from conducting these activities, what has worked, what has not and how our research can be improved.
机译:软件缺陷预测是旨在缩小最可能缺陷的软件模块,并帮助开发人员和测试人员优先考虑检查和测试的活动。通过应用于通常未标记的软件度量数据集的计算机学习技术可以解决此活动,即它们缺乏模块在缺陷方面的分类。为了克服这种限制,除了通常的数据预处理操作来管理任务价值和/或删除不一致,研究必须采用一种标记其未标签的软件数据集的方法。对缺陷数据的提取来标记数据集的所有实例是消耗操作的极其时间和精力。在文献中,许多研究引入了在未标记的数据集中构建缺陷预测模型的方法。在本文中,通过使用机器学习技术来描述来自WLCG软件的新未标记数据集的分析,通过使用机器学习技术。由于软件度量分布的异质性,我们已经尝试了标记各种模块的新方法。我们讨论了从开展这些活动中讨论了一些经验教训,所努力的是什么,没有以及如何改善我们的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号