Binary teaching-learning-based optimization algorithm with a new update mechanism for sample subset optimization in software defect prediction

Thanh Tung Khuat; My Hanh Le

首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Binary teaching-learning-based optimization algorithm with a new update mechanism for sample subset optimization in software defect prediction

【24h】

Binary teaching-learning-based optimization algorithm with a new update mechanism for sample subset optimization in software defect prediction

机译：基于二元教学 - 基于教学的优化算法，具有软件缺陷预测中的样本子集优化的新更新机制

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Software defect prediction has gained considerable attention in recent years. A broad range of computational methods has been developed for accurate prediction of faulty modules based on code and design metrics. One of the challenges in training classifiers is the highly imbalanced class distribution in available datasets, leading to an undesirable bias in the prediction performance for the minority class. Data sampling is a widespread technique to tackle this problem. However, traditional sampling methods, which depend mainly on random resampling from a given dataset, do not take advantage of useful information available in training sets, such as sample quality and representative instances. To cope with this limitation, evolutionary undersampling methods are usually used for identifying an optimal sample subset for the training dataset. This paper proposes a binary teaching-learning- based optimization algorithm employing a distribution-based solution update rule, namely BTLBOd, to generate a balanced subset of highly valuable examples. This subset is then applied to train a classifier for reliable prediction of potentially defective modules in a software system. Each individual in BTLBOd includes two vectors: a real-valued vector generated by the distribution-based update mechanism, and a binary vector produced from the corresponding real vector by a proposed mapping function. Empirical results showed that the optimal sample subset produced by BTLBOd might ameliorate the classification accuracy of the predictor on highly imbalanced software defect data. Obtained results also demonstrated the superior performance of the proposed sampling method compared to other popular sampling techniques.

机译：近年来，软件缺陷预测已得到相当大的关注。已经开发了广泛的计算方法，用于基于代码和设计度量的故障模块精确预测。培训分类器中的一个挑战是可用数据集中的高度不平衡的类分布，导致少数阶级预测性能的不良偏见。数据采样是一种解决这个问题的广泛技术。然而，传统的采样方法主要取决于来自给定数据集的随机重新采样，不利用培训集中可用的有用信息，例如样本质量和代表实例。为了应对这种限制，进化的下采样方法通常用于识别训练数据集的最佳样本子集。本文提出了一种采用基于分布的解决方案更新规则，即BTLBOD的基于二元教学的优化算法，以产生高度有价值的例子的平衡子集。然后应用该子集以训练用于在软件系统中的潜在缺陷模块的可靠预测的分类器。 BTLBOD中的每个单独包括两个向量：由基于分布的更新机制生成的实值矢量，以及由所提出的映射函数由相应的实际矢量产生的二进制向量。实证结果表明，BTLBOD产生的最佳样本子集可能会改善预测器的分类准确性对高度不平衡的软件缺陷数据。获得的结果还表明，与其他流行的采样技术相比，所提出的采样方法的优异性能。

著录项

来源
《Soft computing: A fusion of foundations, methodologies and applications》 |2019年第20期|共17页
作者
Thanh Tung Khuat; My Hanh Le;
展开▼
作者单位

Univ Danang Univ Sci &

Technol Da Nang Vietnam;

Univ Danang Univ Sci &

Technol Da Nang Vietnam;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词
Teaching-learning-based optimization; Binary teaching-learning-based optimization; Distribution-based update; Sample subset optimization; Imbalanced learning; Software defect prediction;

机译：基于教学的优化;基于教学教学的优化;基于分配的更新;样本子集优化;吸入学习;软件缺陷预测;

相似文献

外文文献
中文文献
专利

1. Binary teaching-learning-based optimization algorithm with a new update mechanism for sample subset optimization in software defect prediction [J] . Thanh Tung Khuat, My Hanh Le Soft computing: A fusion of foundations, methodologies and applications . 2019,第20期

机译：基于二元教学 - 基于教学的优化算法，具有软件缺陷预测中的样本子集优化的新更新机制
2. Cross‑projects software defect prediction using spotted hyena optimizer algorithm [J] . M. A. Elsabagh, M. S. Farhan, M. G. Gafar SN Applied Sciences . 2020,第4期

机译：使用斑点鬣狗优化器算法进行跨项目软件缺陷预测
3. An Improved Neural Network Learning Algorithm Using Glow-worm Swarm Optimization for Software Defect Prediction [J] . Qifang Luo International Journal of Mechanics and Solids . 2018,第1期

机译：一种改进的神经网络学习算法，使用Glow-Worm Swarm优化进行软件缺陷预测
4. Tackling Feature Selection Problems with Genetic Algorithms in Software Defect Prediction for Optimization [C] . Rizal Broer Bahaweres, Arif Imam Suroso, Alam Wahyu Hutomo, International Conference on Informatics, Multimedia, Cyber and Information System . 2020

机译：解决软件缺陷预测中遗传算法的特征选择问题
5. Search and Optimization Algorithms for Binary Image Compression [D] . Hooda, Reetu. 2018

机译：二值图像压缩的搜索和优化算法
6. HSTLBO: A hybrid algorithm based on Harmony Search and Teaching-Learning-Based Optimization for complex high-dimensional optimization problems [O] . Shouheng Tuo, Longquan Yong, Fang’an Deng, -1

机译：HSTLBO：基于和谐搜索和基于教与学的优化的混合算法用于解决复杂的高维优化问题
7. Enhanced Binary Moth Flame Optimization as a Feature Selection Algorithm to Predict Software Fault Prediction [O] . Iyad Tumar, Yousef Hassouneh, Hamza Turabieh, 2020

机译：增强二进制蛾火焰优化作为特征选择算法，以预测软件故障预测

Binary teaching-learning-based optimization algorithm with a new update mechanism for sample subset optimization in software defect prediction

摘要

著录项

相似文献

相关主题

期刊订阅