Feature selection algorithm based on the Information Gain (IG) can solve the problem of high-dimension and magnanimous data in intrusion forensics, but it neglects the correlation between features, which can lead to the redundancy of features, and affect the speed and accuracy of intrusion forensics. Therefore, an improved Information Gain (IIG) algorithm based on feature redundancy was proposed. In the improved algorithm, the irrelevant features and the redundant features were removed by adding the judgments of redundancy between features, which effectively simplified feature subset. The experimental results show that the proposed algorithm can effectively select features, ensure detection accuracy and improve processing speed.%基于信息增益算法的特征选择虽然能够较好地解决入侵取证中存在的数据高维海量问题,但由于没有考虑特征之间的关系,导致特征子集中存在着冗余特征,从而影响了入侵取证的速度和精度,由此提出一种改进的基于特征冗余度的信息增益算法.通过添加对特征之间冗余度的判断,在删除无关特征的同时过滤了冗余特征,使特征子集得到有效精简.经实验验证,该算法能有效地选择特征向量,保证检测精度,提高检测速度.
展开▼