首页>
外国专利>
AUC-maximized high-accuracy classifier for imbalanced datasets
AUC-maximized high-accuracy classifier for imbalanced datasets
展开▼
机译:AUC-最大化的高精度分类,用于非衡度数据集
展开▼
页面导航
摘要
著录项
相似文献
摘要
An AUC-maximized high-accuracy classification method and system for imbalanced datasets integrates an under-sampling-and-ensemble strategy, a true-outliers-removing strategy and a fake-outliers-concealing strategy, with the hope to effectively and robustly enhance both the AUC and the accuracy metrics in imbalanced classification. Applying under-sampling to construct multiple sub-datasets and assembling classification results of multiple classifiers greatly decline the risk of misclassification and lead to highly accurate and robust results in imbalanced classification task. Moreover, this invention pays attention to detect and identify extremely hidden outliers in a sub-dataset which includes a sub-majority dataset and the entire minority dataset. In this way, more hidden outliers can be located and thus exert less influence on the decision boundary, which contributes to both high AUC and accuracy. Furthermore, this invention proposes to conceal fake outliers when building decision boundary, which can achieve a higher classification accuracy of the majority class without changing that of the minority class.
展开▼