Regularized fisher linear discriminant through two threshold variation strategies for imbalanced problems

Zhu Yujin; Wang Zhe; Cao Chenjie; Gao Daqi

首页> 外文期刊>Knowledge-Based Systems >Regularized fisher linear discriminant through two threshold variation strategies for imbalanced problems

【24h】

Regularized fisher linear discriminant through two threshold variation strategies for imbalanced problems

机译：通过两种阈值变化策略对不平衡问题进行正则化Fisher线性判别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Fisher Linear Discriminant (FLD) has been widely applied to classification tasks due to its simple structure, analytical optimization, and useful criterion. However, when dealing with imbalanced datasets, even though the weight vector of FLD could be trained correctly to preserve the global distribution information of samples, the threshold of FLD might be seriously misled by the extreme proportion of classes. In order to modify the threshold and preserve the weight vector at the same time so as to improve FLD in imbalanced cases, this paper first regularizes the original FLD in a way inspired by the locality preserving projection, and then utilizes two strategies to optimize the threshold: the multi-thresholds selection strategy trains several FLDs with different empirically-defined thresholds, and then selects the optimal threshold out; the threshold-eliminated strategy generates two hyperplanes parallel to the original one built by FLD, and then utilizes a heuristic similarity metric for prediction. It is seen that the former seeks new threshold instead of the old one, while the latter ignores the original threshold. After introducing both strategies into the regularized FLD, two new classifiers are proposed in this paper and abbreviated as RFLD-S1 and RFLD-S2, respectively. Subsequently, the comprehensive comparison experiments on forty-one datasets among nine typical classifiers validate the effectiveness of the proposed methods. Especially, RFLD-S1 performs better than RFLD-S2 and achieves the best on most datasets. (C) 2018 Elsevier B.V. All rights reserved.

机译：Fisher线性判别（FLD）由于其结构简单，分析优化和有用的准则而被广泛应用于分类任务。但是，当处理不平衡的数据集时，即使可以正确地训练FLD的权重向量以保留样本的全局分布信息，FLD的阈值也可能会因类别的极端比例而严重误导。为了在不平衡的情况下修改阈值并同时保留权重向量以改善FLD，本文首先通过保留局部投影的方法对原始FLD进行正则化，然后利用两种策略来优化阈值：多阈值选择策略训练多个具有不同经验定义阈值的FLD，然后从中选择最佳阈值；消除阈值的策略会生成两个与FLD构建的原始超平面平行的超平面，然后利用启发式相似性度量进行预测。可以看出，前者寻求新的阈值而不是旧的阈值，而后者则忽略了原始的阈值。在将两种策略引入正则化FLD之后，本文提出了两个新的分类器，分别简称为RFLD-S1和RFLD-S2。随后，在9个典型分类器上对41个数据集进行的全面比较实验验证了所提方法的有效性。特别是，RFLD-S1的性能优于RFLD-S2，并且在大多数数据集上均达到最佳。（C）2018 Elsevier B.V.保留所有权利。

著录项

来源
《Knowledge-Based Systems》 |2018年第15期|57-73|共17页
作者
Zhu Yujin; Wang Zhe; Cao Chenjie; Gao Daqi;
展开▼
作者单位

East China Univ Sci & Technol, Minist Educ, Key Lab Adv Control & Optimizat Chem Proc, Shanghai 200237, Peoples R China;

East China Univ Sci & Technol, Minist Educ, Key Lab Adv Control & Optimizat Chem Proc, Shanghai 200237, Peoples R China;

East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China;

East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Imbalanced data; Pattern classification; Fisher linear discriminant; Regularization; Heuristic learning;

机译：数据不平衡;模式分类;Fisher线性判别;正则化;启发式学习;

相似文献

外文文献
中文文献
专利

1. Regularized least squares fisher linear discriminant with applications to image recognition [J] . Xiaobo Chen, Jian Yang, Qirong Mao, Neurocomputing . 2013,第deca25期

机译：正则化最小二乘费舍尔线性判别及其在图像识别中的应用
2. EEG based Autism Diagnosis Using Regularized Fisher Linear Discriminant Analysis [J] . Mahmoud I. Kamel, Mohammed J. Alhaddad, Hussein M. Malibary, International Journal of Image, Graphics and Signal Processing . 2012,第3期

机译：基于正则化Fisher线性判别分析的基于EEG的自闭症诊断
3. Fisher's linear discriminant ratio based threshold for moving human detection in thermal video [J] . Sharma Lavanya, Yadav Dileep Kumar, Singh Annapurna Infrared physics and technology . 2016,第Null期

机译：基于费舍尔线性判别比率的阈值，用于在热视频中移动人体检测
4. Identity and Variation Spaces: Revisiting the Fisher Linear Discriminant [C] . Sheng Zhang, Terence Sim, Mei-Chen Yeh International Conference on Computer Vision . 2010

机译：身份和变异空间：重新审视Fisher线性判别
5. A COMPARISON OF SIX MODELS FOR PREDICTING CORPORATE BANKRUPTCY: MULTIPLE LINEAR REGRESSION ANALYSIS, MULTIPLE LINEAR DISCRIMINANT ANALYSIS, STEPWISE REGRESSION ANALYSIS, STEPWISE DISCRIMINANT ANALYSIS, MULTIPLE LINEAR REGRESSION ANALYSIS WITH RIDGE REGRESSION, AND MULTIPLE LINEAR DISCRIMINANT ANALYSIS WITH BIASED MINIMUM CHI-SQUARE RULE [D] . MAPP, JOHNNIE ALBERT. 1981

机译：六种预测公司破产的模型的比较：多个线性回归分析，多个线性判别分析，逐步回归分析，逐步判别分析，多个带岭点回归的线性回归分析，以及多个线性离散
6. Performance Improvement of Near-Infrared Spectroscopy-Based Brain-Computer Interface Using Regularized Linear Discriminant Analysis Ensemble Classifier Based on Bootstrap Aggregating [O] . Jaeyoung Shin, Chang-Hwan Im 2020

机译：基于Bootstrap聚合的正规化线性判别分析集体分类器的基于近红外光谱的脑电电脑界面性能改进
7. Identity and Variation Spaces: Revisiting the Fisher Linear Discriminant [O] . Sheng Zhang, Terence Sim, Mei-chen Yeh 2013

机译：身份和变异空间：重新审视Fisher线性判别
8. Extensions to Fishers Linear Discriminant Function [R] . Longstaff, I. D. 1985

机译：对渔民线性判别函数的扩展

Regularized fisher linear discriminant through two threshold variation strategies for imbalanced problems

摘要

著录项

相似文献

相关主题

期刊订阅