首页> 外文会议>IEEE International Conference on Fuzzy Systems >A first approach towards the usage of classifiers’ performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

【24h】

A first approach towards the usage of classifiers’ performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

机译：利用分类器性能为分类器集合创建模糊度量的第一种方法：以高度不平衡的数据集为例

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work we study the possibility of learning fuzzy measures from classifiers' performance for improving the standard aggregation methods in classifier ensembles. Fuzzy measures are set-valued functions, which are not necessarily additive, and they are the basis for constructing non-linear fuzzy integrals, such as Choquet or Sugeno integral. These integrals have shown to be very useful in the aggregation of interacting criteria, since this interaction can be well modeled by a fuzzy measure. Classifier ensembles are composed of several classifiers and are aimed at improving the performance of every one of their counterparts. There are two main aspects about ensembles, first, how to build them, and second, how to combine the outputs of all their members. In this work, we focus on the second part, which is a key factor to obtain a successful ensemble. More specifically, we focus on the usage of fuzzy measures for the aggregation phase aiming at taking into account the coalitions and interactions among the members of the ensemble. Our hypothesis is that taking such information into account can lead to better performance. Moreover, we propose to directly obtain the fuzzy measure from data by considering the performance of each subset of classifiers in the ensemble. This way, one needs not include any additional learning for the fuzzy measure that can easily lead to overfitting. In order to test the usefulness of the proposed fuzzy measure, we will consider a set of 33 highly imbalanced datasets and we will develop a complete experimental study comparing the proposed combination scheme with other approaches commonly considered in the literature.

机译：在这项工作中，我们研究了从分类器的性能中学习模糊度量的可能性，以改进分类器集成中的标准聚合方法。模糊测度是集值函数，不一定是可加的，它们是构造非线性模糊积分（如Choquet或Sugeno积分）的基础。这些积分在交互标准的汇总中非常有用，因为可以通过模糊度量很好地建模这种交互。分类器乐团由几个分类器组成，旨在提高每个对应器的性能。关于合奏有两个主要方面，首先是如何构建它们，其次是如何组合所有成员的输出。在这项工作中，我们专注于第二部分，这是获得成功的合奏的关键因素。更具体地讲，我们关注于聚集阶段的模糊度量的使用，旨在考虑集合成员之间的联盟和相互作用。我们的假设是，将此类信息考虑在内可以带来更好的性能。此外，我们建议通过考虑集合中每个分类器子集的性能，直接从数据中获取模糊测度。这样，就不需要对模糊量度进行任何额外的学习，而这很容易导致过度拟合。为了测试所提出的模糊测度的有效性，我们将考虑一组33个高度不平衡的数据集，并且将进行一项完整的实验研究，将所提出的组合方案与文献中通常考虑的其他方法进行比较。

著录项

来源
《IEEE International Conference on Fuzzy Systems》|2018年|1-8|共8页
会议地点
作者
M. Uriz; D. Paternain; H. Bustince; M. Galar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Diversity reception; Open wireless architecture; Additives; Aggregates; Proposals; Weight measurement; Tools;

机译：分集接收;开放的无线体系结构;添加剂;集合体;提案;重量测量;工具;

相似文献

外文文献
中文文献
专利

1. Ordering-based pruning for improving the performance of ensembles of classifiers in the framework of imbalanced datasets [J] . Galar Mikel, Fernandez Alberto, Barrenechea Edurne, Information Sciences: An International Journal . 2016,第Null期

机译：基于排序的修剪在不平衡数据集框架内提高分类器集合的性能
2. Two Stage Comparison of Classifier Performances for Highly Imbalanced Datasets [J] . Goran Ore?ki, Stjepan Ore?ki Journal of Information and Organizational Sciences . 2015,第2期

机译：高度不平衡数据集分类器性能的两阶段比较
3. Feature Selection and Ensemble Learning Techniques in One-Class Classifiers: An Empirical Study of Two-Class Imbalanced Datasets [J] . Chih-Fong Tsai, Wei-Chao Lin Quality Control, Transactions . 2021,第1期

机译：单级分类器中的特征选择和集合学习技术：两级不平衡数据集的实证研究
4. A first approach towards the usage of classifiers' performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets [C] . M. Uriz, D. Paternain, H. Bustince, IEEE International Conference on Fuzzy Systems . 2018

机译：用于对分类器集合创建模糊措施的第一种方法：对高度不平衡数据集的案例研究
5. Diversified ensemble classifiers for highly imbalanced data learning and its application in bioinformatics. [D] . Ding, Zejin. 2011

机译：用于高度不平衡数据学习的多元化集成分类器及其在生物信息学中的应用。
6. iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets [O] . Jianhua Jia, Zi Liu, Xuan Xiao, 2016

机译：iPPBS-Opt：一种基于序列的集成分类器用于通过优化不平衡训练数据集来识别蛋白质与蛋白质的结合位点
7. iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets [O] . Jianhua Jia, Zi Liu, Xuan Xiao, 2016

机译：ippBs-Opt：基于序列的集成分类器，用于通过优化不平衡训练数据集来识别蛋白质 - 蛋白质结合位点

A first approach towards the usage of classifiers’ performance to create fuzzy measures for ensembles of classifiers: a case study on highly imbalanced datasets

摘要

著录项

相似文献

相关主题

期刊订阅