Optimization-based feature selection with adaptive instance sampling

Jaekyung Yang; Sigurdur Olafsson

首页> 外文期刊>Computers & operations research >Optimization-based feature selection with adaptive instance sampling

【24h】

Optimization-based feature selection with adaptive instance sampling

机译：基于自适应实例采样的基于优化的特征选择

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Preprocessing the data to filter out redundant and irrelevant features is one of the most important steps in the data mining process. Careful feature selection may improve both the computational time of inducing subsequent models and the quality of those models. Using fewer features often leads to simpler and easier to interpret models, and selecting important feature can lead to important insights into the application. The feature selection problem is inherently a combinatorial optimization problem. This paper builds on a metaheuristic called the nested partitions method that has been shown to be particularly effective for the feature selection problem. Specifically, we focus on the scalability of the method and show that its performance is vastly improved by incorporating random sampling of instances. Furthermore, we develop an adaptive variant of the algorithm that dynamically determines the required sample rate. The adaptive algorithm is shown to perform very well when applied to a set of standard test problems.

机译：预处理数据以滤除冗余和不相关的功能是数据挖掘过程中最重要的步骤之一。仔细的特征选择可以改善引入后续模型的计算时间以及这些模型的质量。使用较少的功能通常可以使模型更容易解释，选择重要的功能可以对应用程序产生重要的见解。特征选择问题本质上是组合优化问题。本文基于一种称为嵌套分区方法的元启发式方法，该方法已被证明对特征选择问题特别有效。具体来说，我们专注于该方法的可伸缩性，并表明通过合并实例的随机抽样极大地提高了其性能。此外，我们开发了一种算法的自适应变体，可以动态确定所需的采样率。当应用于一组标准测试问题时，自适应算法表现出很好的性能。

著录项

来源
《Computers & operations research》 |2006年第11期|p.3088-3106|共19页
作者
Jaekyung Yang; Sigurdur Olafsson;
展开▼
作者单位

Department of Industrial and Manufacturing Systems Engineering, Iowa State University, 2019 Black Engineering, Ames, IA 50011, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
feature selection; combinatorial optimization; metaheuristics; data mining;

机译：特征选择;组合优化;计量学;数据挖掘;

相似文献

外文文献
中文文献
专利

1. Drug activity prediction using multiple-instance learning via joint instance and feature selection [J] . Zhendong Zhao, Gang Fu, Sheng Liu, BMC Bioinformatics . 2013,第SUPPLEMENTa14期

机译：通过联合实例和特征选择使用多实例学习进行药物活动预测
2. Integrating Instance Selection, Instance Weighting, and Feature Weighting for Nearest Neighbor Classifiers by Coevolutionary Algorithms [J] . Derrac J., Triguero I., Garcia S., Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on . 2012,第5期

机译：通过协进化算法集成最近邻分类器的实例选择，实例加权和特征加权
3. Mammogram classification using contourlet features with forest optimization-based feature selection approach [J] . Mohanty Figlu, Rup Suvendu, Dash Bodhisattva, Multimedia Tools and Applications . 2019,第10期

机译：使用基于森林优化的特征选择方法的Contourlet特征进行乳房X线照片分类
4. Scalable Optimization-Based Feature Selection Using Random Sampling [C] . Jaekyung Yang, Sigurdur Olafsson Institute of Industrial Engineers Annual Conference . 2003

机译：基于可扩展的基于优化的特征选择使用随机采样
5. Scalable optimization-based feature selection with application to recommender systems. [D] . Yang, Jaekyung. 2003

机译：可扩展的基于优化的功能选择，适用于推荐系统。
6. Drug activity prediction using multiple-instance learning via joint instance and feature selection [O] . Zhendong Zhao, Gang Fu, Sheng Liu, 2013

机译：通过联合实例和特征选择使用多实例学习进行药物活动预测
7. A Co-evolutionary Framework for Nearest Neighbor Enhancement: Combining Instance and Feature Weighting with Instance Selection [O] . Joaquín Derrac, Isaac Triguero, Salvador García, 2014

机译：最近邻增强的协同进化框架：将实例和特征权重与实例选择相结合

Optimization-based feature selection with adaptive instance sampling

摘要

著录项

相似文献

相关主题

期刊订阅