An Efficient Parallel Hybrid Feature Selection Approach for Big Data Analysis

Mohamed Amine Azaiz; Djamel Amar Bensaber

首页> 外文期刊>International journal of swarm intelligence research >An Efficient Parallel Hybrid Feature Selection Approach for Big Data Analysis

【24h】

An Efficient Parallel Hybrid Feature Selection Approach for Big Data Analysis

机译：An Efficient Parallel Hybrid Feature Selection Approach for Big Data Analysis

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Classification algorithms face runtime complexity due to high data dimension, especially in the context of big data. Feature selection (FS) is a technique for reducing dimensions and improving learning performance. In this paper, the authors proposed a hybrid FS algorithm for classification in the context of big data. Firstly, only the most relevant features are selected using symmetric uncertainty (SU) as a measure of correlation. The features are distributed into subsets using Apache Spark to calculate SU between each feature and target class in parallel. Then a Binary PSO (BPSO) algorithm is used to find the optimal FS. The BPSO has limited convergence and restricted inertial weight adjustment, so the authors suggested using a multiple inertia weight strategy to influence the changes in particle motions so that the search process is more varied. Also, the authors proposed a parallel fitness evaluation for particles under Spark to accelerate the algorithm. The results showed that the proposed FS achieved higher classification performance with a smaller size in reasonable time.

著录项

来源
《International journal of swarm intelligence research》 |2022年第4期|1404-1425|共22页
作者
Mohamed Amine Azaiz; Djamel Amar Bensaber;
展开▼
作者单位

Ecole Superieure en Informatique, Sidi Bel-Abbes, Algeria;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词
Big Data Analytics; Feature Selection; Parallel Binary Particle Swarm Optimization;

An Efficient Parallel Hybrid Feature Selection Approach for Big Data Analysis

摘要

著录项

相关主题

期刊订阅