Comparison of Classification Algorithms for Detection of Phishing Websites

Paulius VAITKEVICIUS; Virginijus MARCINKEVICIUS

首页> 外文期刊>Informatica >Comparison of Classification Algorithms for Detection of Phishing Websites

【24h】

Comparison of Classification Algorithms for Detection of Phishing Websites

机译：分类算法检测网络钓鱼网站的比较

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Phishing activities remain a persistent security threat, with global losses exceeding 2.7 billion USD in 2018, according to the FBI's Internet Crime Complaint Center. In literature, different generations of phishing websites detection methods have been observed. The oldest methods include manual blacklisting of known phishing websites' URLs in the centralized database, but they have not been able to detect newly launched phishing websites. More recent studies have attempted to solve phishing websites detection as a supervised machine learning problem on phishing datasets, designed on features extracted from phishing websites' URLs. These studies have shown some classification algorithms performing better than others on differently designed datasets but have not distinguished the best classification algorithm for the phishing websites detection problem in general. The purpose of this research is to compare classic supervised machine learning algorithms on all publicly available phishing datasets with predefined features and to distinguish the best performing algorithm for solving the problem of phishing websites detection, regardless of a specific dataset design. Eight widely used classification algorithms were configured in Python using the Scikit Learn library and tested for classification accuracy on all publicly available phishing datasets. Later, classification algorithms were ranked by accuracy on different datasets using three different ranking techniques while testing the results for a statistically significant difference using Welch's T-Test. The comparison results are presented in this paper, showing ensembles and neural networks outperforming other classical algorithms.

机译：根据FBI的互联网犯罪投诉中心，网络钓鱼活动仍然是持续的安全威胁，2018年全球损失超过2018年超过27亿美元。在文献中，已经观察到不同几代网络钓鱼网站检测方法。最旧的方法包括在集中式数据库中的已知网络钓鱼网站URL的手动黑名单，但他们无法检测到新推出的网络钓鱼网站。最近的研究已经尝试解决网络钓鱼网站检测作为网络钓鱼数据集的受监控机器学习问题，设计在网络钓鱼网站URL中提取的功能上。这些研究已经示出了一些在不同设计的数据集上执行的分类算法，但是没有区分用于网络钓鱼网站检测问题的最佳分类算法。该研究的目的是将所有公开的网络钓鱼数据集进行比较具有预定义的特征的经典监督机器学习算法，并区分用于解决网络钓鱼网站检测问题的最佳性能算法，无论特定数据集设计如何。使用Scikit学习库在Python中配置了八种广泛使用的分类算法，并在所有公开的网络钓鱼数据集中测试了分类准确性。后来，使用三种不同的排名技术在不同数据集上的准确度排序分类算法，同时使用Welch的T检验测试统计学上显着差异的结果。本文介绍了比较结果，显示了优于其他经典算法的集合和神经网络。

著录项

来源
《Informatica》 |2020年第1期|143-160|共18页
作者
Paulius VAITKEVICIUS; Virginijus MARCINKEVICIUS;
展开▼
作者单位

Vilnius University Institute of Data Science and Digital Technologies Akademijos str. 4 LT-08412 Vilnius Lithuania;

Vilnius University Institute of Data Science and Digital Technologies Akademijos str. 4 LT-08412 Vilnius Lithuania;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
phishing detection; classification algorithms; phishing datasets;

机译：网络钓鱼检测;分类算法;网络钓鱼数据集;

相似文献

外文文献
中文文献
专利

1. Optimization of URL-Based Phishing Websites Detection through Genetic Algorithms [J] . Muhammad Taseer Suleman, Shahid Mahmood Awan Automatic Control and Computer Sciences . 2019,第4期

机译：基于URL的网络钓鱼网站通过遗传算法进行了优化
2. A Comparative Analysis of Different Feature Set on the Performance of Different Algorithms in Phishing Website Detection [J] . Hajara Musa, Bala Modi, Ismail Abdulkarim Adamu, International Journal of Artificial Intelligence & Applications (IJAIA) . 2019,第3期

机译：不同特征对网络钓鱼网站检测中不同算法性能的比较分析
3. A new fast associative classification algorithm for detecting phishing websites [J] . Hadi Wael, Aburub Faisal, Alhawari Samer Applied Soft Computing . 2016,第Null期

机译：一种新型的网络钓鱼网站快速关联分类算法
4. Algorithm Evaluation for Classification “Phishing Website” Using Several Classification Algorithms [C] . Rizki Wahyudi, Hendra Marcos, Uswatun Hasanah, International Conference on Information Technology, Information System and Electrical Engineering . 2018

机译：使用几种分类算法对“钓鱼网站”分类的算法评估
5. Categorization of Phishing Detection Features and Using the Feature Vectors to Classify Phishing Websites [D] . Namasivayam, Bhuvana. 2017

机译：对网络钓鱼检测特征的分类，并使用特征向量对网络钓鱼网站进行分类
6. Improving the phishing website detection using empirical analysis of Function Tree and its variants [O] . Abdullateef O. Balogun, Kayode S. Adewole, Muiz O. Raheem, 2021

机译：使用函数树及其变体的实证分析改善网络钓鱼网站检测
7. Phishing website detection using intelligent data mining techniques. Design and development of an intelligent association classification mining fuzzy based scheme for phishing website detection with an emphasis on E-banking. [O] . Abur-rous Maher Ragheb Mohammed 2010

机译：使用智能数据挖掘技术的网络钓鱼网站检测。一种基于智能关联分类挖掘模糊的网络钓鱼网站检测方案的设计与开发，重点是电子银行。

Comparison of Classification Algorithms for Detection of Phishing Websites

摘要

著录项

相似文献

相关主题

期刊订阅