An implementation of Botnet dataset to predict accuracy based on network flow model

机译：Botnet数据集基于网络流模型预测准确性的实现

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Botnet is a malicious software that can perform malicious activities, such as (Distributed Denial of Services) DDoS, spamming, phishing, key logging, click fraud, steal personal information and important data, etc. Botnets can replicate themselves without user consent. Several systems of botnet detection have been done by using a machine learning method with feature selection approach. Currently, the creation of dataset feature based on network flow, Domain Name System (DNS) traffic and content based that represent botnet behavior. Unfortunately the dataset for botnet detection is dummy dataset, to implement in machine learning needs extractor tool which is very expensive to buy. Therefore we create our own features extractor. In this paper we propose network flow using connection logs approach on the dataset. First of all we made the data model using pair of source IP (Internet Protocol), destination IP and source port, destination port in a period time to extract new features. To predict the accuracy, the extracted features will be validated using K-Fold Cross Validation with number of k= 10. The results of the validation with six various types of botnet shows the high Precision=98.70%, F-Measure=99.40%, Recall=98.80%, and Accuracy=98.80% for Rule Induction algorithm, while K-Nearest Neighbor is the most stable than all algorithms that achieve precision, Recall, F-measure and accuracy to 98.10% and high speed (50 ms).

机译：僵尸网络是一种可以执行恶意活动的恶意软件，例如（分布式拒绝服务）DDoS，垃圾邮件，网络钓鱼，密钥记录，点击欺诈，窃取个人信息和重要数据等。僵尸网络可以在未经用户同意的情况下复制自身。通过使用具有特征选择方法的机器学习方法，已经完成了多个僵尸网络检测系统。当前，基于网络流量，域名系统（DNS）流量和表示僵尸网络行为的内容的数据集功能的创建。不幸的是，用于僵尸网络检测的数据集是虚拟数据集，要在机器学习中实现需要提取器工具，这是非常昂贵的购买工具。因此，我们创建了自己的特征提取器。在本文中，我们使用数据集上的连接日志方法提出网络流量。首先，我们在一段时间内使用源IP（Internet协议），目标IP和源端口，目标端口对创建数据模型，以提取新功能。为了预测准确性，将使用K-Fold交叉验证对提取的特征进行验证，k =10。使用六种不同类型的僵尸网络进行验证的结果表明，高精度为98.70％，F-Measure = 99.40％，对于规则归纳算法，召回率为98.80％，准确度为98.80％，而K最近邻是所有精度最稳定的算法，这些算法可实现98.10％的精度，召回率，F量度和准确性以及高速（50毫秒）。

著录项

来源
《2017 International Electronics Symposium on Knowledge Creation and Intelligent Computing》|2017年|33-39|共7页
会议地点 Surabaya(ID)
作者
Yesta Medya Mahardhika; Amang Sudarsono; Ali Ridho Barakbah;
展开▼
作者单位

Departement of Information and Computer Engineering, Graduate Program of Engineering Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia;

Departement of Information and Computer Engineering, Graduate Program of Engineering Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia;

Departement of Information and Computer Engineering, Graduate Program of Engineering Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Feature extraction; Data mining; Training data; Protocols; Ports (Computers); Data models;

机译：特征提取;数据挖掘;训练数据;协议;端口（计算机）;数据模型;;

相似文献

外文文献
中文文献
专利

1. A Novel Application for Combining CASs and Datasets to Produce Increased Accuracy in Modeling and Predicting Cancer Recurrence [J] . John Norris, Erin Barns, Olivia Schultz, Procedia Computer Science . 2013,第1期

机译：结合CAS和数据集以在建模和预测癌症复发中提高准确性的新型应用
2. Significant improvement of miRNA target prediction accuracy in large datasets using meta-strategy based on comprehensive voting and artificial neural networks [J] . Bi Zhao, Bin Xue BMC Genomics . 2019,第1期

机译：基于综合投票和人工神经网络的元策略，在大型数据集中重大改善MiRNA靶预测准确性
3. RadialGAN: Leveraging multiple datasets to improve target-specific predictive models using Generative Adversarial Networks [J] . Jinsung Yoon, James Jordon, Mihaela Schaar JMLR: Workshop and Conference Proceedings . 2018,第4期

机译：RadialGAN：利用生成对抗网络，利用多个数据集来改善针对特定目标的预测模型
4. An implementation of Botnet dataset to predict accuracy based on network flow model [C] . Yesta Medya Mahardhika, Amang Sudarsono, Ali Ridho Barakbah International Conference on Knowledge Creation and Intelligent Computing . 2017

机译：基于网络流模型预测精度的僵尸网络数据集
5. Predictive Networking and Optimization for Flow-Based Networks [D] . Arnold, Michael. 2017

机译：基于流的网络的预测网络和优化
6. Significant improvement of miRNA target prediction accuracy in large datasets using meta-strategy based on comprehensive voting and artificial neural networks [O] . Bi Zhao, Bin Xue 2019

机译：基于综合投票和人工神经网络的元策略可大幅提高大型数据集中miRNA目标预测的准确性
7. A Novel Application for Combining CASs and Datasets to Produce Increased Accuracy in Modeling and Predicting Cancer Recurrence [O] . Norris John, Barns Erin, Schultz Olivia, 2013

机译：组合CAS和数据集以在建模和预测癌症复发中提高准确性的新型应用

An implementation of Botnet dataset to predict accuracy based on network flow model

摘要

著录项

相似文献

相关主题

期刊订阅