Malicious Urls Detection Using Deep Learning

Posted on:2019-03-09

Degree:Master

Type:Thesis

Country:China

Candidate:Farhan Douksieh Abdi

Full Text:PDF

GTID:2428330578472717

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

In recent years,with the rapid development of the Internet,websites registering becomes extremely easy.So,the number and size of websites grow rapidly.On one hand,this has become an important part of our everyday life for information communication and knowledge dissemination.It helps to transact information timely,rapidly and easily.On the other hand,Malicious URLs host unsolicited content(spam,phishing,drive-by exploits,etc.)and lure unsuspecting users to become victims of scams(monetary loss,theft of private information,and malware installation),and cause losses of billions of dollars every year.it also provides living space for malicious websites.URL detection is one of the most important technologies to prevent attacks from malicious websites.However,to detect such crimes systems should be fast and precise with the ability to detect new malicious content.Traditionally,this detection is done mostly through the usage of blacklists.However,blacklists cannot be exhaustive,and lack the ability to detect newly generated malicious URLs.As malicious URL becomes more complex,traditional URL detection methods seem to be unable to handle it.Thus,it is particularly important to explore new,more reliable and accurate URL detection methods.Our work is a high-precision,susceptible-learning and rapidly-reacting malicious URL detection system based on the deep learning algorithms.The whole work can be divided into three parts:the features selecting and processing,the training part and the classification part.In the features selecting and processing,we get the word2vec feature,TF-IDF feature and content-based feature respectively.These features have been used to train different machine learning and deep learning algorithms to detect and predicting URLs it is good or bad.In the results section,the performance evaluation of each of these algorithms have been presented.The results show that the algorithns Convolutional Neural Network(CNN)achieved the highest accuracy to compared with two other algorithms to know(SVM,LR).In the final classification stage,we use a layer of a fully connected network to further address the three probability matrices,and give a final result(malicious,benign).

Keywords/Search Tags:

Malicious URL Detection, Convolutional Neural Network(CNN), Support Vector Machines(SVM), Cyber Security, Logistic Regression(LR)

PDF Full Text Request

Related items

1	Malicious Link Detection System Based On Support Vector Machines And Node Detection
2	Research On The Key Problems Of Twin Support Vector Machines
3	Research Of Dynamic Forecasting Method And Implementation Technique Based On Support Vector Machines
4	Research On Technology Of URL Security Detection Based On Machine Learning
5	Research On Model And Information Transform Mechanism Of Process Support Vector Machines
6	Some Research For Neural Networks And Support Vector Machines
7	Research On Automatic Classification Algorithms For All-sky Cloud Image
8	Studies Of Some Problems In Support Vector Machines And Semi-supervised Learning
9	Improved Regression Estimation Algorithm Of SVM And Its Applications
10	Research On Intrusion Detection Methods Based On Clustering And Support Vector Machines