Research On Phishing Website Detection Based On Data Mining Classification Algorithm

Posted on:2019-04-20

Degree:Master

Type:Thesis

Country:China

Candidate:J Q Zhao

Full Text:PDF

GTID:2428330548487286

Subject:Information management and information systems

Abstract/Summary:

PDF Full Text Request

In recent years,with the development of the Internet,the Internet has gradually become a platform for people's activities such as work,study,shopping,and financial transactions.He has brought great convenience to people's work and life,and is prospering in the e-commerce and e-finance industries.At the same time,the issue of network security has become increasingly prominent.Among them,phishing websites are particularly harmful.The phishing website is a kind of social engineering-based attack method and is a criminal mechanism for criminals to deceive user identity data and financial accounts.The website,content and layout of phishing websites are very similar to those of real websites.This makes it difficult for netizens without security awareness to be deceived.Phishing brings huge economic losses to people and seriously hinders the development of e-commerce..Effectively curbing phishing websites is a guarantee of network security.Research on the detection and defense of phishing attacks is imminent.At present,there are achievements in the research of defense phishing websites at home and abroad.However,they all have flaws.In recent years,in the research of fishing website detection methods,the application of machine learning has made great achievements.The application of various classification algorithms in machine learning to phishing website detection and identification can effectively improve the detection efficiency.Therefore,this paper studies the phishing website identification method based on data mining classification algorithm.The focus of this paper is to analyze and compare the classification performance of different classifier algorithms in phishing website detection in machine learning.The main work of this paper is divided into three parts.First of all,in this paper,PCA principal component analysis method is used to analyze the importance of URL anomaly feature and multi-source fusion feature of selected phishing websites,and obtain effective features.Secondly,using three classical classification algorithms in machine learning—support vector machine,random forest,and RBF neural network—to generate classification models from web page feature sets,and conduct comparative experiments on them.Finally,aiming at the shortcomings of the three classification models,an improved new classifier model-KELM classification model is proposed based on this,and further evaluation and analysis are made based on URL anomaly characteristics and multi-source fusion characteristics through six evaluation indexes.The precision and performance of phishing website recognition.The experiment proves that it can effectively improve the recognition efficiency under the condition of using the KELM algorithm,and proves the effectiveness of the KELM classifier model in the detection of phishing websites.

Keywords/Search Tags:

Phishing websites, Data Mining, Classification model, Machine learning

PDF Full Text Request

Related items

1	A Phishing Website Detection Method Based On Stacking Model
2	A Flexible Phishing Detection Approach Based On Software-Defined Networking Using Ensemble Learning Method
3	The Application Of The Classification Method In Identifying The Website Phishing Data
4	Research And Implementation On Joint Features And Intelligent Detection Algorithms Of Phishing Webpages
5	Research On The Recognition Of Phishing Websites Based On K-Means And SVM
6	Phishing Websites Detection Using Selected Features Classification And Bidirectional Long Short-Term Memory Neural Networks
7	Research On Phishing Webpages Detection Based On Machine Learning
8	The Graduates’ Income Forecast And Analysis Research Based On Machine Learning
9	Research On Model Selection For Machine Learning
10	Design And Implementation Of Phishing Websites Information System Based On Web