Research On Classification Method Of Malicious Code Based On Bagging

Posted on:2020-05-27

Degree:Master

Type:Thesis

Country:China

Candidate:H C Jiang

Full Text:PDF

GTID:2518306047998449

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

With the rapid development of the Internet,all walks of life are increasingly relying on Internet technology.While people store a large amount of important data information on the Internet,they are paying more and more attention to information security.Internet information security issues such as information disclosure,hacker attacks,and virus transmission are emerging one after another.Various malicious codes are transmitted on the network,which jeopardizes the Internet environment.Due to the rapid development of malicious code and high update frequency,the detection and classification of malicious code has always been the focus of research.With the rapid development of machine learning algorithms,the application of machine learning dissemination to malicious code classification detection has achieved good results.Most malicious code classification detection methods extract all the malicious code features by use a single feature extraction method.The supervised learning decision tree algorithm processes the acquired feature information data,and classifies and detects viruses and Trojans according to the extracted malicious code features.The feature extraction method of the target program is easy to use,and the operation is simple and easy to implement.At the same time,the decision tree method has the advantages of not needing to prepare a large amount of data,relatively low time complexity,and being able to verify the model by using statistical tests.However,different feature extraction methods have different ability to extract feature information from the same malicious code.The classification accuracy of all types of malicious code using the same feature extraction method will be greatly different.On the other hand,there are over-fitting phenomena in the decision tree,which will have an impact on the processing of the data and the final classification results.Therefore,this thesis proposes a Bagging-based malicious code classification detection method to solve the above problems.The main contents of this thesis are as follows:Firstly,Research on existing malicious code detection methods,Analyze and compare different detection and classification methods of malicious code to study the existing problems.Then,according to the deficiencies in the existing methods,the Bagging integrated learning method is introduced to train and classify the acquired feature information,effectively solve the over-fitting phenomenon in the decision tree and improve the classification efficiency.Aiming at the problem that different malicious codes use the same feature extraction method,a method based on Bagging for malicious code classification is proposed.Finally,the proposed method is compared with the existing methods,and the evaluation methods are used to compare different malicious code classification detection methods.The experimental results show the practicability and efficiency of the method.

Keywords/Search Tags:

Malicious Code Classification Detection, Bagging, Decision Tree, Feature Selection

PDF Full Text Request

Related items

1	Research On The Main Techonologies In Malware Code Detection
2	Research And Realization Of The Malicious Code Detection System Based On Behavior Feature Analysis
3	Research On Subspace Classification Methods And Their Applications
4	Design And Implementation Of Hardware Algorithm For Decision Tree Classification Based On ZYNQ
5	Research On Malicious Code Detection Technology
6	Research On Classification Of P2P Traffic Based On Machine Learning
7	Research On Key Technologies Of Malicious Code And Emergency Response In Communication Networks
8	Research On Statistical Feature Based Malware Classification Methods
9	Clustering Analysis Of Malicious Code Based On N-gram Feature Extraction
10	A System Of Decision Trees Construction