Research On Anomaly Detection Of High-dimensional Data Based On Ensemble Generative Adversarial Networks

Posted on:2024-07-28

Degree:Master

Type:Thesis

Country:China

Candidate:M L Zhou

Full Text:PDF

GTID:2568307124463814

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Anomalies are objects that are significantly different from others in a dataset,which often contain some important information.Anomaly detection of high-dimensional sparse data is still a challenge due to the curse of dimensionality and the sparsity of the data.It is valuable to explore anomaly detection approaches for high-dimensional data.The idea of data generation is introduced to address such problems in the thesis.(1)An unsupervised anomaly detection approach combining Generative Adversarial Network(GAN)with ensemble learning,named GAN＿Ensemble,is proposed.Owing to the ability of the generator of a GAN to simulate the distribution of real data,a large volume of latent data can be generated to avoid the data space to be too sparse.Moreover,in model training,multiple pairs of generators and discriminators are fully connected with each other.Thus,the mismatch of generators and discriminators will enhance the generative model to learn more complex data distributions,and to improve anomaly detection effect finally.Experiments on public datasets show that the index AUC can be improved by 7% averagely compared to traditional GAN-based anomaly detection approaches.At the same time,there is also an increase of 7.5% to 21.8% on AUC compared with the classical anomaly detection approaches.(2)Considering that the proposed approach GAN＿Ensemble may drop into overfitting and has high time consumption,it is optimized further and an unsupervised anomaly detection approach DGANs based on selective ensemble Generative Adversarial Networks is proposed.DGANs removes connections between generators and discriminators with a specific probability during the training phase.This makes the connections between generators and discriminators sparser.For anomaly detection,the well trained discriminators are integrated selectively based on dynamical voting weights adjusting.This can not only avoid the model falling into overfitting,but also reduce the time cost and improve the detection.Experiments on public datasets show that DGANs can improve the average accuracy by 4.63% compared with GAN＿Ensemble.Moreover,DGANs also show advantages on recall rate and F1 score compared to classical anomaly detection approaches.

Keywords/Search Tags:

Anomaly detection, High-dimensional sparse data, Generative Adversarial Networks, Ensemble Learning

PDF Full Text Request

Related items

1	An Intrusion Detection Method Based On Generative Adversarial Networks And Ensemble Learning
2	Research On Anomaly Detection Algorithm For Sparse Spatiotemporal Data
3	Unsupervised Anomaly Detection Based On Sparse Autoencoder And Ensemble Learning
4	Research And Implementation Of Video Anomaly Detection Based On Generative Model
5	Research On Customer Profile Model Of A Commercial Bank Based On Machine Learning
6	Research On Anomaly Detection Methods Based On Generative Adversarial Networks
7	Research On Sparse Generative Adversarial Nets And Its Application In Recommendation System
8	Research On Anomaly Detection Algorithm Based On Autoencoder
9	Research On Network Anomaly Detection Based On Deep Learning
10	Research On Unsupervised Learning Algorithms For High-Dimensional Data