Data Sanitization In Adversarial Environment

Posted on:2017-06-27

Degree:Master

Type:Thesis

Country:China

Candidate:H J Li

Full Text:PDF

GTID:2348330536453462

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Machine learning has been applied in many applications,such as spam filtering and intrusion detection with satisfying performance.However,it now faces a threat of an adversary attack in security applications.The attacker may exploit a weakness of a learning process and degrades the performance of the system.This study aims to analyze a causative attack,which is one of the most serious attacks in adversarial environment.In causative attack,an attacker has the ability to manipulate some of the training data to mislead the learning process of a classifier.We proposed two methods of data sanitization to defend against causative attack.We firstly present an improved method to increase the time complexity of RONI defense.Then,an efficient data sanitization based on data complexity,which is a measure that characterizes the difficulty of a classification problem,is proposed.The major contributions of this study are as follows:1)We proposed an improved defense which simplifies a time complexity of RONI defense by considering a set of similar samples grouped by the relative neighborhood graph.Experimental results suggest that the performance of our proposed method is similar to the RONI defense but with less time complexity.2)Our study shows that causative attack causes the different distributions on data complexity between the poisoned data and the normal data in a dataset.Based on this finding,we present a new defense of data sanitization based on data complexity to remove the attacked data of the training set.Experimental results show that those poisoned samples can be sanitized effectively by our method.

Keywords/Search Tags:

Adversarial learning, Causative attack, Data sanitization, Data complexity

PDF Full Text Request

Related items

1	Robust Machine Learning In Adversarial Environment And Its Applications
2	Research On Data Sanitization-based Strategy Of Sensitive Information Protection
3	Research On Multi-view-based Geometric Deep Learning Model Of 3D Data And Its Adversarial Attack
4	A Research On The Sanitization Technology For Data In NAND Flash
5	Robustness Of URL Online Learning
6	The Attack And Defense Researches Based On The Adversarial Examples
7	Research And Application On Adversarial Examples Under Security Of Deep Learning
8	Research On Adversarial Attack And Defense Technology In Machine Learning
9	Research And Implementation Of Deep Learning Protection Technologies For Adversarial Examples
10	Research On Several Problems In Privacy Preserving Information Sharing