Efficient Black-box Adversarial Attack With Low Query Cost

Posted on:2021-04-26

Degree:Master

Type:Thesis

Country:China

Candidate:J H Cai

Full Text:PDF

GTID:2428330620468103

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

With the fast development of deep learning,deep neural network is playing an important role in the fields such as computer vision,natural language processing and voice recognization.However,recent studies have shown that the neural networks without any defensive mechanism were easily affected by small perturbations.For example,the attacker can generate a small perturbation which is hard to be detected by human eye but fools the neural network to classify the adversarial picture to a wrong category with high confidence.For this reason,adversarial examples are harmful in the fields that need security such as self-driven systems.There are two kinds of adversarial attacks.First kind of adversarial attacks is whitebox adversarial attack.It assumes that attackers can get not only outputs of models but also structures and parameters of objective deep models.It is easy for attackers to generate adversarial examples by models' parameters and defeat the white-box models.The other kind of adversarial attacks is black-box adversarial attack.It's more practical and it assumes that attackers can't get the models' parameters.The black-box adversarial attack is more difficult than the white-box adversarial attack but more harmful.We propose a black-box attack method which called Multi-Model Efficient Query Attack(MEQA)to solve the problem that state of the art black-box attack methods need lots of queries to estimate the gradients of black-box models.We think that there exists some relationship between white-box models and the objective black-box model,so MEQA uses the gradients of white-box models to estimate the black-box model's gradient and guide the direction of the optimization during the attack.Whatsmore,we combine the natural evolution strategy and MEQA to finetune the adversarial example.To imporve the success rate of target attack,we define a new target attack loss called CombineLoss and a new target attack method called Target MEQA-NES.In one word,MEQA has good attack perfermance and improves the black-box attack's query efficiency.The main contributions of this papers are as follows:1.We propose the Multi-Model Efficient Query Attack(MEQA)method to imporvethe query efficiency of black-box adversarial attack.MEQA use the white-box mod-els' gradients to estimate the black-box model's gradient instead of estimating withonly mathematical methods.This change helps MEQA greatly imporve the queryefficiency.2.MEQA's effect depends on how related between white-box models and the black-box model.When the relationship is weak,MEQA has a poor attack success rate.So we propose MEQA-NES which combines the white-box models' gradient di-rections and gussian random directions to estimate the real black-box model gradi-ents.Whatsmore,we use an Encoder and Generator to reduce the search dimension.MEQA-NES is both efficient and effective.3.In target adversarial attack field,we propose CombineLoss which combine the un-target adversarial attack loss and the ordinary target adversarial loss.Our workcalled Target MEQA-NES is designed to attack face recognization.It can pretenda common people to a target celebrity.4.We design the MEQA algorithm and make experiments on the classifction task andface recognization task.MEQA has good results on all of the tasks and it provesour assumption that there exists relationship between all deep neural models.

Keywords/Search Tags:

Black-box Model Attack, Adversarial Example, Transfer Attack, Model Robustness, Face Adversarial Attack

PDF Full Text Request

Related items

1	Hybrid Gradient Model Based Black-Box Adversarial Attack
2	Research On Image Adversarial Attack Algorithm Based On Decision Boundary In The Black-Box Scenario
3	Research On Audio Adversarial Attack Technology
4	Research On Image Adversarial Attack Algorithms Based On Black-box Transferability
5	Research On Image Classification Adversarial Attack Based On Multiobjective Evolution Model In The Black-box Scenario
6	Research On Robust Adversarial Attack Method In Three-dimensional Space
7	Design And Implementation Of Image Recognition Adversrial System Based On Black-Box Attack Technology
8	Research On Black-box Attack Methods In Image Adversarial Examples Generation
9	Heuristic Black-Box Adversarial Attacks On Video Recognition Models
10	Research On The Robustness Of Deep Image Classification Models Based On Adversarial Examples