Research On Anonymous Method Based On Proximity Resistance Of Sensitive Information

Posted on:2021-02-06

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Lv

Full Text:PDF

GTID:2428330611994592

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Data anonymization is often used to solve the problem of privacy leakage in data publishing due to its unique advantages in security and effectiveness.aiming at the problem of privacy leakage caused by similarity attacks,this paper proposes(r,k)-anonymity model,sets proximity resistance threshold based on proximity association of sensitive attributes,and designs an anonymous algorithm GDPPR(Generalized data for privacy proximity resistance)that satisfies(r,k)-anonymity model.the fuzzy clustering technique based on(r,k)-anonymous model improvement is used to complete the cluster division,and the membership matrix is obtained,so the frequency of sensitive attributes with adjacent associations in each equivalence class is not higher than the threshold After anonymization is ensured,the data can meet the user's privacy protection needs and it is valuable to use.this article is mainly based on the following two key points:1.Research the anonymous model of sensitive attribute proximity resistance and design an anonymous algorithm that satisfies(r,k)-anonymous model GDPPR.aiming at the problem that similarity attacks will cause privacy leakage,this paper proposes(r,k)-anonymous model.it is required that the data table after anonymization meets the k-anonymity model,and the frequency of sensitive attributes with adjacent associations in the same equivalence class does not exceed the threshold .effectively reduces the probability of attackers obtaining similar sensitive information through similarity attacks and verifies(r,k)-the anonymous effect of the anonymous model through experiments.2.Research the availability of data in the GDPPR algorithm.the GDPPR algorithm completes the division of clusters by fuzzy clustering technology and then effectively improves data availability during the process of generalization to form equivalent classes.in view of the problem of unnecessary information loss caused by generalization of attributes,this paper improves fuzzy clustering technology based on(r,k)-anonymous model,and completes the clustering of data sets according to the improved fuzzy clustering.through experimental comparison,it is proved that the data set after clustering can effectively reduce information loss and improve data availability in the process of forming equivalence classes.In this paper,two standard data sets of the Adult data set and the Census-Income data set in the UCI machine learning library were selected to conduct the experimental comparison tests.the results prove that the GDPPR algorithm can well satisfy the(r,k)-anonymous model and can effectively resist similarity attacks.and it has better anonymity and data availability under the similar time cost as Mondrian algorithm.

Keywords/Search Tags:

privacy protection, data anonymity, proximity association, fuzzy clustering, data generalization

PDF Full Text Request

Related items

1	Research And Implementation Of K-anonymity Privacy Protection Algorithm Based On Local Generalization
2	Research On Privacy-preserving Data Publishing Algorithms Based On Different Anonymity Requests
3	An Privacy Protection Algorithm Based On K-Anonymity
4	Research On Privacy Protection Algorithm Based On K-anonymity
5	Research On Rational Privacy Protection Model And Algorithm Based On Association Rule
6	Research On Anonymization Privacy Protection Techniques Based On Clustering
7	Research On Privacy Protection Technology Of Data Publishing Based On Anonymization
8	Research On Anonymity Models And Algorithms For Privacy-Preservation Data Publishing
9	Research On Multi-Domain Data Privacy Protection Technology For Cloud Platform
10	Research And Application On Privacy Protection For Data Mining