Federated learning can solve the data silo problem of traditional machine learning: each participant's raw data can be used to train machine learning models without leaving its local environment. In reality, however, the participants' datasets may differ substantially in both instance space and feature space, which makes training the federated model difficult and reduces its prediction accuracy. In addition, data exchanged during federated training, such as gradients and local models, may indirectly leak participants' private information, so privacy protection strategies must be introduced. To address these issues, this paper carries out the following research:

(1) An Instance Similarity based Federated Transfer Learning method (IS-FTL) is proposed for the case where participants' datasets differ substantially in instance space and feature space, degrading the federated model's prediction accuracy. Hash values of participant instances are computed with a Boundary-Expanding Locality-Sensitive Hashing algorithm to build a hash table and a similarity matrix, which are then used to mine the similarity between participants' instances. Each participant increases the weight of its local data according to instance similarity, realizing instance-based federated transfer learning. The method is implemented on the XGBoost gradient boosting tree model. Experiments show that IS-FTL improves the prediction accuracy of the federated model.

(2) To address the lack of privacy protection in IS-FTL, a Differential Privacy based Federated Transfer Learning method (DP-FTL) is proposed, focused on the key points of privacy protection for tree models. Differential privacy is used to protect the gradient information exchanged by the participants, the node-splitting process, and the output values of leaf nodes. Because the sensitivity bounds of the split-node utility function and of the leaf-node output values are difficult to determine when differential privacy is applied to tree models, the method introduces a sensitivity calculation based on the maximum gradient. A privacy budget allocation strategy based on information entropy adjustment (EA-APA) is proposed to improve on traditional allocation strategies. Theoretical analysis shows that DP-FTL satisfies ε-differential privacy, and experiments show that combining DP-FTL with EA-APA reduces the model's prediction error and improves its stability.
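As a rough illustration of the similarity-mining step in (1), the sketch below uses plain random-hyperplane LSH to hash instances, count signature collisions between two parties, and derive per-instance weights. The boundary-expanding variant and the exact weighting rule of IS-FTL are not reproduced here; `n_planes`, `base`, and `scale` are illustrative parameters.

```python
import numpy as np

def lsh_signatures(X, n_planes=8, seed=0):
    # Random-hyperplane LSH: each instance gets a bit signature;
    # instances with many matching bits are likely to be similar.
    rng = np.random.default_rng(seed)
    planes = rng.standard_normal((X.shape[1], n_planes))
    return (X @ planes > 0).astype(np.uint8)

def similarity_matrix(sig_a, sig_b):
    # Entry (i, j) counts matching hash bits between instance i of
    # party A and instance j of party B (more matches = more similar).
    return (sig_a[:, None, :] == sig_b[None, :, :]).sum(axis=2)

def instance_weights(sim, n_planes, base=1.0, scale=1.0):
    # Up-weight local instances that resemble the other party's data,
    # so the boosted trees transfer knowledge from similar instances.
    closeness = sim.max(axis=1) / n_planes
    return base + scale * closeness
```

The resulting weights could then be supplied to XGBoost, e.g. through the `weight` argument of `xgboost.DMatrix`, so that similar instances contribute more to each boosting round.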
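The leaf-node protection in (2) can be sketched with the Laplace mechanism. The sensitivity bound `g_max / lam` and the entropy-based budget split below are simplified assumptions for illustration, not the exact formulas of DP-FTL or EA-APA.

```python
import numpy as np

def leaf_value(grads, hess, lam=1.0):
    # Standard XGBoost leaf weight: -G / (H + lambda).
    return -grads.sum() / (hess.sum() + lam)

def dp_leaf_value(grads, hess, eps, g_max, lam=1.0, rng=None):
    # Laplace mechanism on the leaf output. Assumed sensitivity bound:
    # adding or removing one instance shifts the leaf value by at most
    # g_max / lam when gradients are clipped to [-g_max, g_max].
    rng = rng or np.random.default_rng()
    sensitivity = g_max / lam
    return leaf_value(grads, hess, lam) + rng.laplace(0.0, sensitivity / eps)

def entropy_adjusted_budget(total_eps, leaf_counts):
    # Hypothetical entropy-style allocation: leaves holding fewer
    # instances (higher surprisal) receive a larger share of epsilon,
    # since their outputs are more easily distorted by noise.
    p = np.asarray(leaf_counts, dtype=float) / sum(leaf_counts)
    info = -np.log(p)
    share = info / info.sum()
    return total_eps * share
```

By the sequential composition property of differential privacy, the per-leaf budgets produced this way sum to the total budget, so the overall release still satisfies ε-differential privacy under the stated sensitivity assumption.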