The complex and opaque decision-making processes of machine learning models limit the interpretability of their predictions, making it impossible to effectively mine results beyond the empirical. Counterfactual explanation, which can trace the causal mechanisms behind data, is a hot topic in interpretable machine learning. However, existing methods either reduce prediction accuracy by ignoring the interference of irrelevant and redundant features in the instances, or produce false causal explanations because they cannot accurately identify the key causes affecting the target variable, resulting in unstable algorithms and high reversal costs. By injecting counterfactual explanations into the prediction model and automatically adjusting feature values within a minimal boundary to generate counterfactual instances that invert the prediction, the causal relationships between the target variable and the features can be explained more accurately. The main contributions of this study are as follows.

First, considering that irrelevant or redundant features in data instances can reduce prediction accuracy and lead to false causal explanations, the minFB algorithm for mining the minimal feature boundary (MFB) is proposed. The minFB algorithm mines the Markov boundary of the target variable using conditional independence tests, and then introduces additive factors to supplement causal features that may have been missed, yielding an MFB from which irrelevant and redundant features are removed.

Second, the counterfactual explanation generation algorithm CEGMFB, based on the minimal feature boundary, is proposed. The algorithm uses the MFB mined by minFB as the generation range for counterfactual instances and automatically adjusts feature values within the MFB to invert the prediction at minimum cost. By limiting counterfactual changes to the MFB, i.e., the causal features of the target variable, invalid adjustments are avoided, reducing both counterfactual generation costs and false causal explanations.

Third, the performance of the algorithms is verified through parametric analysis and experimental comparison. The performance of minFB and the effectiveness of the MFB for counterfactual explanation are verified on 16 representative datasets from different domains. In addition, CEGMFB is compared experimentally with state-of-the-art counterfactual explanation generation algorithms on evaluation metrics such as validity, proximity, sparsity, and distance.

Finally, CEGMFB and the comparison algorithms are applied to a real glioma classification scenario: the obtained minimal feature boundary is used to find the classification decision boundary, and the generated counterfactual instances are used to explain glioma grading. The effectiveness of the algorithm is verified by analyzing the experimental results.
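The abstract does not give the details of minFB, but the first step it describes, mining the Markov boundary of a target variable with conditional independence tests, can be illustrated with a generic grow-shrink sketch. The Fisher-z test and the function names below are our own illustrative choices, not taken from the paper:

```python
import numpy as np
from scipy import stats

def ci_test(data, x, y, z, alpha=0.05):
    """Fisher-z conditional independence test: residualize columns x and y
    on the conditioning set z, then test whether the residual correlation
    is zero. Returns True if x and y look independent given z."""
    n = data.shape[0]
    if z:
        Z = np.column_stack([np.ones(n), data[:, z]])
        rx = data[:, x] - Z @ np.linalg.lstsq(Z, data[:, x], rcond=None)[0]
        ry = data[:, y] - Z @ np.linalg.lstsq(Z, data[:, y], rcond=None)[0]
    else:
        rx, ry = data[:, x], data[:, y]
    r = np.clip(np.corrcoef(rx, ry)[0, 1], -0.9999, 0.9999)
    fisher_z = 0.5 * np.log((1 + r) / (1 - r)) * np.sqrt(n - len(z) - 3)
    p = 2 * (1 - stats.norm.cdf(abs(fisher_z)))
    return p > alpha

def markov_boundary(data, target, alpha=0.05):
    """Grow-shrink sketch: greedily add features dependent on the target
    given the current boundary, then prune features that become
    independent given the rest."""
    n_vars = data.shape[1]
    mb = []
    changed = True
    while changed:  # grow phase
        changed = False
        for v in range(n_vars):
            if v != target and v not in mb and not ci_test(data, target, v, mb, alpha):
                mb.append(v)
                changed = True
    for v in list(mb):  # shrink phase
        rest = [u for u in mb if u != v]
        if ci_test(data, target, v, rest, alpha):
            mb.remove(v)
    return sorted(mb)
```

On linear-Gaussian data where the target depends on two of three features, this recovers exactly those two, dropping the irrelevant one, which is the filtering effect the abstract attributes to the MFB.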
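Likewise, CEGMFB is described only at a high level. The following toy sketch, our own construction rather than the paper's algorithm, shows the core idea of restricting the counterfactual search to the MFB: only causal features are adjusted, features outside the boundary are frozen, and the search stops as soon as the prediction inverts:

```python
import numpy as np

def counterfactual_in_mfb(prob, x, mfb, step=0.1, max_iter=500):
    """Greedy sketch of MFB-restricted counterfactual generation.
    `prob` maps a feature vector to P(class 1). At each iteration, apply
    the single +/-step change to one MFB feature that moves the predicted
    probability furthest toward the opposite class; return the modified
    instance once its thresholded label flips, else None."""
    y0 = prob(x) >= 0.5                      # original binary label
    cf = np.asarray(x, dtype=float).copy()
    for _ in range(max_iter):
        if (prob(cf) >= 0.5) != y0:          # prediction inverted: done
            return cf
        best_gain, best_move = 0.0, None
        for j in mfb:                        # only causal features move
            for d in (step, -step):
                cand = cf.copy()
                cand[j] += d
                # gain = probability movement toward the flip
                gain = (prob(cf) - prob(cand)) if y0 else (prob(cand) - prob(cf))
                if gain > best_gain:
                    best_gain, best_move = gain, (j, d)
        if best_move is None:                # no move helps: give up
            return None
        cf[best_move[0]] += best_move[1]
    return None
```

Because every change stays inside the MFB, the returned instance differs from the original only in causal features, keeping the counterfactual sparse and its implied explanation tied to the identified causes.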