Based On Gibbs Sampling And XGBoost Personal Loan Default Prediction Research

Posted on:2023-07-20

Degree:Master

Type:Thesis

Country:China

Candidate:G H Zhang

Full Text:PDF

GTID:2558306623491044

Subject:Applied statistics

Abstract/Summary:

PDF Full Text Request

At present,the scientific and accurate approval of personal credit is the main focus of major banks and related financial institutions,which is related to the final loan recovery.Therefore,major financial lending institutions must also be required to obtain a reliable indicator system on personal loan defaults,and establish a scientific and accurate loan risk prediction model that can mine users’ potential information and automatically identify users’ lending behavior.The classic and reliable loan default prediction model is a classification model based on XGBoost.The feature screening method of this model generally uses feature engineering and IV value(the predictive ability of feature variables)to screen feature indicators,but the calculation method of the IV value itself is linear.The calculation method,and XGBoost is a nonlinear model,the variables obtained by screening based on the IV value are not in line with the XGBoost model.In this paper,the Gibbs Sampling method under the MCMC framework will be used to screen and extract the features that affect the personal loan situation,and XGBoost will be used as a screening tool to randomly search and extract the associated feature factors that affect personal loans.The variables obtained by screening are more in line with expectations.Construction of the XGBoost model.Compared with the traditional feature screening method,the similarities and differences between the two feature systems are analyzed,and the classic XGBoost model is built based on the two index systems.The performance metrics outperform XGBoost based on IV value screening features.And during the construction process,it was found that the machine learning model is not interpretable,so the SHAP interpretation method is added to explain in detail how each indicator affects the classification and prediction results of the XGBoost model.

Keywords/Search Tags:

machine learning, personal loan, XGBoost, Gibbs Sampling, SHAP

PDF Full Text Request

Related items

1	Research On Personal Credit Default Prediction Based On XGBoost+RF
2	The Study On Credit Rating Of Personal Credit Loan In Bank A Based On Machine Learning
3	Evaluation Of The Effect Of The Personal Loan Default Model After Adding The Characteristics Of The Guarantee Network
4	An Empirical Analysis Of P2P Loan Default Prediction Models
5	Research And Application Of Loan Default Prediction Model Based On XGBoost-Stacking Ensemble Learning
6	A Study On The Application And Evaluation Of Machine Learning Algorithms In Personal Loan Default Prediction
7	Analysis For Cancelled Orders Of Online Car-hailing With Three Gradient Boosting Algorithms And SHAP Value
8	XGBoost-based Online Loan Risk Prediction
9	The Prediction And Analysis Of User's Loan Risk Based On CNN And XGBoost
10	Application Of Data Mining In Personal Credit Risk Identification Of P2P Online Loan