Research On Financial Credit Forecasting Based On Ensemble Decision Tree

Posted on:2022-02-23

Degree:Master

Type:Thesis

Country:China

Candidate:C Qin

Full Text:PDF

GTID:2518306476978849

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Issuing loans to potential users is the core business of lending institutions around the world.The loan business brings considerable benefits to the organization,but also makes the company bear the huge risk of financial losses due to the default of the borrower.Therefore,the lending institution needs to comprehensively analyze the basic information and credit history submitted by the applicant to assess the possibility of repayment,and decide whether to approve the application.Although there are relatively mature prediction methods in this field,in the face of the inconsistency of credit records of various institutions,credit data has complex characteristics,a large number of missing values,and sparse data.Research on the prediction of defaults applied by borrowers Still very challenging.The ensemble decision tree model is a powerful regression and classification method,which has made outstanding achievements in data competitions for many times.It has attracted wide attention of researchers for its good explanatory ability in financial credit prediction.Appropriate hyper-parameter setting is the necessary basis for good performance of the model.In order to ensure the accuracy of the credit prediction model,we use the improved adaptive optimization algorithm to optimize the limit gradient lifting tree and establish the credit scoring model.In addition,the traditional prediction models mainly based on hard information data modeling usually ignore the importance of soft information in credit application to predict default.In order to supplement the limitations of hard information features in credit model,we propose to use multi-granularity text feature extractor to extract text information features from word level granularity and sentence level granularity,and combine soft and hard information to enrich the data support and prediction basis of credit model.In view of the above problems,this paper constructs an ensemble decision tree model based on improved hyper-parameter optimization to predict default.With more reasonable super parameter settings,the model obtains more accurate prediction results.In addition,the text constructs an integrated decision tree model for credit prediction based on multi-granularity text feature extraction combined with soft and hard information.Compared with the traditional prediction model,this model can use the characteristics of text soft information to improve the prediction performance of the model.The main contributions of this paper are:Firstly,based on the optimization algorithm of adaptive learning strategy,the hyper-parameters of the limit gradient boosting tree are optimized,and the credit scoring model is established to accurately determine the optimal setting of the super-parameters of the model.The model structure more matches the characteristics of financial credit data,the model makes more accurate prediction and judgment.Secondly,the proposed improved particle swarm optimization algorithm is based on adaptive subgroup division,and uses two different learning strategies to update different types of particles to enhance the diversity of particle populations.The improved optimization algorithm can average the information of local optimal particles to prevent premature convergence caused by local aggregation of particles.Thirdly,a new credit risk assessment model based on text feature extraction of multi-granularity analysis is proposed.The model can extract multi-granularity feature of credit text,extract text context semantics and word level semantics,and improve the richness of text features.The model uses the feature combination characteristics of decision tree to integrate soft and hard information to achieve the purpose of accurate prediction of credit data.

Keywords/Search Tags:

Credit prediction, integrated learning, Extreme gradient boosting tree, Adaptive partitioning, Parameter optimization

PDF Full Text Request

Related items

1	The Research Of Gradient Boosting-based Ensemble Scheme For Extreme Learning Machine
2	Research On XGBoost Decision Tree Optimization And Its Application
3	Intrusion Detection Model Optimization Scheme For Gradient Boosting Trees In Edge Computing
4	Research On The Trend Forecast Of Shanghai And Shenzhen 300 Index Based On EMD-XGBoost
5	Research On Anomaly Detection And Prediction Algorithm For Multidimensional Time Series Data Based On Deep Learning
6	Link Quality Estimation Method Based On Gradient Boosting Decision Tree
7	Recognition Of Radar Work Mode Based On Auto-encoder And Gradient Boosting Model
8	Research On Context-aware Recommendation Method Based On Gradient Boosting Decision Tree
9	Research On Fair Privacy Gradient Boosting Decision Tree System Based On Trusted Execution Environment
10	Reordering Of Question Answering System Based On Deep Learning And Gradient Lifting Tree Algorithm