A Data-driven Financial Products Recommendation Model And Algorithm

Posted on:2021-05-01

Degree:Master

Type:Thesis

Country:China

Candidate:P Y Ye

Full Text:PDF

GTID:2428330647450913

Subject:Probability theory and mathematical statistics

Abstract/Summary:

PDF Full Text Request

This paper uses the data from a security company's telemarketing to optimize the recommendation model and algorithm,and studies how to recommend financial products to new customers more accurately.In order to reduce the impact of data loss on the performance of the model,we tried to solve the problem of data loss with three solutions,namely,univariate interpolation,multivariate interpolation and the introduction of new features.We demonstrated the rationality of the above three methods through experiments.In addition,we experimentally demonstrated that the addition of additional socio-economic indicators would significantly improve the accuracy of the model.We examine five classical machine learning models: logistic regression,random forest,support vector machines,neural networks,stacking ensemble algorithm.What's more,we evaluate the model from different perspectives.The advantage of logistic regression and random forest is that the fitting model is easy to understand and also provides good prediction in classification.Comparing with the previous two classical statistical models,support vector machines and neural networks are more flexible,while showing the ability to learn from linear mapping to complex nonlinear mapping.Because of this flexibility,support vector machines and neural networks can often provide accurate predictions.But they are difficult to understand intuitively.Stacking ensemble algorithm fills the gap.Stacking,however,is slow because it involves a two-tier learning process,which means four complex models need to be.Interestingly,the theoretically optimal neural network classification is just of average performance,which fully demonstrates the difference between theoretical model and empirical research.The most profound lesson of this paper is: Data drives production.When we make a decision,we cannot brainstorm a model,rather we should choose the right model according to the data,hence we can make a reasonable and appropriate decision.On one hand,we should consider whether the original data supports the model we build.On the other hand,we need to evaluate whether the output results matches the decision maker's requirement.

Keywords/Search Tags:

classification problem, evaluation indicators, machine learning, discrete choice model, data-driven

PDF Full Text Request

Related items

1	Linguistic indicators for language understanding: Using machine learning methods to combine corpus-based indicators for aspectual classification of clauses
2	Research On Model Selection For Machine Learning
3	Research On Human-machine Collaboration Method Driven By Data And Model
4	Analysis Of Individual Credit Evaluation Indicators Based On Random Forsets
5	Research On Correlation Coefficient Of User Portrait Acquisition Based On Machine Learning
6	Study On The Incremental Learning Algorithms For Support Vector Machines
7	Research On Data-driven Data Assimilation Methods Incorporating Machine Learning Algorithms
8	Reinforcement Learning-based Data Driven Control For Nonlinear Discrete-time Systems
9	Data Driven Modelling And Model Free Adaptive Control With The Applications To Complex Industrial Systems
10	Classification Of Abnormal Data Flow Based On Machine Learning