Study On The Application Of Statistical Learning Methods To Triple Classification Problems

Posted on:2024-02-29

Degree:Master

Type:Thesis

Country:China

Candidate:J W Yang

Full Text:PDF

GTID:2557306917991889

Subject:Statistics

Abstract/Summary:

PDF Full Text Request

With the development of artificial intelligence technology,statistical learning methods have been maturely applied in many fields.As the core field of statistical learning methods,classification models have become a hot research topic today.Existing classification models are usually designed for binary classification problems.With the continuous refinement of application areas,new challenges have been raised for conventional classification models.Existing classification methods often have difficulty achieving satisfactory performance when dealing with multiclass labels,time series features,and high-dimensional sparse data.Therefore,proposing a classification method that can effectively handle high-dimensional large-scale data and can be extended to multi-class scenarios has high practical value.This thesis combines statistical learning methods with deep learning methods to propose an effective ENet-LSTM/DNN combination model for solving classification problems.ENet/SCAD/MCP penalty logit models,machine learning models,and deep learning models are introduced as comparative objects.The performance of the ENet-LSTM/DNN model in stock price prediction and disease classification problems is evaluated using Kappa coefficient,PDI index,HUM index,and ROC curve.In the stock price prediction problem,this thesis obtained historical stock price data and financial news data for three companies,AAPL,MNST and BAC,from the Nasdaq index.Based on the TTR package and text sentiment analysis,32 technical indicators and 2 market sentiment indicators were calculated as predictive variables,with the rise,plateau,and decline of stock prices as response variables.Eight prediction methods were compared for their predictive performance,and the results showed that the ENet-LSTM model had the best predictive performance,with a predictive accuracy of up to 0.6350.The ENet-LSTM model combines the advantages of the ENet penalty method and the LSTM framework,which can eliminate multicollinearity among data and accurately capture effective information in time series data using gate mechanisms.In the disease classification problem,this thesis selected the infant genetic disease dataset from the Kaggle database as the research object.The physiological indicators of patients were taken as predictive variables,and the genetic disease types of patients were taken as response variables.The ENet-DNN model was established,and seven classification models were introduced for comparison.The results showed that the predictive accuracy of the ENet-DNN model was up to 0.702,which could effectively handle high-dimensional large-scale datasets and had better predictive performance than single ENet penalty logit model and DNN model.Therefore,based on statistical learning methods and deep learning methods,this thesis proposed ENet-LSTM/DNN classification methods that combine the advantages of ENet penalty method and deep learning models.These methods achieved good predictive performance in three classification problems in stock price prediction and disease field,and have high research value and practical significance.

Keywords/Search Tags:

Classification models, Deep learning, Statistical learning, Disease classification, Stock price prediction

PDF Full Text Request

Related items

1	Research On Learning Performance Prediction In Online Learning Environment
2	Research On Prediction Of Oil Price Fluctuation Based On Deep Learning Model
3	Research On The Analysis Method Of Teacher’s Classroom Question Based On Deep Learning
4	Research On Online Learner Behaviour Mining And Classification Model Based On Deep Learning
5	Research On Classification Of Gene Expression Data Based On Statistical Learning Algorithm
6	Primary School Classroom Based On SOLO Classification Theory Deep Learning Research
7	Research On Prediction Of Stock Price Based On Neural Network
8	Research On Housing Liquidity Based On Statistical Learning Methods
9	Research On The Development Of “Deep Learning” Course In Secondary Vocational School Based On Educational Objective Classification Theory
10	Research And Application Of Text Classification Method Based On Deep Learning