Font Size: a A A

Data Mining-based Subhealth Analysis Of Chinese Software Programmers

Posted on:2019-01-19Degree:MasterType:Thesis
Country:ChinaCandidate:B D WangFull Text:PDF
GTID:2404330593450369Subject:Software engineering
Abstract/Summary:PDF Full Text Request
In recent years,with the continuous advancement of society and medical care,people have paid more attention to their own health problems.Subhealth,as the "nemesis" of modern human health,has received widespread attention.With the wide application of electronic devices,the programmer community is becoming larger and larger,and the health problems of the programmers that come with it are also getting more and more attention from people.However,as of now,China’s research on subhealth issues has not yet formed a unified and mature theoretical and practical research framework,research system,and research scale.As the causes of subhealth are relatively complex,and the analysis and research on subhealth issues are relatively limited,there has been little research on the subhealth issues of the programmer community.Based on data mining theory,this article analyzes and studies the problem of programmers’ subhealth.The main research work is as follows.First,the subhealth measurement scale was studied.The Chinese Subhealth Evaluation Scale(CSHES)was used to investigate the subhealth status of programmers.At the same time,by investigating daily habits of programmers and other aspects,new entries for the programmer’s basic situation and life and work habits have been added as data collection for the programmer’s basic situation and daily behavior.The questionnaire published in the form of a questionnaire on the Internet,invited the Chinese programmers to collect the scale data,provide data support for the pertinence of the scale and subsequent experimental analysis.Secondly,the logistic regression algorithm and decision tree algorithm are studied.The data is analyzed by the combination of logistic regression model and CART tree.Finally,based on the processing and analysis of the data collected by the programmer’s subhealth evaluation scale,a decision tree model and a logistic regression model were established to process the experimental data.In the experiment,a decision tree model was established through the decision tree algorithm,and 18 rules were derived based on each path from the root node to the leaf node in the decision tree model.Through the logistic regression analysis,a logistic regression model was established and summarized.There are five main aspects of the programmer’s subhealth status.At the same time,the experiment uses logistic regression algorithms to model and analyze the influencing factors in the five aspects summarized by the logistic regression model,and summarizes the factors that have important influence on it.The data analysis results show that the prevalence rate of Chinese programmers’ subhealth in 2017 is 79.2%.The subhealth status of the Chinese subhealth status mainly includes metabolic disorders,depression,stress,satisfaction and sexual life.The combination of overtime,exercise,work status,and age is the main reason why a programmer’s body develops from a healthy state to a sub-healthy state.Through investigation and research based on the decision tree algorithm,Chinese programmers can compare the decision tree model and its 18 rules according to their basic conditions and living habits,and understand their own basic conditions and living habits affecting their subhealth.Determine the probability that you are in a subhealth state.Through the investigation and research based on the logistic regression algorithm,Chinese programmers can clearly understand their current subhealth status based on their own physical feelings and comparing the logistic regression models.Through the logistic regression analysis of the five main aspects of programmers’ subhealth status,the study found that the five major aspects correspond to the main influencing factors,according to the habits that may lead to the corresponding symptoms of life and work habits,improve the level of health,develop health Lifestyle.
Keywords/Search Tags:Chinese programmers, Subhealth, Logistic regression, Decision tree
PDF Full Text Request
Related items