
The Application of Association Analysis to the Views on Height of Urban and Rural Residents of Jilin Province

Posted on: 2007-03-03  Degree: Master  Type: Thesis
Country: China  Candidate: S J Yao  Full Text: PDF
GTID: 2178360182996429  Subject: Computer software and theory
Abstract/Summary:
Since ancient times, height has been a matter of public concern, and people's views on it differ. Some people prefer to be tall, while others think that being short is also good. Some believe that height is related to work, choosing friends, marriage, and so on; others disagree. Views on height have also differed from one era to another. What, then, is the situation at present? What is people's average height, what is their expected height, and does height influence us?

"Research on the Views on Height in Urban and Rural Areas of Jilin Province" investigates height-related items from every aspect, collects data of each kind, and seeks a specific understanding of present-day society's views on height from a sociological perspective. The tasks of this thesis are: (1) to establish a database and, through a friendly user interface, to support the input, maintenance, browsing, and querying of the height-view data; (2) to apply mathematical statistics to the data in the database, for example to compute the proportion of the population in each height band, or how ideal height varies with years of schooling; (3) to use association rules from data mining to discover valuable rules hidden in the height-view attributes, so that researchers can gain an in-depth understanding of people's perceptions of height from these rules and study this field thoroughly.

Data mining is the process of extracting meaningful knowledge patterns from noisy and incomplete databases. Data mining methods give people the ability to realize the true value of their data.
Mining data effectively from large-scale databases places many demands on researchers and developers and poses enormous challenges to them. Association rules are a main data mining method; they reflect the interdependence and relations between one thing and others. An association rule is a rule whose support and confidence in the data set meet given thresholds. The classical association rule algorithm is Apriori, which is named for its use of prior knowledge about the properties of frequent itemsets. A fundamental property used by Apriori is that every subset of a frequent itemset must itself be frequent. Combined with domain knowledge, association rules can be used directly to analyse causal relationships in the data, to study them further, and to make predictions. Finding associations in large amounts of data is extremely useful in fields such as market positioning, decision analysis, and business management.

Chapter 3 introduces the project, which applies data mining techniques to analyse the correlated data. The data model is built on association rule theory, the database is built from the height-view questionnaire, and together they form a software system with a user interface. The system comprises modules for data entry, data maintenance, data browsing, data querying, statistical analysis, and association rule application. The association rule application module is the key point and main difficulty of the design, and it is the main research work of this thesis. It is aimed at the researchers of the "Jilin Province Urban and Rural Residents' Views on Height" study. By applying association rule principles to the data in the height-view questionnaire, it searches for correlated and latent factors and provides data support for the researchers.

Chapter 4 proposes a new method for analysing the internal links in the height-view questionnaire data. It processes the traffic accident data, and extracts and analyses regularities in all respects using association rules from data mining. To this end, the thesis improves the classical single-dimensional, single-level Apriori algorithm into a new multi-dimensional, multi-type Apriori algorithm based on association rules. It organizes a large amount of complicated and disordered traffic accident data into information and analyses the various complicated relevant factors. It gives a definition of the property of road traffic accidents (PRTA) and adopts a star-type, fully connected data model. The first step of the algorithm is to find the frequent itemsets that meet the minimum support. The improvement lies in the way candidate itemsets are generated, in a function named gen_candidate(). The second step uses the frequent itemsets produced above to derive the desired rules. The algorithm also exploits an important property: "The support of any itemset is always less than or equal to that of its subsets." This narrows the search range and so improves the efficiency of the algorithm.

The thesis then adopts a research approach that closely combines theoretical research, algorithm experiments, and practical application. The system environment is configured for development, and the improved multi-dimensional Apriori algorithm is applied. The system is tested on 2,000 groups of data provided by the database of the Jilin Province urban and rural residents' height-view questionnaire.
It extracts and analyses the association rules among the data, producing a large number of strong association rules that meet the support and confidence thresholds and that support the views of the researchers studying views on height.

Problems that can be further investigated:

The method used in this thesis to extract and analyse association rules from the height-view data of urban and rural residents of Jilin Province demonstrates the practical value of association rule extraction. The extraction of multi-dimensional association rules used in the decision and analysis part can be further optimized to improve speed and efficiency.
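
The two-step procedure the abstract describes (find the itemsets that meet the minimum support, then derive rules that meet the minimum confidence) can be illustrated with a minimal Python sketch of the classical Apriori algorithm. This is not the thesis's improved gen_candidate() function, which the abstract does not specify. The function names, the sample records, and the attribute names (residence, education, ideal_height) are all hypothetical and chosen only for illustration. Each record is encoded as a set of (attribute, value) pairs, which is one common way to handle multi-dimensional data.

```python
from itertools import combinations


def gen_candidates(prev_frequent, k):
    """Join frequent (k-1)-itemsets into k-itemsets and prune any candidate
    with an infrequent (k-1)-subset (every subset of a frequent itemset
    must itself be frequent)."""
    prev = set(prev_frequent)
    candidates = set()
    for a in prev:
        for b in prev:
            union = a | b
            if len(union) == k and all(
                frozenset(sub) in prev for sub in combinations(union, k - 1)
            ):
                candidates.add(union)
    return candidates


def apriori(transactions, min_support):
    """Step 1: return {itemset: support} for all itemsets meeting min_support."""
    n = len(transactions)

    def support(itemset):
        return sum(itemset <= t for t in transactions) / n

    def keep_frequent(itemsets):
        scored = {c: support(c) for c in itemsets}
        return {c: s for c, s in scored.items() if s >= min_support}

    level = keep_frequent({frozenset([i]) for t in transactions for i in t})
    frequent = {}
    k = 2
    while level:
        frequent.update(level)
        level = keep_frequent(gen_candidates(level, k))
        k += 1
    return frequent


def rules(frequent, min_confidence):
    """Step 2: return (lhs, rhs, support, confidence) for rules lhs -> rhs
    whose confidence meets min_confidence."""
    found = []
    for itemset, sup in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in map(frozenset, combinations(itemset, r)):
                # lhs is a subset of a frequent itemset, so it is in `frequent`.
                conf = sup / frequent[lhs]
                if conf >= min_confidence:
                    found.append((lhs, itemset - lhs, sup, conf))
    return found


# Hypothetical questionnaire records (not data from the thesis).
records = [
    {"residence": "urban", "education": "college", "ideal_height": "tall"},
    {"residence": "urban", "education": "college", "ideal_height": "tall"},
    {"residence": "rural", "education": "primary", "ideal_height": "medium"},
    {"residence": "urban", "education": "primary", "ideal_height": "medium"},
]
# Multi-dimensional encoding: each item is an (attribute, value) pair.
transactions = [frozenset(r.items()) for r in records]

frequent = apriori(transactions, min_support=0.5)
strong_rules = rules(frequent, min_confidence=0.9)
```

In this toy data the rule education=college -> ideal_height=tall holds in every record that contains the left-hand side, so it passes the confidence threshold, while residence=rural is dropped in the first pass because it appears in only one of the four records.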
Keywords/Search Tags: Application