Data Mining Applications In The Census Data | | Posted on:2005-04-16 | Degree:Master | Type:Thesis | | Country:China | Candidate:S Bin | Full Text:PDF | | GTID:2208360122497927 | Subject:Computer software and theory | | Abstract/Summary: | PDF Full Text Request | | With the development of Database technology and the comprehensive application of Database Management system, a greate amount of data is accumulated in various areas and there is usually much information behind the accumulated data. How to detect useful knowledge and mine the potential value from the data in time is an important research subject in the field of Information Technology.This research subject gives birth to thetechnology of data mining.Data mining, means a process of nontrivial extraction of implicit, previously unknown and potentially useful information from data in databases or datawarehouses. It involves such subject areas as Database, Artificial intelligence, Machine learning and Statistics.Census is a kind of scientific method that almost every country in the world uses to collect population data. It is the most primary source to offer the countrywide basic population data. Census is an important investigate of the situation of a country and the national power. It can be used to plan people's substance life and civilization life as a whole. It also can afford trustiness gists to realize the continuable development of population, economy and resource entironment. Most of the national policies are constituted straight based on the population status.Data mining is a subject to extract the pattern and the relation in the data. It can find the orderliness that hide behind the volume data to offer corresponding information for making manage decisions. It has very important meanings to use data mining in census, both in science and market.In this paper, we first research the concept hierarchy and the classification; then design a system of census data analysis based on these data mining techniques; finally,we use the census data analysis system to analyze the data in chengyang and laixi, and evaluate the results.First, we described the background of research and pointed out its significance. The domestic and foreign situation of data mining research was analyzed from theoretical and applying aspects. After analyzing the general progress of knowledge discovery we gave a classic framework of a data mining system, analyzed main function of every module and expatiate on the technique of data mining.Second, we expatiate the important significance of using data mining in census data, intruduce the concept hierarchy and the classification: reviewing their investigationactuality, recommending the interrelated algorithms, such as the dynamic concept hierarchy adjustment algorithm, CART algorithms and PUBLIC algorithms.Third, we build up the census data analysis system, realize the interrelated algorithms, and use this system to analyze the fifths census data in chengyang and laixi, and evaluate the results.Finally, all the results are summarized, and the study prospect is discussed. | | Keywords/Search Tags: | Data Mining, Census, Concept Hierarchy, Classification, Decision Tree, CART, PUBLIC, Gini index, MDL principle | PDF Full Text Request | Related items |
| |
|