
Design And Implementation Of General Data Mining System Platform

Posted on: 2012-12-06
Degree: Master
Type: Thesis
Country: China
Candidate: W Z Guo
Full Text: PDF
GTID: 2178330335477731
Subject: Computer application technology
Abstract/Summary:
Data mining is the process of extracting previously unknown, potentially useful information and knowledge from large volumes of incomplete, noisy, fuzzy and random data. With the rapid development of computer networks and database technology, and especially the wide adoption of database management systems, data is now produced on a very large scale, resulting in an information explosion. It remains difficult, however, for people to find, understand and use previously unknown, comprehensible and actionable information in large data sets. Data mining system tools are a powerful way out of this difficulty, because they can help people draw valuable knowledge and information from large data sets intelligently and automatically. This thesis focuses on the Second Data Mining System Platform, which is applicable to multiple domains, is built on a database, and includes many data mining algorithms. The work completed so far is as follows:

(1) Starting with an introduction to data mining theory and data mining tools, the thesis analyzes step by step the main functions and characteristics of this data mining system platform. This lays the theoretical foundation for the design and implementation of the platform.

(2) The thesis argues that data preprocessing is essential for a general data mining system platform, and discusses the importance, content and methods of data preprocessing in detail.

(3) Classification algorithms, clustering algorithms, association analysis and linear regression analysis are discussed. The thesis implements several data mining algorithms: ID3, C4.5, Naive Bayesian classification, the shortest distance method, the longest distance method, DBSCAN, K-means, K-modes, Apriori, linear regression analysis and others. It discusses in detail the basic idea and implementation process of ID3, Naive Bayesian classification, Apriori, linear regression analysis and others.

(4) Based on the study of the system framework, the system is divided into three functional modules: data processing, the data mining engine, and visualization.
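As an illustration of the basic idea behind ID3, one of the algorithms named in the abstract, the following is a minimal sketch of attribute selection by information gain. It is not taken from the thesis: the function names (`entropy`, `information_gain`, `best_attribute`) and the toy data set are assumptions made for this example only.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Entropy reduction obtained by splitting the rows on one attribute."""
    total = len(labels)
    # Group the class labels by the value each row takes for the attribute.
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attribute], []).append(label)
    # Weighted average entropy of the partitions after the split.
    remainder = sum(len(part) / total * entropy(part) for part in partitions.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels, attributes):
    """ID3 splits on the attribute with the highest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, labels, a))


# Hypothetical toy data: "outlook" determines the class, "windy" does not.
rows = [
    {"outlook": "sunny", "windy": "no"},
    {"outlook": "sunny", "windy": "yes"},
    {"outlook": "rainy", "windy": "no"},
    {"outlook": "rainy", "windy": "yes"},
]
labels = ["play", "play", "stay", "stay"]

print(best_attribute(rows, labels, ["outlook", "windy"]))
```

A full ID3 implementation would apply this selection recursively to each partition until the labels in a partition are all the same or no attributes remain.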
Keywords/Search Tags: Data Mining, Classification, Clustering, Association, Linear regression analysis