Research Of A Heuristic Bayesian Classification Algorithm And Its Application In Railway Freight Customer Segmentation

Posted on: 2009-06-30
Degree: Master
Type: Thesis
Country: China
Candidate: Y S Guo
Full Text: PDF
GTID: 2178360242989461
Subject: Systems analysis and integration
Abstract/Summary:
Data mining is a technology for discovering underlying rules and extracting useful knowledge from data. In recent years it has attracted wide attention and become one of the hotspots of research in information systems and computer science.

As a classification method in data mining, a Bayesian network is a graphical model that expresses the probabilistic relationships among variables. It is one of the most effective models for reasoning with uncertain knowledge and plays an important role in the design and analysis of machine learning algorithms.

This thesis surveys the state of research on Bayesian networks, then analyzes the theoretical foundation of Bayesian classifiers and three classical Bayesian classification algorithms: the naive Bayes classifier, the Bayesian network classifier, and the TAN (tree-augmented naive Bayes) classifier. On this basis, a heuristic Bayesian classification algorithm is proposed that combines the merits of the K2 algorithm and the TAN classifier while avoiding their defects. The edge order is fixed while constructing the maximum weighted spanning tree in TAN; the node order is then fixed according to certain rules; finally, the K2 algorithm is used to construct the Bayesian network. The experimental results show that the network structure produced by this algorithm is more reasonable and that it achieves higher classification precision.

Given the broad application of data mining in CRM, a scheme for railway freight customer segmentation is also proposed: clustering and classification are used to mine the information hidden in the mass data of the railway waybill database. First, the historical freight data is analyzed with a clustering method; new customers are then classified with a Bayesian classifier according to the clustering result. This customer segmentation method could support the marketing department's decision-making and improve the CRM level of railway enterprises.

In addition, building on the in-depth research on Bayesian classification, Bayesian algorithm software is developed to meet the needs of railway freight customer segmentation; as a general-purpose data mining platform, it could also be applied in related fields.
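
The ordering step of the heuristic algorithm can be illustrated with a minimal sketch. K2 requires a node ordering as input, and the abstract says this ordering is derived from the TAN maximum weighted spanning tree "according to certain rules" without stating them. The code below therefore rests on assumptions that are not from the thesis: edge weights are the empirical conditional mutual information I(X_i; X_j | C), the tree is grown with Prim's algorithm from a chosen root, and nodes are ordered by when they join the tree. The function names `conditional_mutual_information` and `spanning_tree_order` are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log


def conditional_mutual_information(rows, labels, i, j):
    """Empirical I(X_i; X_j | C) for discrete attributes, in nats."""
    n = len(rows)
    n_c = Counter(labels)
    n_ic = Counter((r[i], c) for r, c in zip(rows, labels))
    n_jc = Counter((r[j], c) for r, c in zip(rows, labels))
    n_ijc = Counter((r[i], r[j], c) for r, c in zip(rows, labels))
    total = 0.0
    for (xi, xj, c), count in n_ijc.items():
        # P(xi, xj | c) / (P(xi | c) * P(xj | c)) expressed with raw counts
        ratio = count * n_c[c] / (n_ic[(xi, c)] * n_jc[(xj, c)])
        total += (count / n) * log(ratio)
    return total


def spanning_tree_order(rows, labels, root=0):
    """Grow a maximum weighted spanning tree with Prim's algorithm.

    Returns (order, edges): the order in which attributes join the tree
    and the edges in the order they were added. Using this join order as
    the K2 node ordering is an assumption made for illustration.
    """
    n_attrs = len(rows[0])
    weight = {}
    for i, j in combinations(range(n_attrs), 2):
        w = conditional_mutual_information(rows, labels, i, j)
        weight[(i, j)] = weight[(j, i)] = w

    order = [root]
    edges = []
    remaining = [a for a in range(n_attrs) if a != root]
    while remaining:
        # Pick the heaviest edge from the tree to an attribute outside it.
        u, v = max(
            ((u, v) for u in order for v in remaining),
            key=weight.__getitem__,
        )
        edges.append((u, v))
        order.append(v)
        remaining.remove(v)
    return order, edges
```

The returned `order` would then be passed to a K2 implementation as its node ordering, so that each node may only take parents that precede it. The K2 search itself is not shown here.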
Keywords/Search Tags: Data Mining, Bayesian Network, Heuristic Information, Railway Freight, Customer Segmentation