Font Size: a A A

Cloud Computing Research And Application Of Filtering Spam Messages Based On Bayesian Classification

Posted on:2011-02-03Degree:MasterType:Thesis
Country:ChinaCandidate:J ZhuFull Text:PDF
GTID:2208360308467175Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
The short-message has become an important communication tool gradaully for its mobility, convenience and low price. The amount of short-message is increasing geometrically driven by the increasing user of mobile phone. The problem of junk message has become more severe. The flood of junk message has not only greatly disturbed people`s life and also endangered public security and social stability. Therefore, the research of accurate and intelligent filter of junk message is of great significance. The research of existing filtration methods indicates that their implement has some shortcoming.The filtration methods based on black and white list are too simple and brutal. Although, the accuracy of content-based filtration has been improved greatly, their complexity of algorithm usually is cause of operator service network jam.The main problem of content-based filtation is lack of computing power rather the methods incorret. To overcome the shortcomings of content-based filtration, the essay studied cloud computing in detail which has a rapid development in last two years. The research indicates that the cloud computing technology has a great advantage in scalability, reliability, cost and other aspects. In particular, the scale of computing power can be made of infinite size in low cost relied on its high-expansion of scale. So the cloud computing is a good platform.Based on this foundation, the essay conducted a careful analysis of algorithm principle of content-based filtation and found that almost all the algorithm of content-based filtation currently used is based on Bayes classification theory. After a detailed study and relevant experiment, found that the content-based filter can be implemented by relying on the cloud computing platform and MapReduce programming model.The main work includes:(1) Analysis the problem of the existing junk message filtering technology and the implement. (2) Analysis the applications of cloud computing technology. Hadoop open-source implementations of the MapReduce programming model has been studied in-depth.(3) The essay conducted a careful analysis of principle of Bayes classification theory and its algorithm. The junk message filter based on Bayes classification theory and cloud computing is introduced in this essay.(4) The junk message filter based on Bayes classification theory and cloud computing has been implemented on the Hadoop cloud computing platform.
Keywords/Search Tags:Junk Message, Cloud Computing, Bayes Classification, MapReduce
PDF Full Text Request
Related items