Font Size: a A A

Implementation Of News Classification System Based On The Naive Bayesian

Posted on:2013-03-08Degree:MasterType:Thesis
Country:ChinaCandidate:Y Y ZhouFull Text:PDF
GTID:2248330362965473Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The explosive growth of information in the past few decades has generated ademand for technologies to collect and archive vast amount of news in an efficientmanner, because it would be practically impossible to do so in a manual manner dueto the concerns about manpower, money and time needed.This dissertation presents a brief survey on news classification techniques. Thepros and cons of Bayes classification algorithm has been discussed, before theincremental learning algorithm based on Na ve Bayes classification is studiedtogether with the CHI method for feature extraction. A new approach to choose textapproach based on incremental learning is then presented, based on which aprototype system is developed in Java. This system covers the whole range of newsclassification, such as text preprocessing, feature extraction, incremental learning,classifier and performance appraisal. Its performance has been tested with raw datafrom Nanfang Daily news archive.
Keywords/Search Tags:News Classification, Naive Bayesian, Feature Selection, IncrementalLearning, Classification Model
PDF Full Text Request
Related items