Font Size: a A A

Research And Implementation Of Communication Administration Oriented E-Government System And Web Page Classification

Posted on:2010-06-28Degree:MasterType:Thesis
Country:ChinaCandidate:J WangFull Text:PDF
GTID:2178360275970018Subject:Software engineering
Abstract/Summary:PDF Full Text Request
Communication administration is an important government function. In order to improve administrative level and make strategic decisions, it is necessary for administration section on communication to grasp the business distribution and operation status of commutation industry. Statistical analysis that is both timely and exact has become an important part in the business of communication administration. However, various operations result in complex statistic. As a result, E-Government system which also can automatically classify web pages is urgently required so as to provide convenience for statistical analysis.This thesis conducted requirement analysis for Communication Administration Oriented E-Government system, and then finished the designing and implementation for the system. The thesis also studied particularly on web page classification algorithm which can automatically classify those grabbed web pages. This thesis has finished the following researching tasks.1. System architecture and designing of communication administration oriented E-Government System. RDBMS, J2EE and directory service are used for the running environment of system. The E-Government system is based on B/S mode. System architecture includes portal, application layer, development supporting platform and data base. Web page classification module, as well as modules of authority control, approval control and online business, has been designed in the system.2. Kernel algorithm research of web page classification in communication administration oriented E-Government System. This paper presents CUCS(Combined UC and SVM), a new algorithm for web page classification. CUCS combines the advantages of UC(Unsupervised Clustering)and SVM(Support Vector Machine). The biggest characteristic of CUCS is'2-time pruning'. CUCS trained classifier by pruned training set rather than the initial training set, and then combined with UC. As a result, CUCS has made web page classification more exact with higher speed.3. System implementation and validation of communication administration oriented E-Government System. Java programming language is used to implement primary modules which include web page classification module, modules of authority control, approval control, license application business, backup business and annual exam business. The whole system running flow, which is from business handling to web page classification, has been validated in the thesis.
Keywords/Search Tags:E-Government System, Web Page Classification, Support Vector Machine, Web Mining, Clustering Algorithm
PDF Full Text Request
Related items