Research Of Automatic Web Page Categorization And Cluster Based On Web Mining Technology

Posted on:2005-05-29

Degree:Master

Type:Thesis

Country:China

Candidate:Z L Xie

GTID:2168360122987408

Subject:Computer applications

Abstract/Summary:

Text classification and clustering are two important tasks in information processing. Traditional classification and clustering algorithms target plain-text files, but with the growth of the Internet, semi-structured web data has become the main object of information processing, and the algorithms have had to evolve accordingly. This thesis focuses on how to achieve high-precision classification and clustering by combining web mining technology with existing techniques. Its premise is that a page's position in the site topology reflects the site manager's view of the page's content and category, and that this information is very helpful for classification and clustering. We extract the hierarchical category information of pages through web content mining and web structure mining, and use this information to classify and cluster the pages.
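
The abstract does not specify the features or algorithms used, so the following is only a minimal illustrative sketch of the general idea, not the thesis's method. It assumes that the directory names on a page's URL path can stand in for its position in the site topology, and that anchor text from incoming links is available; the function names, the `structure_weight` parameter, and the example.com URLs are all hypothetical.

```python
import math
import re
from collections import Counter
from urllib.parse import urlparse

TOKEN = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return TOKEN.findall(text.lower())


def path_tokens(url):
    """Tokens from the directory names on the URL path.

    Assumption: the directory structure mirrors the site hierarchy.
    """
    segments = [s for s in urlparse(url).path.split("/") if s]
    # Drop the last segment when it looks like a file name (e.g. index.html).
    if segments and "." in segments[-1]:
        segments = segments[:-1]
    tokens = []
    for segment in segments:
        tokens.extend(tokenize(segment))
    return tokens


def page_vector(url, body, anchor_texts, structure_weight=2.0):
    """Bag-of-words vector mixing body text with hierarchy and anchor features.

    structure_weight is a hypothetical knob controlling how strongly
    site-position and anchor-text features count relative to body words.
    """
    vec = Counter(tokenize(body))
    for tok in path_tokens(url):
        vec["path:" + tok] += structure_weight
    for text in anchor_texts:
        for tok in tokenize(text):
            vec["anchor:" + tok] += structure_weight
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Two pages in the same section with different body text, and one page
# elsewhere on the site that shares body text with the first.
a = page_vector("http://example.com/sports/football/a.html",
                "match report", ["Football news"])
b = page_vector("http://example.com/sports/football/b.html",
                "league table", ["Football news"])
c = page_vector("http://example.com/finance/stocks/c.html",
                "match report", ["Stock prices"])

same_section = cosine(a, b)
other_section = cosine(a, c)
```

With these inputs, the two pages in the same section score as more similar than the pair that shares only body words, which is the kind of signal the abstract argues site position can contribute. A vector like this could then be passed to any standard classifier or clustering routine.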

Keywords/Search Tags:

Text Classification, Text Clustering, Web Mining, Anchor Text

Related items

1	Research On Text Classification Of Web Text Mining
2	The Research & Realization On The Key Techniques Of Text Mining
3	Text Emotional Classification Based On Text Mining
4	Research On Key Problems In Text Mining Based On Cloud Method
5	Research On News Classification And Clustering Based On Text Mining
6	Research And Implementation Of The Text Cluster Based On Text Similarity Calculation
7	Study Of The Multi-class Text Classification Based-on Svm
8	Study Of The Multi-Class Text Classification Based-On SVM
9	Research On Web Text Mining
10	Research On Several Models In Text Classification And Clustering