Font Size: a A A

Research Of Automatic Web Page Categorization And Cluster Based On Web Mining Technology

Posted on:2005-05-29Degree:MasterType:Thesis
Country:ChinaCandidate:Z L XieFull Text:PDF
GTID:2168360122987408Subject:Computer applications
Abstract/Summary:PDF Full Text Request
Text classification and cluster are two important missions of information processing. Traditional algorithms of classification and cluster aim at pure text files, but with the development of Internet, half-struct web data become the main objects of information processing, and it makes evolution to the algorithms of classification and cluster.This paper focuses on how to achieve high precision of classification and cluster using web-mining technology compounded with existing technology. The stand of this paper is that the page's positon in the site topology shows the manager's viewpoint of content and class of the page and this information is very helpful to classification and cluster. We extract the hiberarchy class infomation of pages through web content mining and web structure mining, and use this infomation to classify and cluster the pages.
Keywords/Search Tags:Text Classification, Text Cluster, Web Mining, Anchor Text
PDF Full Text Request
Related items