
Study Of Web Text Mining Based On Rough Set Theory

Posted on: 2010-07-21  Degree: Master  Type: Thesis
Country: China  Candidate: X L Zhao  Full Text: PDF
GTID: 2178360275499536  Subject: Computer software and theory
Abstract/Summary:
With the rapid development of Internet technology, information processing has become an indispensable tool for obtaining useful information. Text categorization is an important research field whose goal is to assign one or more suitable classes to a text by analyzing its content. This thesis studies web text classification based on Rough Set theory. First, it introduces the basics of web text mining and its detailed process, with emphasis on the selection of text feature terms and the discretization of feature values. Second, it introduces the basics of Rough Set theory and its application to knowledge discovery in detail, studies how text categorization rules can be extracted through Rough Set knowledge reduction, and discusses the feasibility and implementation of these rules. Finally, a simulation experiment is used to verify that text categorization based on Rough Set theory is feasible.
Keywords/Search Tags: Text categorization, Feature selection, Rough Set, Attribute reduction, Classification rule
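
The pipeline summarized in the abstract (discretized feature terms, attribute reduction, rule extraction) can be illustrated with a minimal sketch. This is not the thesis's implementation: the toy decision table, the term names `goal`, `stock`, and `match`, the class labels, and the greedy reduct heuristic are all assumptions chosen for illustration. It uses the standard rough set notions of indiscernibility classes and positive-region dependency.

```python
from collections import defaultdict

def partition(rows, attrs):
    """Group row indices into indiscernibility classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())

def dependency(rows, labels, attrs):
    """Share of rows in the positive region (classes whose rows all share one label)."""
    pos = sum(len(b) for b in partition(rows, attrs)
              if len({labels[i] for i in b}) == 1)
    return pos / len(rows)

def greedy_reduct(rows, labels, all_attrs):
    """Heuristic (assumed here): add the attribute that raises dependency the most
    until the subset is as discriminating as the full attribute set."""
    target = dependency(rows, labels, all_attrs)
    chosen = []
    while dependency(rows, labels, chosen) < target:
        best = max((a for a in all_attrs if a not in chosen),
                   key=lambda a: dependency(rows, labels, chosen + [a]))
        chosen.append(best)
    return chosen

def extract_rules(rows, labels, attrs):
    """Turn each consistent indiscernibility class into one rule: values -> label."""
    rules = {}
    for block in partition(rows, attrs):
        classes = {labels[i] for i in block}
        if len(classes) == 1:
            rules[tuple(rows[block[0]][a] for a in attrs)] = classes.pop()
    return rules

# Hypothetical decision table: 1 means the discretized term weight is "present".
ATTRS = ["goal", "stock", "match"]
ROWS = [
    {"goal": 1, "stock": 0, "match": 1},
    {"goal": 1, "stock": 0, "match": 0},
    {"goal": 0, "stock": 1, "match": 0},
    {"goal": 0, "stock": 1, "match": 1},
    {"goal": 0, "stock": 0, "match": 1},
    {"goal": 0, "stock": 1, "match": 0},
]
LABELS = ["sports", "sports", "finance", "finance", "sports", "finance"]

reduct = greedy_reduct(ROWS, LABELS, ATTRS)
rules = extract_rules(ROWS, LABELS, reduct)
print(reduct, rules)
```

On this toy table a single term separates the two classes, so the reduct contains one attribute and yields two rules. A greedy search of this kind finds a sufficient attribute subset but does not guarantee a minimal reduct in general.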