Font Size: a A A

Research On Text Categorization Algorithm For Science And Technology Text Based On Subject Conceptual Tree

Posted on:2007-07-19Degree:MasterType:Thesis
Country:ChinaCandidate:H Z ZhangFull Text:PDF
GTID:2178360182482710Subject:Computer applications
Abstract/Summary:PDF Full Text Request
As a powerful tool of organizing and managing data, text categorization has been a key technology of great practical worth. This is because text categorization can solve the problem of information disorder to great content.The research situation about text categorization in the world is presented first in this paper. Key technologies and common methods of text categorization at present are introduced. Then a new knowledge expression method, conceptual network theory, is proposed. A text categorization algorithm for science and technology text based on subject conceptual tree is put forward in this paper. The text is categorized according to the association degree between the concept nodes of the subject conceptual tree and the unknown text. The semantic information of the unknown text is taken into account in the calculation of the association degree, and the algorithm of association degree calculation is proposed.A text categorization system is built using the algorithm. The validity of the algorithm is tested through experiment.
Keywords/Search Tags:text categorization, subject conceptual tree, categorization algorithm
PDF Full Text Request
Related items