Basic Theory Of Chinese Text Mining And Its Applications

Posted on: 2015-10-25
Degree: Master
Type: Thesis
Country: China
Candidate: D Kang
GTID: 2285330428499642
Subject: Applied statistics
Abstract/Summary:
Text mining has broad application prospects. Compared with text in Western languages, Chinese text has distinctive characteristics, so this paper takes Chinese text as its object of study. It first reviews the background and development of text mining, then presents its concepts and procedures, with particular attention to feature extraction, dimensionality reduction, and classification algorithms. R packages for Chinese text mining and CHQ's multi-class text classification system are introduced, and that system is applied to classify documents.

The paper focuses on using widely used open-source tools to build one's own Chinese text mining system, and describes the exploration of such a system following the Chinese text mining workflow. Words are segmented with the LTP system, unstructured text is transformed into structured data using Weka's StringToWordVector filter, and finally LibSVM is used to train a Chinese text classification model, which is then used for prediction.
Keywords/Search Tags: text mining, text classification, Chinese text
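
The vectorization step named in the abstract (segmented text turned into structured feature vectors and then handed to LibSVM) can be illustrated with a minimal pure-Python sketch. This is not the thesis's code and does not call LTP, Weka, or LibSVM: the two pre-segmented documents, their labels, and the helper names `build_vocabulary` and `to_libsvm_line` are hypothetical, and raw term counts are assumed as feature values, standing in for whatever weighting the StringToWordVector filter was configured with. The output lines follow LibSVM's sparse text format, `label index:value ...`, with indices in ascending order.

```python
from collections import Counter


def build_vocabulary(docs):
    """Assign each distinct token a 1-based feature index, in order of first appearance."""
    vocab = {}
    for tokens in docs:
        for tok in tokens:
            if tok not in vocab:
                vocab[tok] = len(vocab) + 1
    return vocab


def to_libsvm_line(label, tokens, vocab):
    """Encode one segmented document as a LibSVM sparse line of term counts."""
    counts = Counter(tok for tok in tokens if tok in vocab)
    # LibSVM expects feature indices in ascending order.
    pairs = sorted((vocab[tok], n) for tok, n in counts.items())
    return " ".join([str(label)] + [f"{idx}:{n}" for idx, n in pairs])


# Hypothetical documents, already split into words (the role LTP plays in the thesis).
docs = [
    ["文本", "挖掘", "应用", "文本"],
    ["中文", "文本", "分类"],
]
labels = [1, 2]  # hypothetical class labels

vocab = build_vocabulary(docs)
lines = [to_libsvm_line(y, d, vocab) for y, d in zip(labels, docs)]
for line in lines:
    print(line)
```

Lines in this form could be written to a file and passed to a LibSVM trainer. In practice the vocabulary would be built from the training set only and reused unchanged when encoding documents for prediction.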