Font Size: a A A

Text Information Filtering Based On Reconstruction Of The Genetic Algorithm Mutation Operator

Posted on:2015-08-09Degree:MasterType:Thesis
Country:ChinaCandidate:W TangFull Text:PDF
GTID:2298330431482504Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the rapid development of Internet technology, networkinformation shows geometric multiplication trend. A large amount ofinformation brings convenience to people’s lives. At the same time, butalso led to a series of questions: It is difficult to screen and filter the spammessages and bad information. Information filtering technology(Information Filtering, abbreviated IF) which can shield the useless orunhealthy information and help customer to receive information quicklyand accurately is coming. Thus it can improve the efficiency and accuracyof the information search.On the basis of the proposed purity Gini index, the text studies thepreprocessing algorithm and proposes reconstruction mutation operator ofgenetic algorithm.. Combined the application of the purity Gini index inthe text message preprocessing, applying the genetic algorithm of thereconstruction of mutation operator in the text message filtering toimprove the accuracy of template user category in the text informationfiltering. The main achievements as follows:1. Proposed text preprocessing algorithm based on the purity of theGini indexThe preparatory work of a text message filter is preprocessing textmessage, the key is the text feature selection, the purpose of featureselection is to select the most representative of the document feature wordas a dimension of feature space, thus improving classifier accuracy. Baseon the disadvantages of the traditional Gini index, paper improvedtraditional Gini index on the text information pretreatment, and applied totext feature selection, reducing the space dimension of the original text,reducing the time complexity and improve the accuracy of the classifier.2. Proposing the reconstruction algorithm of Genetic mutationoperator and applying in text information filteringGenetic algorithm of reconstruction mutation operator balance thestatus of crossover and mutation operator, and then promoted the user’stemplates. The results of the comparative experiment in four categoriesfiltering precision show:indicating the genetic algorithm of reconstruction mutation operator can be well applied to the text messagefiltering.Finally, the article designed and implemented the Internet filteringsystem which is based on reconstruction mutation operate of geneticalgorithm, which can find the desired information accurately and quicklyin the massive information,which improved the accuracy and efficiencyof the Internet information filtering.
Keywords/Search Tags:information filtering, Purity Gini index, text preprocessing, reconstruction of the genetic algorithm mutation operator
PDF Full Text Request
Related items