Font Size: a A A

Information Filtering System Based On Data Mining Design And Implementation

Posted on:2007-08-05Degree:MasterType:Thesis
Country:ChinaCandidate:B Y SongFull Text:PDF
GTID:2208360182497258Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Internet has developed rapidly during the past years and it becomes the largestinformation net around the world. But when people share the convenience brought byInternet ,the harmful information and data comes along with the ones beneficial. Sothe question that how we can get the prime part from the abundant database comes tous and it's becoming one of the important domain in the net-tech research. The main task of the Info-Filtering system is to purify the data packet from theweb sites in the Internet. And most of the Web sites organize their data by thesemi-structured Html page. So this article focuses on the approaches to check the webdata from Internet. It contains the points below: 1.The introduction to the current technique of information filtering. First, it expound the development of information filtering and the pivotaltechnique. The article also lists some defects of the current information filteringsystem includes the bad veracity ,the low rate ,the bad agility and so on. 2. Bringing the new scheme forward. one of the most important task of the information filtering system is to establish acorrect, reliable and exact warehouse which contains the samples. This articlemakes a scheme which lead the system to renew itself and organize the samplesautomatically. So the system's working velocity grows faster. 3. Because the final purpose of the information filtering system is to decidewhether the data packet is good or bad so that to hold it up or not. This article bringsout a scheme to do this . It's detached into two parts includes ascertaining its topic andquality. To complete the first one ,we use Bayes Technique;to accomplish the secondone ,we use KNN model. 4. Establing the framework of the information system based on Data mining. Based on data mining and the technique of dealing with the data packet fromInternet, the article designed a framework of the information system which contains ahiberarchy and multi strategy. This article carries out a design to implement the framework above through sometechnique according to the Transmitting Layer and Application Layer. The experiments show us that the system is capable of filtering the data packetwhich has been received by the host machine. It can complete the task correctly andreliably.
Keywords/Search Tags:Information Filtering, Data Mining, WinsockSPI, Clustering Analysis
PDF Full Text Request
Related items