Font Size: a A A

The Research On Web-based Text Mining

Posted on:2005-04-05Degree:MasterType:Thesis
Country:ChinaCandidate:Y LiuFull Text:PDF
GTID:2168360125471041Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
It's a real challenge for us to make the Internet easier to use. The information in the Internet is in short of organization, and full of a mass of pages, and on the other side, people want to obtain the information quickly and accurately. With the flood of information on the Web, Web mining is a new research issue which draws great interest from many communities. Currently, there is no agreement about Web mining yet. It needs more discussion among scientists in order to define what it is exactly. Meanwhile, the development of Web mining system will promote its research in turn.This paper discusses the principle of Web mining, and focuses on Web text mining architecture and technique. The paper includes following contents: Firstly, discusses the principle of Web mining; Secondly, on the base of the study of the Web text mining technique, introduce an architecture and function of Web text mining system; Thirdly, discusses the design philosophy of data acquisition based on world wild web, and studies the preprocessing of the Web data; Fourthly, in order to apply the Genetic Algorithm to the theory put forward by us, analyses the Genetic Algorithm particularly; At last, on the base of the improvement of conventional Genetic Algorithm, we put forward a document feature extract algorithm. The result of experiment shows, the approach is feasible.
Keywords/Search Tags:Web Mining, Text Mining, Feature Extract, Genetic Algorithm
PDF Full Text Request
Related items