Font Size: a A A

The Research Of Search Engine Technology Based On Web Mining

Posted on:2006-05-03Degree:MasterType:Thesis
Country:ChinaCandidate:X R HuFull Text:PDF
GTID:2178360182966996Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The fast-growing Internet is the largest information depository in the world and plays an important role in the information spreading. The World Wide Web (WWW) develops quickly. Since Internet is an open system, which has no unified management and structure, it is difficult to search information from the Internet. How to get useful information from the large depository quickly and correctly is a difficult problem people faced. Everyone hopes there is a new tool to help them retrieve the information more easily.The technology searching information on the internet combines the technology of modern information retrieval and the WWW technology. It aims developing an intelligent search engine, which finds information on the Internet automatically, indexes a structure Database and serves the Internet user.As a result, search engine's further development should be accelerated by the assistance of various newly and efficient technique. As the newest research knowledge mining direction, Web Mining is the high level information processing and it has many affinities with the search engine. So it often used for reference by Search Engine technique. The application of Web Mining will benefit the progress of search engine, the information processing capability of Search Engine is enhanced, it makes web information retrieval grow into a new high level.In this thesis, we particularly focus on the analysis and discussion about WWW Search Engine from development and research angle. And we introduce a medium-sized enterprise oriented intelligent search engine prototype—WMSE which based on Web Mining. In the point of view of researching and development, it optimizes and sorts the pre-return search result base on the theory of web structure mining. It can provide more accurate results for the user, and satisfy user's requirement much better.In the following chapter, we describe the subsystems of Search Engine according to its architecture. They are Crawling subsystem, Indexing subsystem, Searching subsystem, User interface subsystem. In the meantime, special emphases are placed on the related key algorithms and technologies used in Search Engine system.In the end, we briefly introduce the performance of the system and conclude with some future works for the system.
Keywords/Search Tags:Web Mining, Search Engine, Information Retrieval, Pagerank
PDF Full Text Request
Related items