Font Size: a A A

The Research Of Web Information Search Technology Based On Meta-search

Posted on:2013-03-20Degree:MasterType:Thesis
Country:ChinaCandidate:C L ZhangFull Text:PDF
GTID:2248330371485196Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the development and popularization of the Internet, the contents of internetinformation are increasing not only contains text, also includes image, audio, video and otherinformation style. Therefore, in information retrieval, it has become a hot research topic thathow to select and sort the information needed in a fast and accurate way.In the field of artificial intelligence, data mining technology is also known as knowledgediscovery. This is a way that we can find the same rules and display the rules through theanalysis of the massive amounts of existing data. Web search technology is a development ofdata mining technology on the Internet research field.The first way of Search Engine is artificial collection method (for example: Yahoo).It isbased on experts’ arrangement. This method is manually collecting, filtering and classifyingthe information from the Internet, and collecting the arranged results into the Website.However, considering the high cost of artificial maintenance and the wide range of userknowledge structure, this method can’t meet various needs of the users. Therefore, along withthe development of data mining techniques, automatic search engine comes out. This searchengine includes a network robot programs to relate all the data together and grabs thecrawling results in order to get the data index. It provides a platform for information retrieval,users can use keywords to do queries through the platform.Search engine can be divided into three categories, they are text search engine,directories-search engine and Meta-search engine. Meta-search is a further extension of theWeb Search Engine. Users can select several search engine based on keywords on a userinteraction platform for the relevant retrieval operation. The feature of Meta-search is that it can use other search engines independently, implement information fusion on different searchengines and satisfy the need of effective information reorganization. Compared withtraditional search engine, the operation of a Meta-search engine can provide more accurateinformation.In this article, we elaborate the current research status and main principles of the Webinformation extraction technology systematically. At the same time, the key steps of the Webinformation extraction technology are introduced. Focusing on the processes and the keytechnologies of search engines, we make a further research about Meta-search.The main work in this article is listed as follows:(1) We make an introduction to the research background of Web information extractiontechnology, as well as the classification and the process of Web information extractiontechnology.(2) We also make an introduction to the Web information extraction model, HTML andDOM document object.(3) Struts, Spring and Hibernate frameworks of SSH framework are introduced in thisarticle, besides, we analyze the Website structure information.(4) We summarize the background, classification and the key of technologies of SearchEngines. We also design and implement a Meta-search engine based on AJAX technology,HTML Parser technology and so on.(5) We make a comparison of the search engine results.(6) We test the search engine program.The research in this paper is based on the original search engine technology. It is thefoundation to achieve better Meta-search and to develop better network information retrievaltools.
Keywords/Search Tags:Meta-search, JSP, HTML Parser
PDF Full Text Request
Related items