The Research Of Web Information Search Technology Based On Meta-search

Posted on:2013-03-20

Degree:Master

Type:Thesis

Country:China

Candidate:C L Zhang

Full Text:PDF

GTID:2248330371485196

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

With the development and popularization of the Internet, the contents of internetinformation are increasing not only contains text, also includes image, audio, video and otherinformation style. Therefore, in information retrieval, it has become a hot research topic thathow to select and sort the information needed in a fast and accurate way.In the field of artificial intelligence, data mining technology is also known as knowledgediscovery. This is a way that we can find the same rules and display the rules through theanalysis of the massive amounts of existing data. Web search technology is a development ofdata mining technology on the Internet research field.The first way of Search Engine is artificial collection method (for example: Yahoo).It isbased on expertsâ€™ arrangement. This method is manually collecting, filtering and classifyingthe information from the Internet, and collecting the arranged results into the Website.However, considering the high cost of artificial maintenance and the wide range of userknowledge structure, this method canâ€™t meet various needs of the users. Therefore, along withthe development of data mining techniques, automatic search engine comes out. This searchengine includes a network robot programs to relate all the data together and grabs thecrawling results in order to get the data index. It provides a platform for information retrieval,users can use keywords to do queries through the platform.Search engine can be divided into three categories, they are text search engine,directories-search engine and Meta-search engine. Meta-search is a further extension of theWeb Search Engine. Users can select several search engine based on keywords on a userinteraction platform for the relevant retrieval operation. The feature of Meta-search is that it can use other search engines independently, implement information fusion on different searchengines and satisfy the need of effective information reorganization. Compared withtraditional search engine, the operation of a Meta-search engine can provide more accurateinformation.In this article, we elaborate the current research status and main principles of the Webinformation extraction technology systematically. At the same time, the key steps of the Webinformation extraction technology are introduced. Focusing on the processes and the keytechnologies of search engines, we make a further research about Meta-search.The main work in this article is listed as follows:(1) We make an introduction to the research background of Web information extractiontechnology, as well as the classification and the process of Web information extractiontechnology.(2) We also make an introduction to the Web information extraction model, HTML andDOM document object.(3) Struts, Spring and Hibernate frameworks of SSH framework are introduced in thisarticle, besides, we analyze the Website structure information.(4) We summarize the background, classification and the key of technologies of SearchEngines. We also design and implement a Meta-search engine based on AJAX technology,HTML Parser technology and so on.(5) We make a comparison of the search engine results.(6) We test the search engine program.The research in this paper is based on the original search engine technology. It is thefoundation to achieve better Meta-search and to develop better network information retrievaltools.

Keywords/Search Tags:

Meta-search, JSP, HTML Parser

PDF Full Text Request

Related items

1	The Technology Of Web Information Extraction Based On HTML Parser
2	Design And Implementation Of Search Engine Based On Lucene And HTML Parser
3	Based Web Image Search Engine Spiders System Design And Realization
4	Application Research On Image Search Based On Lucene
5	Search Engine System Inside Web Site Based On Lucene And Heritrix
6	Research And Implementation Of Vertical Search Engine Based On Characters Of Webpage Structure
7	Oriented Research And Realization Of The Digital Tv Set-top Box Embedded Browser
8	Design And Implement Of The Embedded HTML Parser Based On Automaton
9	The Research And Realization Of The Key Technology In The Meta Search Engine
10	Research On General Frame Model Of Vertical Search System Based On Meta Search