Font Size: a A A

Research Of Vertical Search Engine Based On XML

Posted on:2017-04-22Degree:MasterType:Thesis
Country:ChinaCandidate:H SunFull Text:PDF
GTID:2428330545455952Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the rapid development of computer and network technology in the world today.Internet has become the most efficient way to obtain information,Through the Internet to people from all walks of life,the Internet information sharing more and more people's attention.People in order to better and faster access to information on the Internet is also a search engine came into being.But now it is found that the general search engine query to the information is too much lack of targeted and accuracy,based on the development of a vertical search engine based on various professional fields.General search engines are based on the HTML Web format,HTML is mainly focused on display and the content of the processing is limited,so that greatly reduces the search engine query accuracy.With the launch of a new XML W3C extensible markup language,the query accuracy in a certain degree of improvement.XML's tags have both text information and structural information,so it can better show the meaning and content of the representative.According to this information,the search engine can be used to locate and search the target accurately,so that it can effectively narrow the search scope and improve the accuracy of query.Based on this,the paper presents the research of the vertical search engine based on XML document.Firstly,this paper introduces the development of search engine,and the principle of search engine,especially the common topic identification technology,Chinese word segmentation technology,web sort technology and information retrieval technology.This paper also analyzes the HTML technology and XML technology,and introduces the structure of the XML document.The research and analysis of the principle of the realization of the search engine based on XML.According to the principle of search engine and the technical characteristics of XML document,this paper designs a vertical search engine model based on XML documents,and realizes some modules.Focus on the implementation of the vertical search engine based on XML search engine of the crawler module,conversion module,XML parsing module,index module and query module.Finally,the performance index of the model is introduced,and the relevant experimental data are given,the index of the model is established,and the response time is analyzed.
Keywords/Search Tags:XML, vertical search engine, index, document structure, web crawler, word segmentation technology
PDF Full Text Request
Related items