Font Size: a A A

Research On Topic Search Engine Based On Shark Optimization Algorithm

Posted on:2019-07-15Degree:MasterType:Thesis
Country:ChinaCandidate:Q LengFull Text:PDF
GTID:2428330545482435Subject:Computer technology
Abstract/Summary:PDF Full Text Request
As high and new technology continues to promote the development of industry types and structures toward specialization and refinement,how to efficiently obtain professional and practical information has become one of the issues that people are generally concerned about.Because the search accuracy of the traditional search engine is continuously declining,a boom in researching on search engines based topic has been created.This paper is based on research on the related theoretical foundations on search engine.It briefly introduces the features of the search engine,basic architecture,key technologies and the principles of the topic web crawler,structure,and working principles,etc.Then it analyzes the common three thematic crawling algorithms in detail.and compares and analyzes the two kinds of webpage sorting algorithms based on link structure,such as Page Rank and HITS.Due to the fact that using a single assessment method can not achieve effectively predicting the actual value of link addresses,this paper proposes a search strategy scheme based on a combination of content evaluation and web link structure,and based on the basic idea of topic search engine,a new topic crawling model is proposed and a new multi-threaded collaborative topic crawler is designed.After discussing and designing the search engine basic program and crawler system,aiming to the disadvantages of the shark algorithm,such as links to related pages behind the unrelated web pages,too little difference in priority,and long URL queues.The shark algorithm is designed and implemented in the paper.Finally,according to the relevance calculation of webpage content links,the steps to implement the webpage search algorithm for multimedia themes is described and through the simulation experiment,the traditional Fish,shark Search algorithm and optimized shark algorithm were compared.Compared with the general search engine,the theme search engine is a variant,which optimizes some functions of the general search engine in its basic structure and technology.For professional users to more efficiently and accurately obtain the required professional domain information,a web crawler is designed specifically for the topic search engine.The basic idea of crawling web pages is to search a web page for a given topic and filter web pages that are irrelevant to the topic.Under the theme related pages.Shark-Search algorithm is a classical topic Search algorithm,aiming at the characteristics of multimedia distribution in the web page,width of Shark-Search in the Search,link similarity judgment and to crawl links made correspondingimprovement on selection strategy,at the same time to take the first Search,and then judge the Search process,improve the Search efficiency of the subject of multimedia web page.
Keywords/Search Tags:Topical Search Engine, Web Crawler, Web page sorting, Shark Search
PDF Full Text Request
Related items