Font Size: a A A

Design And Implementation Of Vertical Search Engine On Food

Posted on:2019-07-23Degree:MasterType:Thesis
Country:ChinaCandidate:M H DuFull Text:PDF
GTID:2428330548961244Subject:Engineering
Abstract/Summary:PDF Full Text Request
These years with the development of Internet technology,the information on the internet is becoming more and more complex and the amount of information is increasing.Facing the huge amount of data on the network,it becomes difficult to find the information you want from it.In order to solve the urgent need of information retrieval of the Internet user,the search engine came into being.The search engines that we always use,such as Google,Baidu and so on,are called universal search engines,and users can query the key information from the network by entering keywords.But the search results of general search engines are lack of pertinence,more search results but less useful information.In order to provide better and more efficient retrieval experience to users,vertical search engines have come into being.Vertical search engine is a search engine that performs search function for users with specific requirements.It has higher query efficiency and better query effect.Vertical search engine is a search engine specially targeted at a certain field.The results are all included in the field of search keywords,which saves a lot of time of search users for filtering irrelevant information in general search engine.Nowadays,the vertical search engines on the Internet are diverse,and there are much more vertical search engines on music,pictures,news and shopping,but there are less vertical search engines on food.With the improvement of people's living standards,more and more people have no idea about what to eat,where to eat,and how to eat healthfully.A vertical search engine on food can help people to solve these problems.A vertical search engine on food can help users to screen information which is not related to food on the Internet.Although Users enter a word which seems unrelated to food,they can also retrieve information such as food recommendations,recipes and restaurants about this word.If you search about the word “winter”,you will find all kinds of food information related to winter.This article will introduce the whole process of research and implementation of a vertical search engine in the field of food.It includes the acquisition of network data,the processing of data acquisition,the study of the index,and the design and implementation of the vertical search engine system.In the research of data acquisition and processing,we will use Heritrix to create web crawlers to crawl data from the Internet.Heritrix is an open source Web web crawler developed by Java,and users can use it to grab the desired resources[1] from the network.Then,we will use regular expression and HTMLParse to extract the content of web pages,and the Chinese segmentation is done through ICTCLAS and IKAnalyzer.Then we need to study and deal with data such as weight removal,word frequency statistics and sorting.In the creation of index,it is necessary to study Lucene and Solr,and the Solr is the further encapsulation of Lucene on the basis of Lucene,which can improve the efficiency of the index.In the final part of the creation of system,we will use the framework of Spring+Struts+Hibernate,and in the database,the open source relational database MySQL will be used.
Keywords/Search Tags:vertical search engines, food, Heritrix, Lucene
PDF Full Text Request
Related items