Font Size: a A A

Design And Experiment Of Search Engine In Station Based On Lucene.net

Posted on:2019-05-04Degree:MasterType:Thesis
Country:ChinaCandidate:Z J LiFull Text:PDF
GTID:2428330566985720Subject:Engineering
Abstract/Summary:PDF Full Text Request
At present,the Internet is developing rapidly,all kinds of network applications are increasing rapidly,and network information has exploded.In order to improve the efficiency of production and the convenience of life,but it also makes us have to face an important problem,in the face of massive information,how to quickly locate the information that we are interested in.Indeed,there are many portal sites and search engines like Baidu and Bing,which help people to retrieve information on the current network,but these do not fully meet the daily needs of the users.Especially for some local area network,campus network,enterprises and institutions,large enterprise parks.These organizations often lack a unified management because they are involved in many departments,large amount of information,high information privacy,and often lack a unified management of information.So they often need a good information search entry.When the information is accumulated over time,employees,students,or foreign visitors need to be in these sites.Search for relevant information by page,resulting in great invariance.Therefore,I have elaborated on this specific issue,and put forward solutions.The main purpose of this paper is to study and design a site search engine for large and medium-sized enterprises and institutions,mainly for closed or semi closed single or several web sites.The basic architecture of search engine is introduced from zero to the key technologies,such as the principle and implementation of crawler,how to construct index directory,the main method of word segmentation,the construction of search and sort model,and so on,then use the C# language,and combine the relational database technology,the search engine framework Lucene.net,the front page technology.Build a complete site search engine.The main contents of the development include two modules,one is the data collection module,which mainly includes the functions of network crawler,information cleaning,data warehousing,and index construction.The other is the search module,which uses the index established by the collection module to implement the search function,and add the functions of search cache and sensitive word filtering.The whole development process will be carried out in accordance with the requirements of the software engineering development.First,the requirements analysis is carried out.In accordance with the UML standard,a large number of diagrams are used to introduce the module function,and then the whole system is implemented gradually,and the system is tested at the end of the article.
Keywords/Search Tags:C#, Search Engine, Lucene.net
PDF Full Text Request
Related items