Font Size: a A A

Design And Implementation Of Solr-based Search Engine

Posted on:2012-04-09Degree:MasterType:Thesis
Country:ChinaCandidate:X S WangFull Text:PDF
GTID:2178330335960202Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
With the rapid growth of information resources on the Internet, people are more and more concerned about how to get their potentially valuable information quickly and effectively from a large number of network information. As a result, the Internet search engine appears. It effectively solves the problem that people are hard to access to their interested information from Internet and is an effective tool. Nowadays research on search engine is one of the most popular areas of Internet technologies.As more in-depth study on search engine, its technology is constantly moving forward. Meanwhile, design and implementation of a search engine is a very hard word, because it involves many technical expertises. This makes the threshold of research and development of search engine high and restricts the popularity of it.In this paper, I have researched and implemented an instance of search engine. Firstly, the search engine's knowledge and principles are introduced. Secondly, through the research of Web crawling, we have made an expansion and custom of it based on the practical application and we make it running successfully. At the same time, using the information extraction, to filter out the useless and repetitive information which are used for appearance, structure and so on, we retrieve the information which the user may be interested and store them into the database. Again, via the studying of Solr and writing a custom model to interact with the user, a search engine System is complete. Finally, make a summary of the work, and analyze the current shortcoming of the system, which point out the direction and method of making it more effective and optimized in order to improve its performance.
Keywords/Search Tags:Search Engine, Solr, Heritrix, Web Crawler
PDF Full Text Request
Related items