Font Size: a A A

The Research And Implementation Of Full-text Search Engine Based On Solr

Posted on:2015-07-08Degree:MasterType:Thesis
Country:ChinaCandidate:L Y GaoFull Text:PDF
GTID:2308330473952835Subject:Software engineering
Abstract/Summary:PDF Full Text Request
At present,there are many search engines on the market,they have high technical threshold. Existing search engine technology is not shared, This makes it hard for search engine development. Solr is currently open source server enterprise search engine; it has efficient, independent features, it has received widespread attention, At present, Marx college website of Zhejiang Sci-Tech University not embed search function, this makes website information find no way to start.This thesis will build search engine for the Marx college website of Zhejiang Sci-Tech University based on the Solr technology.In this thesis, the development of the full text search engine based on Solr, The full-text search system is mainly implemented for the Marx college Website of Zhejiang Sci-Tech University.The web pages are grabed on the basis of Heritrix framework, at the same time, grabbing achievements are downloaded and stored into local. After that, the content of the web pages are extracted, the contents are stored into the database. The stored data is imported to Solr, the index of relevant content are established.The data is retrieved On the basis of index programming results, and the retrieval results are presented to the user.First, the development background of the search engine is analyzed, the research and development of search engine urgency are cleared; at the same time, related technologies of search engine are researched, the development and technical characteristics of search engine are cleared. Then related technologies for the search engine system was analyzed, such as web crawler technology and so on, these technologies will be as a foundation for the follow-up research and development of search engine; at the same time, the tools of search engine development are introducted. Then the needs analysis of full text search engine are completed, the search engine architecture are carried out, the design of each function module of search engine is completed based on this architecture, the design of the database, including data analysis and data table design are accomplished. Then development environment is introduced, including hardware, software and the development kit; code realization of full text search engine is implemented. Finally, the test overview of full-text search engine is finished, the objectives, principles of the search engine tests and test environment are cleared about. at the same time, the function test and the performance test(including retrieval rate, Chinese word segmentation) are completed.The entire test showed that the full-text search engine can satisfy the requirements of the college. It can achieve full text search. The search engine interface is clear which has strong operability.
Keywords/Search Tags:Search Engines, Full-text Search, Solr
PDF Full Text Request
Related items