The Design And Research Of Personalized Search Engine Based On Solr

Posted on: 2013-01-22
Degree: Master
Type: Thesis
Country: China
Candidate: J Sun
Full Text: PDF
GTID: 2218330374457371
Subject: Control Engineering
Abstract/Summary:
With the rapid development of the Internet, the amount of information available online has become enormous. Faced with this volume of information, how to use these resources more efficiently has become a major research subject. Information sources are spread widely across the Internet and normally take different forms. Given so many sources and forms, finding the information we need accurately and quickly has become a problem for everyone who uses the Internet. The emergence of search engines has mitigated this problem considerably.

However, as the pace of life quickens, the demands on search engine performance and efficiency keep rising, so accurate and fast retrieval has become a subject to which many researchers devote themselves. The personalized search engine emerged against this background. It uses relevant technologies to personalize search results. By building a user model, collecting user information and keywords, and applying the TF-IDF algorithm, we obtain weighting values for the user's keywords. The user model can then be represented as a vector formed from the keywords and their weighting values.

The main work of this thesis includes:

1. Designing the framework of the search engine, based on the working principles, key technologies, and working process of search engines.
2. Designing a suitable web crawler for the system, according to the working principles of web crawlers and Heritrix, and using it to fetch web pages.
3. Building the indexing and retrieval system for the personalized search engine based on Solr.
4. Building the user model and the personalized search engine based on Solr.

The experiments show that the system considerably improves the accuracy of search results and brings them closer to users' needs.
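
The user model described in the abstract, a vector of keywords and their TF-IDF weights, can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the exact weighting formula, so a common smoothed IDF variant is used here, and the function names `build_user_model` and `cosine` are hypothetical, not taken from the thesis.

```python
import math
from collections import Counter


def build_user_model(user_docs, corpus_docs):
    """Return a {keyword: weight} vector for one user.

    user_docs:   list of token lists collected from the user
                 (e.g. past queries or viewed pages; an assumption).
    corpus_docs: list of token lists for the reference collection.
    """
    n_docs = len(corpus_docs)

    # Document frequency: in how many corpus documents each term occurs.
    df = Counter()
    for doc in corpus_docs:
        df.update(set(doc))

    # Term frequency over everything collected for this user.
    tf = Counter(term for doc in user_docs for term in doc)
    total = sum(tf.values())
    if total == 0:
        return {}

    model = {}
    for term, count in tf.items():
        # Smoothed IDF (an assumed variant); avoids division by zero
        # for terms that never occur in the corpus.
        idf = math.log((1 + n_docs) / (1 + df[term])) + 1.0
        model[term] = (count / total) * idf
    return model


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

One possible use, again an assumption rather than the thesis's stated method, is to compute `cosine` between the user vector and a similarly weighted vector for each result returned by Solr, and to favor results with higher similarity.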
Keywords/Search Tags: Personalized, User model, TF-IDF algorithm, Heritrix, Solr