Search engines, as an information retrieval technology, play an important role in today's Internet era. At the same time, online rental platforms keep multiplying, and it is increasingly hard to tell which of them are genuine, so renting a satisfactory house has become a problem for more and more people. A distributed search engine for rental information based on Bayesian classification is therefore needed. Building on a study of web crawler technology, distributed technology, and Bayesian classification, this paper designs and implements a small distributed search engine for rental information. First, I carry out a requirements analysis of the distributed search engine and put forward the design goals, functional requirements, and overall framework of the system. Then, the three modules of the system are designed and implemented. The distributed information acquisition module includes a distributed crawler that collects data, data cleansing based on Map/Reduce, data storage based on HBase, and a simple weighted Bayesian classifier. The distributed index module proposes an indexing scheme based on ElasticSearch. The distributed retrieval module interacts with users to handle their searches and then returns the results in a structured form. Finally, repeated tests show that the system meets the design expectations and the needs of users.
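To make the idea of a simple weighted Bayesian classifier concrete, the following is a minimal sketch, not the system's actual implementation. It assumes a multinomial naive Bayes model over tokenized listings in which each term's log-likelihood is multiplied by a per-term weight; the weighting scheme, the class name `WeightedNaiveBayes`, the labels `suspicious` and `genuine`, and the sample tokens are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


class WeightedNaiveBayes:
    """Multinomial naive Bayes with per-term weights (illustrative sketch only)."""

    def __init__(self, term_weights=None, alpha=1.0):
        # term_weights: hypothetical map from term to weight; unlisted terms get 1.0.
        self.term_weights = term_weights or {}
        self.alpha = alpha  # Laplace smoothing constant
        self.class_counts = Counter()
        self.term_counts = defaultdict(Counter)
        self.vocabulary = set()

    def fit(self, documents, labels):
        # Count documents per class and term occurrences per class.
        for tokens, label in zip(documents, labels):
            self.class_counts[label] += 1
            self.term_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        return self

    def log_scores(self, tokens):
        # Score each class as log prior plus weighted log-likelihoods.
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocabulary)
        scores = {}
        for label, doc_count in self.class_counts.items():
            score = math.log(doc_count / total_docs)
            total_terms = sum(self.term_counts[label].values())
            for term in tokens:
                if term not in self.vocabulary:
                    continue  # ignore terms never seen during training
                likelihood = (self.term_counts[label][term] + self.alpha) / (
                    total_terms + self.alpha * vocab_size
                )
                score += self.term_weights.get(term, 1.0) * math.log(likelihood)
            scores[label] = score
        return scores

    def predict(self, tokens):
        scores = self.log_scores(tokens)
        return max(scores, key=scores.get)


# Hypothetical training data: tokenized rental listings with made-up labels.
training_docs = [
    ["no", "deposit", "pay", "now"],
    ["cheap", "pay", "now", "urgent"],
    ["two", "bedroom", "near", "subway"],
    ["furnished", "bedroom", "viewing", "available"],
]
training_labels = ["suspicious", "suspicious", "genuine", "genuine"]

model = WeightedNaiveBayes(term_weights={"pay": 2.0, "viewing": 2.0})
model.fit(training_docs, training_labels)
prediction = model.predict(["pay", "now"])
```

In this sketch a weight above 1.0 makes a term count more heavily toward the decision, which is one common way to emphasize terms believed to be more informative; other weighting schemes are equally plausible.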