Font Size: a A A

Research And Application Of Key Technology In Intelligent Search Engine

Posted on:2018-09-10Degree:MasterType:Thesis
Country:ChinaCandidate:Y J GaoFull Text:PDF
GTID:2348330512496124Subject:Engineering
Abstract/Summary:PDF Full Text Request
The construction of information has made great achievements in various fields including the construction of bandwidth and higher speed of the network infrastructure,research and development based on the memory database cluster of new data warehouse,large-scale distributed cloud computing applications,design and development more user experience of the various application interface,All of the above different levels of innovation are to cope with the challenges of large data era.However,people are faced to the challenge of more accurate and efficient access to information.Therefore,more and more network companies and research institutions began to adopt new technologies to develop or optimize search engines in their areas,academics also hope to have a greater breakthrough in the field of search engine innovation.Therefore,on the basis of studying the core principles of the search engine and the classical algorithms,this paper has the following aspects of the research results:(1)Mainly to analysis the status and development trend of search engine at home and abroad,the relevant theory,the system structure and evaluation standard.(2)This paper focuses on the Chinese search engine,according to the features of the principle and algorithm of Chinese word segmentation,the principle of vector space model(VSM)and the similarity algorithm based on VSM,a new algorithm based on VSM is proposed.And the optimization performance of the improved algorithm has been verified.(3)According to the characteristics of the word segmentation,which is based on the“forward iterative fine-grained segmentation” algorithm,this paper have optimized the word segmentation of the engine framework based on Nutch,also designed and built a search engine system.In conclusion,compared with the old method,the search engine system based on Nutch has higher retrieval accuracy,and has good practical value.
Keywords/Search Tags:Search Engine, Nutch, Chinese Word Segmentation, Vector Space Model, Text Similarity
PDF Full Text Request
Related items