
Construction Strategy Towards Intra-Organization Search Engine

Posted on: 2017-05-08
Degree: Master
Type: Thesis
Country: China
Candidate: Z F Bian
GTID: 2348330485952684
Subject: Computer technology
Abstract/Summary:
In the era of big data, hundreds of millions of users follow the latest news through the vast amount of information produced on the Internet. In study and at work alike, people need the latest news from their schools or enterprises. However, newly updated information on an intranet is difficult to identify. In addition, many useful resources sit deep within a website, and users can usually reach the required information only by following a long chain of links. Because existing enterprise search tools cannot solve these problems, this thesis proposes a solution based on a study of how website information is updated. The main contributions are as follows:

(1) An information updating method based on an internal search engine. Information integration is the basis for calculating the update cycle of intranet information, and it determines whether intranet information can be retrieved in full. To address this, an intranet information integration method is proposed according to the characteristics of the intranet. We define several concepts, including the structure of intranet information, effective access, and the update cycle of nodes. Based on these concepts, we put forward the information updating method based on an internal search engine and compare the advantages and disadvantages of the scanning method, the information update cycle method, and the adaptive updating method.

(2) An optimization method based on TF-IDF ranking. In contrast to traditional internal search tools, we use full-text search, combining the returned web results with the search statement to calculate the weight of each web page, and we evaluate the optimized results with the NDCG evaluation strategy. The system then presents a good query result to the user.

(3) A system implementation based on the proposed methods. A system was implemented on the basis of the above methods, and its actual use verified the effectiveness of the methods proposed in this thesis.
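
The TF-IDF ranking and NDCG evaluation named in contribution (2) can be illustrated with a minimal sketch. The abstract does not give the thesis's exact weighting formula, so everything here is an assumption: the whitespace tokenizer, the smoothed IDF (`log(N / df) + 1`), the function names `tf_idf_scores` and `ndcg`, and the sample pages are hypothetical and only show the general technique.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def tf_idf_scores(query, documents):
    """Score each document against the query with a basic TF-IDF sum.

    Assumption: term frequency is normalised by document length and the
    IDF is smoothed with +1; the thesis may use a different weighting.
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term in counts:
                tf = counts[term] / len(tokens)
                idf = math.log(n_docs / df[term]) + 1.0
                score += tf * idf
        scores.append(score)
    return scores


def dcg(relevances):
    """Discounted cumulative gain of relevance grades in ranked order."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances))


def ndcg(relevances):
    """DCG divided by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0


# Hypothetical intranet pages, ranked for the query "library".
pages = [
    "library opening hours notice",
    "exam schedule notice for spring term",
    "campus library new books library catalogue",
]
scores = tf_idf_scores("library", pages)
ranking = sorted(range(len(pages)), key=lambda i: scores[i], reverse=True)
```

Passing the human-assigned relevance grades of the returned pages, in ranked order, to `ndcg` gives a score in [0, 1]; a ranking that already lists the most relevant pages first scores 1.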
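
For contribution (1), the abstract names an adaptive updating method but does not describe its rule. The sketch below shows one common heuristic for adapting a node's update cycle, not the thesis's algorithm: the revisit interval is halved when a page has changed since the last visit and doubled when it has not. The `Node` class, the `update_interval` function, the 1-hour and 168-hour bounds, and the example URL are all hypothetical.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class Node:
    """A crawled intranet page with its current revisit interval."""
    url: str
    interval_hours: float = 24.0
    last_digest: str = ""


def content_digest(content: str) -> str:
    """Fingerprint page content so that changes can be detected cheaply."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


def update_interval(node, content, min_hours=1.0, max_hours=168.0):
    """Adapt the revisit interval of a node after fetching its content.

    Assumed rule: halve the interval when the content changed, double it
    otherwise, and clamp the result to [min_hours, max_hours].
    Returns True if the content changed since the previous visit.
    """
    digest = content_digest(content)
    changed = digest != node.last_digest
    if changed:
        node.interval_hours = max(min_hours, node.interval_hours / 2)
    else:
        node.interval_hours = min(max_hours, node.interval_hours * 2)
    node.last_digest = digest
    return changed


# The first visit always counts as a change because no digest is stored yet.
node = Node("http://intranet.example/news")
first = update_interval(node, "version 1")    # changed, interval 24 -> 12
second = update_interval(node, "version 1")   # unchanged, interval 12 -> 24
```

Under this rule, frequently changing pages are revisited more often, while stable pages drift towards the maximum interval, which avoids rescanning the whole intranet on every pass.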
Keywords/Search Tags: Information Integration, Information Update, TF-IDF, Search Ranking