Font Size: a A A

The Design And Implementation Of Enterprise Information Search Engine

Posted on:2012-07-06Degree:MasterType:Thesis
Country:ChinaCandidate:J H ZhengFull Text:PDF
GTID:2248330371965459Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the development of IT application process, more and more enterprises have built their own Intranets, in which the volume and variation of data growsvery fast. Consequently, it becomes more and more difficult for users to find out information that they are really interested in, it’s almost impossible without effective search engine.In-site-search service provided by commercial search engine such as Google can be a choice, however this kind of service is mainly designed to satisfy most enterprises’ common demands, some deficiencies can not be overcome, for example,①lack of quantity:commercial search engine will never traverse a site very deeply, furthermore, the spider call only collect HTML page and can not do anything about other data format such as pdf, word and even plain text.②Can not update in real time, there is a certain cycle for commercial search engine to update, some times newly-added data call not be indexed on time;③Accuracy is also low, as said before, commercial search engine call only collect data through HTML page, it is very difficult to avoid duplication.In order to provide more high-quality searching service, enterprise must develop their own search engine, that is why we develop the ESE(enterprise search engine).According to the demand, this paper provides a solution to build a ESE. We design a maintenance-module and a search-service module to help the searching and a monitor filesystem to get the new file.(1) Putting forward the architecture framework of retrieval system of enterprise literature, in this framework, database and index files to achieve loosely coupled.(2) Base on the API of inotify, we monitor the file systems to get the change information of the file systems.(3) On the basis of study of existing index technology, design and implement index services and enquiry services to meet system requirements. Index services not only created index for the body of literature, but also stored informationof literature attributes and relevant information of document database to the index. Consequently,we provide the Synonyms retrives to provide convenience for the enquiry services.
Keywords/Search Tags:search engine, Lucene, Chinese word, file monitor
PDF Full Text Request
Related items