Font Size: a A A

The Design And Realization Of The Full-text Search Engine Used In The E-mail

Posted on:2011-06-08Degree:MasterType:Thesis
Country:ChinaCandidate:X CengFull Text:PDF
GTID:2208360308467288Subject:Software engineering
Abstract/Summary:PDF Full Text Request
E-mail has been to widely use as one of means with communication, it has an important role on monitoring reasonablely and effectively to ensure network security and to prevent the spread of unhealthy information. Thus, a variety of email archiving systems came into being, it is hoped that through e-mail archiving system for e-mail can be effective monitoring, network management provides reference information.To this end, full-text search technology in the e-mail archiving system has been fully applied, e-mail archiving system is only through full-text search engine in order to achieve efficient management and rapid e-mail queries, such as Rose introduced RoseMK company's e-mail archiving system is one of the However, due to the performance of its search engine's own shortcomings, but also unable to meet the needs of the user's current search query.This paper is to address the problem, a full-text retrieval system as the main object of study, in depth analysis of the Lucene full-text search tool kit, based on the system for full-text search engine RoseMK made a comprehensive upgrade, to develop a new set of full-text search engine. The main contents are:1. Through research and testing Lucene full-text search tool kit, analysis Lucene full-text search provided by the API, on this basis to cut, edit and package design a core, to achieve the minimum requirements of the full-text retrieval system cited.2. The introduction of e-mail parsing module, form a set of specific full-text retrieval system on the e-mail.3. Join the Chinese word extraction and document the function, you can involve all the contents of messages - including sender, recipient, Cc, dark give as gifts, date, subject name, attachment name, size, body, attachment content, the source Client ip and mac address, the purpose of client ip and mac address and the login name for Chinese pop full-text search, so that the full realization of the full-text search application in the mail.4. The new full-text search engine will be integrated into the RoseMK system, and tested to validate its performance advantages. This paper aims to develop a new type of high-performance full-text search engine, to make it in the mail archive system, give full play to the role of e-mail management and monitoring, and from software engineering point of view, respectively, for the new full-text search engine, needs analysis, summary design, detailed design, performance tests are described and notes.
Keywords/Search Tags:E-mail archiving system, Full-text search engine, E-mail Analysis, Document extract, Chinese word segmentation
PDF Full Text Request
Related items