The Research And Application Of Massive E-mail Automatic Analysis Technology

Posted on: 2015-07-08
Degree: Master
Type: Thesis
Country: China
Candidate: M Wu
Full Text: PDF
GTID: 2308330473450897
Subject: Computer technology
Abstract/Summary:
E-mail contains a wealth of information and has become an important subject of data mining and big data analysis, and many users now need to use and analyse that information. Converting original mail files into metadata quickly and efficiently, and building an automatic analysis platform for massive e-mail on top of that metadata, provides a good basis for making use of this information.

This thesis studies the key technologies involved in the automated analysis of massive e-mail, and designs and implements a massive e-mail automated analysis system. First, to meet the two demands of massive content and automation, the thesis builds a fast e-mail import module that parses and classifies the meta information. This improves import efficiency and reduces data size while improving the user experience and preserving the completeness of the information. It resolves the tension between processing speed and e-mail usability at massive scale and lays the groundwork for further data mining. Second, through in-depth study of users' actual work, the thesis identifies the characteristics and management features of the manual analysis workflow and integrates that workflow into the system, which reduces unnecessary manual analysis work and the operating cost of the program and strengthens the information analysis capability. Third, with e-mail metadata, e-mail context information, and analysis results properly stored in a database, the thesis implements indexing and searching of that information, improving the ability to quickly retrieve information of interest from massive e-mail. Fourth, on this basis, the system implements automatic classification and marking of e-mail, which raises the system's degree of automation. Fifth, the thesis implements statistics and export functions for information of interest, completing the processing chain from decomposition, classification, indexing, and statistics back to integration. Finally, to meet the requirements of specific processes and information management in the actual work environment, the thesis establishes a role-based information management system, which raises the level of informatization and automation of the entire work.

After the system was deployed, I analysed the problems revealed by the statistics and by comparison of the statistical results. The verification statistics show that the system basically meets users' requirements and can serve the actual work.

The thesis closes by summarizing the experience gained and the improvements the system still needs, and by presenting some of my own ideas on the future development of, and research into, automated analysis systems for massive e-mail.
Keywords/Search Tags:search, massive data, e-mail, multi-thread, automatic classification
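
The abstract gives no implementation details for the import module, so the following is only a minimal sketch of what multi-threaded conversion of raw mail into metadata might look like. It assumes raw RFC 5322 messages are already available as bytes and uses Python's standard `email` and `concurrent.futures` modules. The function names `extract_metadata` and `import_messages`, the chosen metadata fields, and the worker count are hypothetical and are not taken from the thesis.

```python
from concurrent.futures import ThreadPoolExecutor
from email import message_from_bytes, policy


def extract_metadata(raw: bytes) -> dict:
    """Parse one raw message and keep only header-level metadata.

    Hypothetical field selection; the thesis does not list its fields.
    """
    msg = message_from_bytes(raw, policy=policy.default)
    return {
        "from": str(msg["From"] or ""),
        "to": str(msg["To"] or ""),
        "subject": str(msg["Subject"] or ""),
        "date": str(msg["Date"] or ""),
        # Count attachments instead of storing them, to keep records small.
        "attachments": sum(1 for _ in msg.iter_attachments()),
    }


def import_messages(raw_messages, workers: int = 4) -> list:
    """Extract metadata from many messages using a pool of worker threads.

    pool.map returns results in the same order as the input messages.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(extract_metadata, raw_messages))


if __name__ == "__main__":
    sample = (
        b"From: alice@example.com\r\n"
        b"To: bob@example.com\r\n"
        b"Subject: Hello\r\n"
        b"\r\n"
        b"Body text\r\n"
    )
    for row in import_messages([sample, sample]):
        print(row)
```

In CPython, threads mainly help when the work is dominated by I/O such as reading mail files from disk or a network share. For parsing that is CPU-bound, a process pool would be the usual alternative.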