Font Size: a A A

Development Of Email Precise Extraction And Targeted Mail Delivery System

Posted on:2016-04-23Degree:MasterType:Thesis
Country:ChinaCandidate:R ZhaoFull Text:PDF
GTID:2308330467473258Subject:Electronic and communication engineering
Abstract/Summary:PDF Full Text Request
Targeted mail delivery is a form of Internet marketing with e-mail as the medium, and iswidely used. For mail marketing, the mails are the most important and directly impact the effectof mail marketing. On one hand, if blindly gets a lot of email addresses, it will not only cause awaste of resources, and also disgusted by the users. On the other hand, if only get user email viaemail subscription, we may miss a lot of potential customers. Therefore, we want to find a wayto get potential users email address, which can precisely obtain user’s e-mail address, so thatpotential customers have access to news and information of interest, and also not disturb otherusers as little as possible.This system is designed to accurately extract email addresses of potential customers, mainlyconsists of email extraction module and targeted mail delivery module. In the email extractionmodule, this system is based on search engine keywords, get the mail address associated with thekeyword. Besides, this system uses a variety of filtering and validation to achieve accuratelyextract email address. In the targeted mail delivery module, this system first send a message tousers tentatively, then remove non-target user’s mails according to bounce, reply and unsubscribetag to improve the precision of mail delivery.In order to improve the accuracy and relevance of this system, the main work of this articleare as follows:First, improve the extraction accuracy of the mail address by filtering links. Filtering outirrelevant links in the search engine results pages, you can effectively reduce the proportion ofunrelated links and improve mail address and keyword relevance. At the same time, matching thekeyword and link title can further filter out irrelevant links.Second, through multithread technology to increase the efficiency of mail address extraction.After obtain the hyperlinked queue, the use of multithread programming technique to achieve thebulk of the work assignment, significantly improve the efficiency of resolving links. Meanwhile,in order to avoid resource-intensive exceeds the system limit, this system limits the number ofmulti-threaded.Third, use deep search technology. Deep Search is to get links and its subordinate linkswithin the search results page. Search depth can be set by the users. But, in order to improve therelevance of extracting email address, this system limits the maximum search depth (themaximum3).Fourth, when matching the email address in the pages, combination of HTML Parser and regular expressions can improve extraction efficiency. When matching the email address, thissystem use HTML Parser to directly extract plain text content for matching instead ofmatching the input stream line by line, greatly enhance the efficiency of extraction.Fifth, use email correction and email validation to improve the quality of mail address.Sixth, system provides real-time transmission status markers, as well as bounces andresponses. And then remove non-target user’s mail from the mailboxes to improve the accuracyof subsequent delivery.
Keywords/Search Tags:search engines, email extraction, HTML Parser, regular expressions, mail delivery
PDF Full Text Request
Related items