Research And Implementation Of A Three-Dimensional Hybrid Spam Filtering Method

Posted on:2009-01-18

Degree:Master

Type:Thesis

Country:China

Candidate:Z N Xu

GTID:2178360242476746

Subject:Computer software and theory

Abstract/Summary:

Spam filtering is a principal anti-spam technique in the tug-of-war between spam and anti-spam measures. Most spam filtering techniques, such as SVM, K-NN, Boosting, Winnow, and Bayesian filtering, are based on machine learning and content analysis. These techniques suffer from unsatisfactory recall rates, long training times, and high false alarm rates.

This paper proposes a three-dimensional hybrid spam filtering scheme consisting of three filtering technologies: collaborative spam filtering based on user feedback, whitelist spam filtering based on the personal email network, and adaptive Bayesian filtering.

Collaborative spam filtering targets mass spam mailing. It uses a modified Nilsimsa digest algorithm to recognize similar spam messages, and it collects user feedback both directly and indirectly. Whitelist spam filtering targets mass legitimate mailing and works by calculating the clustering coefficient of personal email networks. Adaptive Bayesian filtering builds on the previous two techniques and uses their outputs in its multi-iteration training.

Experiments show that our system improves the recall rate by 4.26% and the precision rate by 0.27% compared with Naive Bayesian filtering. It reduces the user's Total Cost Ratio by 15%.
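
The whitelist component relies on the clustering coefficient of a personal email network. The abstract does not say how the network is built or how the coefficient is thresholded, so the following is only a minimal sketch under stated assumptions: each message links its sender to each of its recipients in an undirected graph, and the standard local clustering coefficient is computed per address. The names `build_network` and `clustering_coefficient` and the sample addresses are hypothetical and not taken from the thesis.

```python
from itertools import combinations


def build_network(messages):
    """Build an undirected graph mapping each address to its neighbours.

    Assumption: a message creates an edge between the sender and every
    recipient. The thesis may construct its network differently.
    """
    graph = {}
    for sender, recipients in messages:
        for recipient in recipients:
            if recipient == sender:
                continue  # ignore self-addressed mail
            graph.setdefault(sender, set()).add(recipient)
            graph.setdefault(recipient, set()).add(sender)
    return graph


def clustering_coefficient(graph, node):
    """Standard local clustering coefficient of `node`.

    It is the fraction of pairs of the node's neighbours that are
    themselves connected. Nodes with fewer than two neighbours get 0.0.
    """
    neighbours = graph.get(node, set())
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(
        1 for a, b in combinations(neighbours, 2) if b in graph.get(a, set())
    )
    return 2.0 * links / (k * (k - 1))


# Hypothetical sample data: a small circle of acquaintances plus one
# sender who mails unrelated addresses.
sample_messages = [
    ("alice", ["bob", "carol"]),
    ("bob", ["carol"]),
    ("bulk-sender", ["alice", "dave", "erin"]),
]
network = build_network(sample_messages)
for address in sorted(network):
    print(address, round(clustering_coefficient(network, address), 3))
```

In this sample, addresses whose contacts also write to each other score high, while a sender whose recipients have no links among themselves scores zero. How such scores are turned into a whitelist decision is not stated in the abstract, so any cutoff would be a further assumption.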

Keywords/Search Tags:

anti-spam, bayesian filtering, collaborative filtering, personal email network

Related items

1	Algorithm Based On Bayesian Filtering, Anti-spam Technology And Its Implementation
2	The Research In Anti-spam With Bayesian Data-mining Algorithm
3	Research And Improvement Of Chinese Spam Emails Filtering Method Based On Bayesian Classification
4	A Collaborative Filtering Algorithm Based On The Bayes Classification Of Email Networks
5	Research On Email Filtering Mechanism Based On Cloud-Computing Techniques
6	The Research On Anti-Spam Email Techniques
7	Research And Implementation Of Spam Filtering System Based On Improved Bayesian Algorithm
8	Research And Implementation Of Distributed Anti-Spam System Based On Bayesian
9	The Design And Implementation Of Content-Based Anti-Spam Email System
10	Research And Implementation Of The Anti-spam System Based On Bayesian Algorithm