As the Internet continues to develop, it carries a great amount of both legal and illegal information. To offer Internet users healthy and secure information, illegal web pages must be filtered out. Traditional Web filtering technologies can be divided into URL filtering and content filtering. URL filtering is efficient, but as the resources on the Internet increase, records must continually be added to its URL database by hand. Content filtering analyzes content online and so saves the cost of maintaining a URL database, but its complex computations make it too inefficient for practical use. This paper presents an enhanced web content filter that combines URL filtering and content filtering. The filter retains the efficiency of URL filtering and can also filter unknown web resources. Furthermore, we use the characteristics of HTML and a two-stage method to improve the efficiency of the enhanced web content filter. For website resources that the URL filter cannot handle, we use an enhanced Bayesian classifier. The experimental results show that the filter we have designed is more efficient than a traditional content filter and gives better results than a URL filter.
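The combined design can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify the HTML characteristics used or how the Bayesian classifier is enhanced, so the sketch assumes a plain multinomial naive Bayes classifier with Laplace smoothing over the visible page text. All names (`TwoStageFilter`, `NaiveBayes`, `tokenize`, the `blocked`/`allowed` labels) are hypothetical.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser
from urllib.parse import urlparse


class _TextExtractor(HTMLParser):
    """Collects the visible text of a page, skipping script and style bodies."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.chunks.append(data)


def tokenize(html):
    """Return lowercase alphabetic tokens from the visible text of `html`."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return re.findall(r"[a-z]+", " ".join(parser.chunks).lower())


class NaiveBayes:
    """Plain multinomial naive Bayes (an assumption, not the paper's variant)."""

    def __init__(self):
        self.word_counts = {"blocked": Counter(), "allowed": Counter()}
        self.doc_counts = Counter()

    def train(self, html, label):
        self.word_counts[label].update(tokenize(html))
        self.doc_counts[label] += 1

    def classify(self, html):
        # Assumes at least one training page has been seen for each label.
        tokens = tokenize(html)
        vocab = set(self.word_counts["blocked"]) | set(self.word_counts["allowed"])
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label, counts in self.word_counts.items():
            score = math.log(self.doc_counts[label] / total_docs)
            denom = sum(counts.values()) + len(vocab)  # Laplace smoothing
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)
            scores[label] = score
        return max(scores, key=scores.get)


class TwoStageFilter:
    """Stage 1: URL database lookup. Stage 2: content classification."""

    def __init__(self, blocked_hosts, classifier):
        self.blocked_hosts = set(blocked_hosts)
        self.classifier = classifier

    def check(self, url, html):
        host = urlparse(url).hostname or ""
        if host in self.blocked_hosts:
            return "blocked"  # known URL: no content analysis needed
        return self.classifier.classify(html)  # unknown URL: analyze content


nb = NaiveBayes()
nb.train("<html><body>casino gambling poker bets</body></html>", "blocked")
nb.train("<html><body>weather news science school</body></html>", "allowed")
web_filter = TwoStageFilter({"bad.example"}, nb)
print(web_filter.check("http://bad.example/page", "<p>weather news</p>"))
print(web_filter.check("http://new.example/", "<p>poker casino bets</p>"))
```

In this sketch, the costly content analysis runs only when the URL lookup misses, which is the source of the efficiency gain the combined filter aims for; a known host is rejected without looking at the page at all.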