Font Size: a A A

Research And Implementation Of Filter For Erotic Webpage Based On Content

Posted on:2013-09-21Degree:MasterType:Thesis
Country:ChinaCandidate:J SunFull Text:PDF
GTID:2248330371983219Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the rapid development of Internet, people can easily transfer and share vastamounts of information resources. It brings great convenience to people’s production, life, andinformation exchange, promoting global economic and cultural exchanges. However, Internetalso provides a chance for lawbreaker to release and spread information such as pornography,violence, reactionary. The amount of information on the Internet is rapid growth at anexponential form, and the type of information has become richer from containing just a singletext gradually into containing images, video and other multimedia information. Pornography,violence and other sensitive video because of its powerful visual impact has been used widelyby lawbreaker, due to Internet’s cross-regional, cross-border, and open communication, itsconsequences is around the corners of the world, and endangers social stability and people’sdaily life. Therefore, the design and development of sensitive Web filter to create a greenChina’s Internet environment, maintain a stable social environment and protect Internet users’especially young people’ s physical and mental health has very important significance.Based on this, sensitive web filter was designed and implemented in this study using theBHO technique. The filter is composed of the URL filter, page text filter, and web-sensitiveimage filter. First, URL filtering, the BHO technology can get access to the URL of the pagefrom the IE browser’s address bar. Compare the URL with sensitive URL databaseinformation, if it is sensitive URL, return a blank page; if it is not, detect web page text andpage image. Second, filter page text, if it is not sensitive URL, the browser device downloadsweb resources and filters web page text and image, it can be informed when the downloadedcompletes by DocunmentComplete event. Once finished downloading, achieve text contentby using DHTML document model, and match the web page text and sensitive wordsdatabase with the largest jump (SMA) algorithm. Last, filtering of sensitive image inWebPages, using the algorithm combination of face detection, color detection, skin texturedetection and classification device identification to detect. The purpose of face detection is todetermine that if the image contains characters. We use Sobel operator and statistics straightside graph model to implement skin color detection based on texture, to determine the regionof the skin color. We use Gabor filtering to detect texture of the skin in the region of the skin color. Identify the sensitive images and non-sensitive image with classifier.The test results show that the sensitive web filter designed in this paper can effectivelyintercept and filter sensitive pages, basically control the access to sensitive sites.
Keywords/Search Tags:Sensitive web filtering, URL filtering, Web page text filtering, sensitive image filtering
PDF Full Text Request
Related items