Research And Application Of Visual Attention Model

Posted on:2013-02-17

Degree:Master

Type:Thesis

Country:China

Candidate:Y Wang

Full Text:PDF

GTID:2218330362459251

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

Visual attention is the ability that human focuses attention on a few important areas, and then utilizes limited computational resource to process them prior, when facing a complex scene. However, many image processing algorithms waste much time and computational resource to deal with complex scene at the moment. So it's important to model the visual attention and introduce it into the procedure of image processing.First, this dissertation introduces the physiological structures of visual attention. Each part of vision is explained along the visual path. Then a few important'top-down'visual attention models are introduced, including Itti's model which models the physiological structures strictly and is used most widely, Spectral Residual algorithm which is most efficient, PQFT algorithm which is expanded from Spectral Residual, and the global contrast algorithm which is the latest and best performance.However, all these algorithms have drawback. For example, Itti's model is inefficient; Spectral Residual algorithm doesn't utilize color information; Spectral Residual algorithm and PQFT algorithm tend to highlight the boarder of object; the global contrast algorithm can't process the image with complex background.At last, this dissertation proposes a new salient region detection algorithm. The goals of the algorithm are:1. Uniformly highlight the whole salient regions instead of edges or boarders.2. Be able to detect salient object from background with complex texture.3. Be efficient, so as to meet the demand of large-scale image processing, retrieval and classification. After analysis, this dissertation proposes the frequency and spatial domain analysis based salient region detection method. We find out that the background which attract less attention usually appear periodically, and lead to sharp spikes in the amplitude spectrum. We remove the spikes in the amplitude spectrum by median filter and thereby suppress the periodical background. How to choose the window size of median filter is a problem. First we filter the amplitude spectrum with different window size, and transform all these results into spatial domain. Then we utilize spatial standard deviation to select the candidate. At last, we define and utilize a contrast function to choose the best saliency map. Our algorithm utilizes the intensity and color information, and utilizes global contrast algorithm to improve the result.The frequency domain analysis aims at detecting salient object from complex background, and it's very efficient. Meanwhile, the spatial domain analysis covers the shortage of frequency domain analysis which highlights the boarder of object, and highlights the whole object uniformly. So the goals are achieved.We compare our algorithm and 10 other state-of-the-art algorithms on MSRA salient object database and the largest accurately labeled salient object dataset. The results show that our algorithm outperforms other methods on precision rate, recall rate, F-measure, and the area under ROC.

Keywords/Search Tags:

visual attention, saliency map, frequency domain, spatial domain, image segmentation

PDF Full Text Request

Related items

1	Saliency Detection Based On Spatial Domain And Frequency Domain
2	Visual Saliency Detection Research Based On Multiple Domains And Features
3	Research On Visual Saliency Detection Method Combining Time Domain And Frequency Domain
4	Research Of Temporal Spatial Fusion VSA Model And Its Application In MPEG Compressed Domain
5	Research On Improved Saliency Detection Model In Frequency Domain And Its Application
6	Saliency Detection And Applications Based On Frequency Domain And Spatial Domain
7	Visual Attention
8	Research On Visual Attention Model With Joint Spatial And Feature Domain Information
9	A Video Saliency Detection Method Based On Frequency Domain Prior And Spatiotemporal Correlation
10	Study On Visual Saliency Detection In Compressed Domain And Its Applications