A Semi-supervised Element Sensitive Saliency Model With Position Bias Learning For Web Pages

Posted on:2020-07-03

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Gu

Full Text:PDF

GTID:2428330623963713

Subject:Major in Electronic and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Understanding human visual attention is important for multimedia applications.Many studies have attempted to learn from eye-tracking data and build computational saliency prediction models.Since the Internet era,web pages,as the core application on the Internet,have become the most common and important visual objects in people's lives.The visual saliency prediction of the webpage can reduce the manual test in the webpage design process for reducing the labor cost,and can also assist the webpage retrieval technology to extract the core content of the webpage for improving the retrieval performance.Therefore,the research on the visual saliency prediction model for webpage images has practical significance and application values.However,there are currently a small number of related researches devoted to the visual saliency prediction of web pages.The existing work is mainly focused on exploring better feature representations and using support vector machines to make saliency predictions without considering two key features of the problem:(1)Web pages contain more diverse content elements than natural images.Different elements usually have different effects on the human visual system.(2)The layout of web pages is greatly affecting the area of interest of human eye on the webpage image,and different webpage layouts will result in different reading modes which lead to different visual position bias.In addition,because the labeling cost of visual saliency prediction is very high,the scale of the existing labeled data set is very small,and the published FiWI data set contains only 149 images,which limits the ability of deep learning models.Based on the above two key features of the visual saliency prediction problem and the limitation of data size,this paper proposes a novel visual saliency model for web page image.The whole model is based on deep neural network design and can be used for end-to-end training and inference.Aiming at the above key points(1),considering that the image and text are the main elements in the webpage,this paper proposes an element-sensitive webpage saliency model.In this paper,a multi-discriminative region detection module and a text region detection module are proposed to locate the semantic image region and the region area in the webpage image respectively.The feature representations extracted by the two modules are more in line with the human attention mechanism,and additional prior knowledge is also introduced into the model.Aiming at the above key points(2),this paper proposes a visual saliency model framework with position bias learning,and proposes the use of variational autoencoder and Gaussian mixture model to model the visual position bias caused by web page layout as probability distribution.After the modeling of the position bias is completed,a position bias learning network is trained to predict the position bias,so that the position bias features affected by the webpage layout are introduced into the model to improve the prediction performance of the model.Aiming at the limitation of the data size,this paper proposes a semi-supervised learning algorithm based on auxiliary task for the visual saliency model of the webpage,and proposes to use the text region detection task and the discriminative region detection task as the auxiliary tasks.These auxiliary tasks have high correlation with the saliency prediction task.Thus,the model can use a large number of webpage images without visually significant annotations for semi-supervised training by using these auxiliary tasks,avoiding the over-fitting to the small dataset,and improve the prediction performance.This paper experiments on the published FiWI dataset to verify the effectiveness of the proposed model and algorithms.The semi-supervised element sensitive saliency model with position bias learning proposed in this paper outperforms other existing stateof-the-art visual saliency models in many performance indicators,and obtains the best web page saliency prediction at present.

Keywords/Search Tags:

Visual saliency, web page, deep learning, variational autoencoder, gaussian mixture model, semi-supervised learning

PDF Full Text Request

Related items

1	Research And Application Of Active Semi-supervised Gaussian Mixture Model Clustering Algorithm
2	Semi-supervised Generalized Zero-shot Learning Based On Modal Fusion
3	Saliency Detection And Weakly Supervised Learning For Intelligent Visual Information Processing
4	Research On Image Generation Algorithm Based On Variational Auto-Encoder
5	Variational Auto-encoders Based On Gaussian Mixture Model
6	Industrial Process Fault Classification Based On Semi-Supervised Deep Learning
7	Research On The Semi-supervised Few-shot Classification Based On Siamese Network And GMM
8	Visual Tracking Algorithm Based On Deep Learning And Semi-supervised Learning
9	Deep Learning-Based Co-saliency Detection Research
10	SAR Image Target Recognition Based On Semi-supervised Learning