A Systematic Review of Deep Learning Approaches to Visual Saliency Prediction on Webpage Images

Faiz, Faadihilah Ahnaf and Wibirama, Sunu and Nurlatifa, Hafzatin and Setiawan, Noor Akhmad (2024) A Systematic Review of Deep Learning Approaches to Visual Saliency Prediction on Webpage Images. In: 2024 7th International Conference on Informatics and Computational Sciences (ICICoS), 17-18 July 2024, Semarang, Indonesia.

[thumbnail of A_Systematic_Review_of_Deep_Learning_Approaches_to_Visual_Saliency_Prediction_on_Webpage_Images.pdf] Text
A_Systematic_Review_of_Deep_Learning_Approaches_to_Visual_Saliency_Prediction_on_Webpage_Images.pdf - Published Version
Restricted to Registered users only

Download (254kB) | Request a copy

Abstract

A webpage provides important visual information for branding and business goals. Studying visual attention and saliency on webpages can lead to understanding where human eyes look at the salient region. With the growth of machine learning and deep learning, models from previous studies have achieved incredible improvements in saliency prediction, particularly for natural image datasets. However, webpage image datasets are different from natural images. Unfortunately, there is a lack of extensive research on predicting visual saliency for webpage images. To address the scientific gap in saliency prediction for webpage images, we conducted a systematic literature review (SLR) based on Kitchenham's method. This approach enables a thorough examination of existing research in this area. Our review includes 13 relevant articles published between 2014 and 2023. This study also reviews several deep learning models or methods developed in previous papers, compares the most common saliency evaluation metrics, identifies the different saliency prediction models used by researchers for benchmarking, and provides a comprehensive discussion of current limitations and future developments in this field of study. Our analysis reveals several key challenges in the current research. One significant issue is the limited fixation data available in existing webpage datasets. Additionally, there is a difficulty in adapting saliency models trained on natural images to webpage contexts due to notable distributional shifts between these domains. © 2024 IEEE.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Cited by: 0
Uncontrolled Keywords: Adversarial machine learning; Contrastive Learning; Deep learning; Image datasets; Learning models; Machine-learning; Natural images; Systematic literature review; Systematic Review; Visual saliency; Web-page; Webpage saliency; Prediction models
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions: Faculty of Engineering > Electrical and Information Technology Department
Depositing User: Rita Yulianti Yulianti
Date Deposited: 19 Feb 2025 01:00
Last Modified: 19 Feb 2025 01:00
URI: https://ir.lib.ugm.ac.id/id/eprint/13577

Actions (login required)

View Item
View Item