Hybrid Wavelet-Attention Model for Detecting Changes in High-Resolution Remote Sensing Images

Fazry, Lhuqita and Ramadhan, Mgs M.Luthfi and Ramadhan, Alif Wicaksana and Rachmadi, Muhammad Febrian and Mantau, Aprinaldi Jasa and Nugroho, Lukito Edi and Chi, Chihung and Jatmiko, Wisnu (2025) Hybrid Wavelet-Attention Model for Detecting Changes in High-Resolution Remote Sensing Images. IEEE Access, 13. 107035 - 107054.

[thumbnail of Hybrid_Wavelet-Attention_Model_for_Detecting_Changes_in_High-Resolution_Remote_Sensing_Images.pdf] Text
Hybrid_Wavelet-Attention_Model_for_Detecting_Changes_in_High-Resolution_Remote_Sensing_Images.pdf - Published Version
Restricted to Registered users only

Download (6MB) | Request a copy

Abstract

Change detection is a remote sensing task for detecting a change from two satellite images in the same area, while being taken at different times. Change detection is one of the most difficult remote sensing tasks because the change to be detected (real-change) is mixed with apparent changes (pseudo-change) due to differences in the two images, such as brightness, humidity, seasonal differences, etc. The emergence of a Vision Transformer (ViT) as a new standard in Computer Vision, replacing Convolutional Neural Network (CNN), also shifts the role of CNN in the field of remote sensing. Although ViT can capture long-range interactions between image patches, its computational complexity increases quadratically with the number of patches. One solution to reduce the computational complexity in ViT is to reduce the key and value matrices in the self-attention (SA) mechanism. However, this causes information loss, resulting in a trade-off between the effectiveness and efficiency of the method. To solve the problem, we developed a new change detection method called WaveCD. WaveCD uses Wave Attention (WA) instead of SA. WA uses the Discrete Wavelet Transform (DWT) decomposition to reduce the key and values matrices. Besides reducing the data, DWT decomposition also serves to extract important features that represent images so that the initial data can be approximated through the Inverse Discrete Wavelet Transform (IDWT) process. On the CDD dataset, WaveCD outperforms the state-of-the-art CD method, SwinSUNet, by 12.3 on IoU and 7.3 on F1 score. While on the LEVIR-CD dataset, WaveCD outperforms SwinSUNet by 4 on IoU and 2.5 on F1 score.

Item Type: Article
Additional Information: Cited by: 0; All Open Access; Gold Open Access
Uncontrolled Keywords: Change detection; deep learning; remote sensing; vision transformer; wave attention.
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions: Faculty of Engineering > Electrical and Information Technology Department
Depositing User: Rita Yulianti Yulianti
Date Deposited: 04 May 2026 01:35
Last Modified: 04 May 2026 01:35
URI: https://ir.lib.ugm.ac.id/id/eprint/24908

Actions (login required)

View Item
View Item