Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Ehsan Khankeshizadeh
  • Ali Mohammadzadeh
  • Amin Mohsenifar
  • Armin Moghimi
  • Saied Pirasteh
  • Sheng Feng
  • Keli Hu
  • Jonathan Li

External Research Organisations

  • K.N. Toosi University of Technology
  • Shaoxing University
  • University of Waterloo
View graph of relations

Details

Original languageEnglish
Article number101336
JournalRemote Sensing Applications: Society and Environment
Volume36
Early online date2 Sept 2024
Publication statusPublished - Nov 2024

Abstract

In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of PFN/ PFP value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average PFN/ PFP value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.

Keywords

    Building change map, Building detection, DAttResU-Net, VHR-RSIs

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps. / Khankeshizadeh, Ehsan; Mohammadzadeh, Ali; Mohsenifar, Amin et al.
In: Remote Sensing Applications: Society and Environment, Vol. 36, 101336, 11.2024.

Research output: Contribution to journalArticleResearchpeer review

Khankeshizadeh, E, Mohammadzadeh, A, Mohsenifar, A, Moghimi, A, Pirasteh, S, Feng, S, Hu, K & Li, J 2024, 'Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps', Remote Sensing Applications: Society and Environment, vol. 36, 101336. https://doi.org/10.1016/j.rsase.2024.101336
Khankeshizadeh, E., Mohammadzadeh, A., Mohsenifar, A., Moghimi, A., Pirasteh, S., Feng, S., Hu, K., & Li, J. (2024). Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps. Remote Sensing Applications: Society and Environment, 36, Article 101336. https://doi.org/10.1016/j.rsase.2024.101336
Khankeshizadeh E, Mohammadzadeh A, Mohsenifar A, Moghimi A, Pirasteh S, Feng S et al. Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps. Remote Sensing Applications: Society and Environment. 2024 Nov;36:101336. Epub 2024 Sept 2. doi: 10.1016/j.rsase.2024.101336
Khankeshizadeh, Ehsan ; Mohammadzadeh, Ali ; Mohsenifar, Amin et al. / Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net) : An application to generating building change maps. In: Remote Sensing Applications: Society and Environment. 2024 ; Vol. 36.
Download
@article{3ba43fc9ed1a4da68928958684e6fd8f,
title = "Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net): An application to generating building change maps",
abstract = "In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of PFN/ PFP value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average PFN/ PFP value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.",
keywords = "Building change map, Building detection, DAttResU-Net, VHR-RSIs",
author = "Ehsan Khankeshizadeh and Ali Mohammadzadeh and Amin Mohsenifar and Armin Moghimi and Saied Pirasteh and Sheng Feng and Keli Hu and Jonathan Li",
note = "Publisher Copyright: {\textcopyright} 2024 Elsevier B.V.",
year = "2024",
month = nov,
doi = "10.1016/j.rsase.2024.101336",
language = "English",
volume = "36",

}

Download

TY - JOUR

T1 - Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net)

T2 - An application to generating building change maps

AU - Khankeshizadeh, Ehsan

AU - Mohammadzadeh, Ali

AU - Mohsenifar, Amin

AU - Moghimi, Armin

AU - Pirasteh, Saied

AU - Feng, Sheng

AU - Hu, Keli

AU - Li, Jonathan

N1 - Publisher Copyright: © 2024 Elsevier B.V.

PY - 2024/11

Y1 - 2024/11

N2 - In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of PFN/ PFP value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average PFN/ PFP value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.

AB - In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of PFN/ PFP value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average PFN/ PFP value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.

KW - Building change map

KW - Building detection

KW - DAttResU-Net

KW - VHR-RSIs

UR - http://www.scopus.com/inward/record.url?scp=85203124427&partnerID=8YFLogxK

U2 - 10.1016/j.rsase.2024.101336

DO - 10.1016/j.rsase.2024.101336

M3 - Article

AN - SCOPUS:85203124427

VL - 36

JO - Remote Sensing Applications: Society and Environment

JF - Remote Sensing Applications: Society and Environment

M1 - 101336

ER -