Details
Original language | English |
---|---|
Pages (from-to) | 52067-52085 |
Number of pages | 19 |
Journal | IEEE ACCESS |
Volume | 12 |
Publication status | Published - 5 Apr 2024 |
Abstract
Accurate segmentation of river water in close-range Remote Sensing (RS) images is vital for efficient environmental monitoring and management. However, this task poses significant difficulties due to the dynamic nature of water, which exhibits varying colors and textures reflecting the sky and surrounding structures along the riverbanks. This study addresses these complexities by evaluating and comparing several well-known deep-learning (DL) techniques on four river scene datasets. To achieve this, we fine-tuned the recently introduced "Segment Anything Model" (SAM) along with popular DL segmentation models such as U-Net, DeepLabV3+, LinkNet, PSPNet, and PAN, all using ResNet50 pre-trained on ImageNet as a backbone. Experimental results highlight the diverse performances of these models in river water segmentation. Notably, fine-tuned SAM demonstrates superior performance, followed by U-Net(ResNet50), despite their higher computational costs. In contrast, PSPNet(ResNet50), while less effective, proves to be the most efficient in terms of execution time. In addition to these findings, we introduce a novel river water segmentation dataset, LuFI-RiverSnap.v1 (Dataset link), characterized by a more diverse range of scenes and accurate masks compared to existing datasets. To facilitate reproducible research in remote sensing and computer vision, we release the implementations of the fine-tuned SAM model (Code link). The findings from this research, coupled with the presented dataset and the accuracy achieved by fine-tuned SAM segmentation, can support tracking river changes, understanding river water level trends, and exploring river ecosystem dynamics. These can also provide valuable insights for practitioners and researchers seeking models tailored to specific image characteristics with practical means in disaster risk reduction, such as rapid assessments of inundations during floods or automatic extractions of gauge data in watersheds.
Keywords
- Biological system modeling, Cameras, Deep learning, DeepLabV3+, Feature extraction, Image segmentation, LinkNet, PAN, PSPNet, Residual neural networks, river water segmentation, Rivers, RiverSnap, Segment Anything Model (SAM), Task analysis, U-Net
Cite this
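A Comparative Performance Analysis of Popular Deep Learning Models and Segment Anything Model (SAM) for River Water Segmentation in Close-Range Remote Sensing Imagery. / Moghimi, Armin; Welzel, Mario; Celik, Turgay; Schlurmann, Torsten. In: IEEE ACCESS, Vol. 12, 05.04.2024, p. 52067-52085.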
Research output: Contribution to journal › Article › Research › peer review
TY - JOUR
T1 - A Comparative Performance Analysis of Popular Deep Learning Models and Segment Anything Model (SAM) for River Water Segmentation in Close-Range Remote Sensing Imagery
LA - English
AU - Moghimi, Armin
AU - Welzel, Mario
AU - Celik, Turgay
AU - Schlurmann, Torsten
PY - 2024/4/5
Y1 - 2024/4/5
KW - Biological system modeling
KW - Cameras
KW - Deep learning
KW - DeepLabV3+
KW - Feature extraction
KW - Image segmentation
KW - LinkNet
KW - PAN
KW - PSPNet
KW - Residual neural networks
KW - river water segmentation
KW - Rivers
KW - RiverSnap
KW - Segment Anything Model (SAM)
KW - Task analysis
KW - U-Net
UR - http://www.scopus.com/inward/record.url?scp=85189815510&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3385425
DO - 10.1109/ACCESS.2024.3385425
M3 - Article
AN - SCOPUS:85189815510
VL - 12
SP - 52067
EP - 52085
JO - IEEE ACCESS
JF - IEEE ACCESS
SN - 2169-3536
ER -