English
 
Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Comparative performance analysis of simple U-Net, residual attention U-Net, and VGG16-U-Net for inventory inland water bodies

Authors
/persons/resource/ali

Ghaznavi,  Ali
1.4 Remote Sensing, 1.0 Geodesy, Departments, GFZ Publication Database, Deutsches GeoForschungsZentrum;

/persons/resource/saberioo

Saberioon,  Mohammadmehdi
1.4 Remote Sensing, 1.0 Geodesy, Departments, GFZ Publication Database, Deutsches GeoForschungsZentrum;

Brom,  Jakub
External Organizations;

/persons/resource/itzerott

Itzerott,  S.
1.4 Remote Sensing, 1.0 Geodesy, Departments, GFZ Publication Database, Deutsches GeoForschungsZentrum;

External Ressource
No external resources are shared
Fulltext (public)

5025126.pdf
(Publisher version), 3MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Ghaznavi, A., Saberioon, M., Brom, J., Itzerott, S. (2024): Comparative performance analysis of simple U-Net, residual attention U-Net, and VGG16-U-Net for inventory inland water bodies. - Applied Computing and Geosciences, 21, 100150.
https://doi.org/10.1016/j.acags.2023.100150


Cite as: https://gfzpublic.gfz-potsdam.de/pubman/item/item_5025126
Abstract
Inland water bodies play a vital role at all scales in the terrestrial water balance and Earth’s climate variability. Thus, an inventory of inland waters is crucially important for hydrologic and ecological studies and management. Therefore, the main aim of this study was to develop a deep learning-based method for inventorying and mapping inland water bodies using the RGB band of high-resolution satellite imagery automatically and accurately. The Sentinel-2 Harmonized dataset, together with ZABAGED-validated ground truth, was used as the main dataset for the model training step. Three different deep learning algorithms based on U-Net architecture were employed to segment inland waters, including a simple U-Net, Residual Attention U-Net, and VGG16-U-Net. All three algorithms were trained using a combination of Sentinel-2 visible bands (Red [B04; 665nm], Green [B03; 560nm], and Blue [B02; 490 nm]) at a 10-meter spatial resolution. The Residual Attention U-Net achieved the highest computational cost due to the increased number of trainable parameters. The VGG16-U-Net had the shortest run time and the lowest number of trainable parameters, attributed to its architecture compared to the simple and Residual Attention U-Net architectures, respectively. As a result, the VGG16-U-Net provided the best segmentation results with a mean-IoU score of 0.9850, a slight improvement compared to other proposed U-Net-based architectures. Although the accuracy of the model based on VGG16-U-Net does not make a difference from Residual Attention U-Net, the computation costs for training VGG16-U-Net were dramatically lower than Residual Attention U-Net.