Road segmentation in SAR/PolSAR images using Convolutional Neural Networks

Authors

  • Thomas P. Grandin Mackenzie Presbyterian University
  • Anderson A. de Borba Mackenzie Presbyterian University
  • Alejandro C. Frery Victoria University of Wellington
  • Maurício Marengoni Albion College

Abstract

Synthetic aperture radar (SAR) and polarimetric aperture radar (PolSAR) images are remote sensing images captured from an aircraft or artificial satellite to produce topological scans of the earth’s surface. The advantage of using these images is that they can be acquired day or night, regardless of the weather. They are used primarily to study deforestation, glacier defrosting, urban growth, and the prevention of natural hazards [3]. Neural networks have successfully solved problems in artificial intelligence (AI) in the last two decades. Among all possible architectures, the Convolution Neural Networks (CNNs) architecture dominates the field of computer vision [2]. CNN is very adaptable and can precisely segment objects in images. This article’s primary approach was to build a synthetic dataset that could be used for training, validation, and testing. Then, the Speckle noise, inherent noise in SAR images, was introduced into the Massachusetts Road Dataset, an aerial optical image database, to create a database that simulates SAR images. We use this database to study how different CNN architectures perform when trained to detect roads. Our research methodology consists of building a simulated dataset based on the optical Massachusetts Roads Dataset [4] by adding the Speckle noise with a Gamma law represented by a univariate probability density function fZ(z; μ, L) = LL/(Γ(L)μL) zL-1 exp {- (L/μ) z} 1R+(z), where L > 0, μ > 0 is the mean, 1A is the indicator function of the set A, and Γ(L) is the Gamma function. After this, we trained two CNN architectures: the U-Net architecture[5], which uses skip connections between the encoder and decoder blocks to transfer information, and the DeepLabV3[1] architecture, which uses Atrous Convolution, a technique that changes the kernel size to gather context from multiple scales. Fig. 1, and Fig. 2 show the results achieved until this research phase on an image chosen arbitrarily in the test dataset. By visual inspection of Fig. 1 and Fig. 2, and with results shown in Tab. 1, we can note that the U-net shows a slight advantage in road detection over DeepLabV3. We have ongoing work such as measuring road detection accuracy, adding real SAR images to the dataset, or using Generative Adversarial Networks (GANs) to transform optical images into SAR images. [...]

Downloads

Download data is not yet available.

References

S. Chen, X. Wei, and W. Zheng. “ASA-DRNet: An Improved Deeplabv3+ Framework for SAR Image Segmentation”. In: Electronics (2023).

S. Cong and Y. Zhou. “A review of convolutional neural network architectures and their optimizations”. In: Artificial Intelligence Review 56 (June 2022), pp. 1–65. doi: 10.1007/s10462-022-10213-5.

J. S. Lee and E. Pottier. Polarimetric Radar Imaging: From Basics to Applications. 1st ed. Boca Raton: CRC Press, 2009. isbn: 9781315219332.

V. Mnih. “Machine Learning for Aerial Image Labeling”. PhD thesis. University of Toronto, 2013.

O. Ronneberger, P. Fischer, and T. Brox. U-Net: Convolutional Networks for Biomedical Image Segmentation. 2015. arXiv: 1505.04597 [cs.CV]. url: https://arxiv.org/abs/1505.04597.

Downloads

Published

2026-02-13